Data Science (DS)
Recent submissions

Early classification of time series using multiobjective optimization techniques
(Information Sciences, 2019-04-23) In early classification of time series, the objective is to build models that can make class predictions for time series as accurately and as early as possible, when only part of the series is available. It is ...
Mallows and generalized Mallows model for matchings
(Bernoulli, 2019-02-25) The Mallows and Generalized Mallows Models are two of the most popular probability models for distributions on permutations. In this paper, we consider both models under the Hamming distance. These models can be seen as ...
perm_mateda: A Matlab toolbox of estimation of distribution algorithms for permutation-based combinatorial optimization problems
(ACM Transactions on Mathematical Software, 2018) Permutation problems are combinatorial optimization problems whose solutions are naturally codified as permutations. Due to their complexity, motivated principally by the factorial cardinality of the search space of ...
Aggregated outputs by linear models: An application on marine litter beaching prediction
(Information Sciences, 2019-01-01) In regression, a predictive model that can anticipate the output of a new case is learnt from a set of previous examples. The output or response value of the examples used for model training is known. When learning ...
A beta-binomial mixed-effects model approach for analysing longitudinal discrete and bounded outcomes
(Biometrical Journal, 2018-05) Patient-reported outcomes (PROs) are increasingly used as primary outcome measures in observational and experimental studies since they inform clinicians and researchers about the health status of patients ...
Hybridizing Cartesian Genetic Programming and Harmony Search for Adaptive Feature Construction in Supervised Learning Problems
(Applied Soft Computing, 2017-02-28) The advent of the so-called Big Data paradigm has motivated a flurry of research aimed at enhancing machine learning models by following very diverse approaches. In this context this work focuses on the automatic con ...
A statistical framework for radiation dose estimation with uncertainty quantification from the γ-H2AX assay
(PLoS ONE, 2018-11-28) Over the last decade, the γ-H2AX focus assay, which exploits the phosphorylation of the H2AX histone following DNA double-strand breaks, has made considerable progress towards acceptance as a reliable biomarker for exposure ...
On the relevance of preprocessing in predictive maintenance for dynamic systems
(Predictive Maintenance in Dynamic Systems, 2018) The complexity involved in real-time, data-driven monitoring of dynamic systems for predictive maintenance is usually huge. To a greater or lesser degree, any data-driven approach is sensitive to data preprocessing, ...
Calibration Model Maintenance in Melamine Resin Production: Integrating Drift Detection, Smart Sample Selection and Model Adaptation
(Analytica Chimica Acta, 2018) The physicochemical properties of Melamine Formaldehyde (MF) based thermosets are largely influenced by the degree of polymerization (DP) in the underlying resin. Online supervision of the turbidity point by means of ...
Spatiotemporal information coupling in network navigation
(IEEE Transactions on Information Theory, 2018-12) Network navigation, encompassing both spatial and temporal cooperation to locate mobile agents, is a key enabler for numerous emerging location-based applications. In such cooperative networks, the positional information ...
Crowd Learning with Candidate Labeling: an EM-based Solution
(Conference of the Spanish Association for Artificial Intelligence, 2018-09-27) Crowdsourcing is widely used nowadays in machine learning for data labeling. Although in the traditional case annotators are asked to provide a single label for each instance, novel approaches allow annotators, in case ...
A review on distance based time series classification
(Data Mining and Knowledge Discovery, 2018-11-01) Time series classification is a growing research topic due to the vast amount of time series data being created across a wide variety of fields. The particularity of the data makes it a challenging task and ...
Detection of Sand Dunes on Mars Using a Regular Vine-based Classification Approach
(Knowledge-Based Systems, 2018-08) This paper deals with the problem of detecting sand dunes from remotely sensed images of the surface of Mars. We build on previous approaches that propose methods to extract informative features for the classification of ...
Distance-based exponential probability models on constrained combinatorial optimization problems
(GECCO 2018 Companion - Proceedings of the 2018 Genetic and Evolutionary Computation Conference Companion, 2018-08-30) Estimation of distribution algorithms have already demonstrated their utility when solving a broad range of combinatorial problems. However, there is still room for methodological improvements when approaching constrained ...
Bayesian inference for algorithm ranking analysis
(GECCO 2018 Companion - Proceedings of the 2018 Genetic and Evolutionary Computation Conference Companion, 6 July 2018, Pages 324-325, 2018-08-30) The statistical assessment of the empirical comparison of algorithms is an essential step in heuristic optimization. Classically, researchers have relied on the use of statistical tests. However, recently, concerns about ...
Spacecraft Trajectory Optimization: A review of Models, Objectives, Approaches and Solutions
(Progress in Aerospace Sciences, 2018) This article is a survey of methods for solving spacecraft trajectory optimization problems. The solving process is decomposed into four key steps of mathematical modeling of the problem, defining the objective functions, ...
A note on Poisson goodness-of-fit tests for ionizing radiation induced chromosomal aberration samples
(International Journal of Radiation Biology, 2018-06-13) Purpose: To present Poisson exact goodness-of-fit tests as alternatives and complements to the asymptotic u-test, which is the most widely used in cytogenetic biodosimetry, to decide whether a sample of chromosomal aberrations ...
Bayesian nonparametric inference for the covariate-adjusted ROC curve
(2018-05-30) Accurate diagnosis of disease is of fundamental importance in clinical practice and medical research. Before a medical diagnostic test is routinely used in practice, its ability to distinguish between diseased and non-diseased ...
Dicentric dose estimates for patients undergoing radiotherapy in the RTGene study to assess blood dosimetric models and the new Bayesian method for gradient exposure
(2018-06-01) The RTGene study was focused on the development and validation of new transcriptional biomarkers for prediction of individual radiotherapy (RT) patient responses to ionising radiation (IR). In parallel, for validation ...
A note on the behavior of majority voting in multiclass domains with biased annotators
(IEEE Transactions on Knowledge and Data Engineering, 2018-05) Majority voting is a popular and robust strategy to aggregate different opinions in learning from crowds, where each worker labels examples according to their own criteria. Although it has been extensively studied in the ...