Data Science (DS)
Browse by
Recent Submissions

From habitat to management: a simulation framework for improving statistical methods in fisheries science
(20210707)Monte Carlo simulation consists of computer experiments that involve creating data by pseudorandom sampling and has shown to be a powerful tool for studying the performance of statistical methods. In this thesis Monte ... 
A Review on Outlier/Anomaly Detection in Time Series Data
(2021)Recent advances in technology have brought major breakthroughs in data collection, enabling a large amount of data to be gathered over time and thus generating time series. Mining this data has become an important task for ... 
Water leak detection using selfsupervised time series classification
(2021)Leaks in water distribution networks cause a loss of water that needs to be com pensated to ensure a continuous supply for all customers. This compensation is achieved by increasing the flow of the network, which entails ... 
A cheap feature selection approach for the K means algorithm
(202105)The increase in the number of features that need to be analyzed in a wide variety of areas, such as genome sequencing, computer vision or sensor networks, represents a challenge for the Kmeans algorithm. In this regard, ... 
Statistical model for reproducibility in rankingbased feature selection
(20201105)The stability of feature subset selection algorithms has become crucial in realworld problems due to the need for consistent experimental results across different replicates. Specifically, in this paper, we analyze the ... 
On the symmetry of the Quadratic Assignment Problem through Elementary Landscape Decomposition
(202107)When designing metaheuristic strategies to optimize the quadratic assignment problem (QAP), it is important to take into account the specific characteristics of the instance to be solved. One of the characteristics that ... 
Exploring Gaps in DeepFool inSearch of More Effective Adversarial Perturbations
(2021)Adversarial examples are inputs subtly perturbed to produce a wrong prediction in machine learning models, while remaining perceptually similar to the original input. To find adversarial examples, some attack strategies ... 
Delineation of site‐specific management zones using estimation of distribution algorithms
(2021)In this paper, we present a novel methodology to solve the problem of delineating homogeneous sitespecific management zones (SSMZ) in agricultural fields. This problem consists of dividing the field into small regions for ... 
On the fair comparison of optimization algorithms in different machines
(2021)An experimental comparison of two or more optimization algorithms requires the same computational resources to be assigned to each algorithm. When a maximum runtime is set as the stopping criterion, all algorithms need to ... 
Efficient metaheuristics for spacecraft trajectory optimization
(202103)Metaheuristics has a long tradition in computer science. During the past few years, different types of metaheuristics, specially evolutionary algorithms got noticeable attention in dealing with realworld optimization ... 
Algorithms for Large Orienteering Problems
(202101)In this thesis, we have developed algorithms to solve largescale Orienteering Problems. The Orienteering Problem is a combinatorial optimization problem were given a weighted complete graph with vertex profits and a maximum ... 
A simulation framework for orbit propagation and space trajectory visualization
(2021)In this paper, an interactive tool for simulation of satellites dynamics and autonomous spacecraft guidance is presented. Different geopotential models for orbit propagation of Earthorbiting satellites are provided, which ... 
Minimax Classification with 01 Loss and Performance Guarantees
(20201201)Supervised classification techniques use training samples to find classification rules with small expected 01 loss. Conventional methods achieve efficient learning and outofsample generalization by minimizing surrogate ... 
Analysis of the sensitivity of the EndOfTurn Detection task to errors generated by the Automatic Speech Recognition process.
(2021)An EndOfTurn Detection Module (EOTDM) is an essential component of au tomatic Spoken Dialogue Systems. The capability of correctly detecting whether a user’s utterance has ended or not improves the accuracy in interpreting ... 
Probabilistic Load Forecasting Based on Adaptive Online Learning
(2020)Load forecasting is crucial for multiple energy management tasks such as scheduling generation capacity, planning supply and demand, and minimizing energy trade costs. Such relevance has increased even more in recent ... 
Identifying common treatments from Electronic Health Records with missing information. An application to breast cancer.
(20201229)The aim of this paper is to analyze the sequence of actions in the health system associated with a particular disease. In order to do that, using Electronic Health Records, we define a general methodology that allows us ... 
Migration in MultiPopulation Differential Evolution for Many Objective Optimization
(2020)The paper proposes a novel extension of many objective optimization using differential evolution (MaODE). MaODE solves a many objective optimization (MaOO) problem by parallel optimization of individual objectives. MaODE ... 
QLearning Induced Artificial Bee Colony for Noisy Optimization
(2020)The paper proposes a novel approach to adaptive selection of sample size for a trial solution of an evolutionary algorithm when noise of unknown distribution contaminates the objective surface. The sample size of a solution ... 
A Machine Learning Approach to Predict Healthcare Cost of Breast Cancer Patients
(2021)This paper presents a novel machine learning approach to per form an early prediction of the healthcare cost of breast cancer patients. The learning phase of our prediction method considers the following two steps: i) in ... 
Journey to the center of the linear ordering problem
(202006)A number of local search based algorithms have been designed to escape from the local optima, such as, iterated local search or variable neighborhood search. The neighborhood chosen for the local search as well as the ...