Search
Now showing items 1-10 of 38
Efficient Learning of Minimax Risk Classifiers in High Dimensions
(2023-08-01)
High-dimensional data is common in multiple areas, such as health care and genomics, where the
number of features can be tens of thousands. In
such scenarios, the large number of features often leads to inefficient ...
Learning the progression patterns of treatments using a probabilistic generative model
(2022-12-15)
Modeling a disease or the treatment of a patient has drawn much attention in recent years due to the vast amount of information that Electronic Health Records contain. This paper presents a probabilistic generative model ...
Implementing the Cumulative Difference Plot in the IOHanalyzer
(2022-07)
The IOHanalyzer is a web-based framework that enables an easy visualization and comparison of the quality of stochastic optimization algorithms. IOHanalyzer offers several graphical and statistical tools analyze the results ...
An active adaptation strategy for streaming time series classification based on elastic similarity measures
(2022-05-21)
In streaming time series classification problems, the goal is to predict the label associated to the most recently received observations over the stream according to a set of categorized reference patterns. In on-line ...
Generalized Maximum Entropy for Supervised Classification
(2022-04)
The maximum entropy principle advocates to
evaluate events’ probabilities using a distribution that maximizes
entropy among those that satisfy certain expectations’ constraints. Such principle can be generalized for ...
Rank aggregation for non-stationary data streams
(2022)
The problem of learning over non-stationary ranking streams arises naturally, particularly in recommender systems. The rankings represent the preferences of a population, and the non-stationarity means that the distribution ...
On the relative value of weak information of supervision for learning generative models: An empirical study
(2022)
Weakly supervised learning is aimed to learn predictive models from partially supervised data, an easy-to-collect alternative to the costly standard full supervision. During the last decade, the research community has ...
LASSO for streaming data with adaptative filtering
(2022)
Streaming data is ubiquitous in modern machine learning, and so the development of scalable algorithms to analyze this sort of information is a topic of current interest. On the other hand, the problem of l1-penalized ...
Are the statistical tests the best way to deal with the biomarker selection problem?
(2022)
Statistical tests are a powerful set of tools when applied correctly, but unfortunately the extended misuse of them has caused great concern. Among many other applications, they are used in the detection of biomarkers so ...
On the use of the descriptive variable for enhancing the aggregation of crowdsourced labels
(2022)
The use of crowdsourcing for annotating data has become a popular and cheap alternative to expert labelling. As a consequence, an aggregation task is required to combine the different labels provided and agree on a single ...