Showing 1–13 of 13 results for author: García, S

  1. arXiv:2504.07515  [pdf, other]

    astro-ph.IM stat.AP

    Sequential Filtering Techniques for Simultaneous Tracking and Parameter Estimation

    Authors: Yannick Sztamfater Garcia, Joaquin Miguez, Manuel Sanjurjo-Rivo

    Abstract: The number of resident space objects is rising at an alarming rate. Mega-constellations and breakup events are proliferating in most orbital regimes, and safe navigation is becoming increasingly problematic. It is important to be able to track RSOs accurately and at an affordable computational cost. Orbital dynamics are highly nonlinear, and current operational methods assume Gaussian representati…

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: 28 pages, 9 figures. Submitted to the Journal of Astronautical Sciences on 26 March, 2025

  2. arXiv:2403.02432  [pdf, other]

    stat.ML cs.LG math.OC

    On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation

    Authors: Joaquín Sánchez García

    Abstract: We study a new technique for understanding convergence of learning agents under small modifications of data. We show that such convergence can be understood via an analogue of Fatou's lemma which yields gamma-convergence. We show its relevance and applications in general machine learning tasks and domain adaptation transfer learning.

    Submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2312.17553  [pdf, other]

    cs.CV stat.ML

    A Fully Automated Pipeline Using Swin Transformers for Deep Learning-Based Blood Segmentation on Head CT Scans After Aneurysmal Subarachnoid Hemorrhage

    Authors: Sergio Garcia Garcia, Santiago Cepeda, Ignacio Arrese, Rosario Sarabia

    Abstract: Background: Accurate volumetric assessment of spontaneous subarachnoid hemorrhage (SAH) is a labor-intensive task performed with current manual and semiautomatic methods that might be relevant for its clinical and prognostic implications. In the present research, we sought to develop and validate an artificial intelligence-driven, fully automated blood segmentation tool for SAH patients via noncon…

    Submitted 29 December, 2023; originally announced December 2023.

  4. arXiv:2003.02601  [pdf, other]

    cs.LG stat.ML

    Fuzzy k-Nearest Neighbors with monotonicity constraints: Moving towards the robustness of monotonic noise

    Authors: Sergio González, Salvador García, Sheng-Tun Li, Robert John, Francisco Herrera

    Abstract: This paper proposes a new model based on Fuzzy k-Nearest Neighbors for classification with monotonic constraints, Monotonic Fuzzy k-NN (MonFkNN). Real-life data-sets often do not comply with monotonic constraints due to class noise. MonFkNN incorporates a new calculation of fuzzy memberships, which increases robustness against monotonic noise without the need for relabeling. Our proposal has been…

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted in Neurocomputing

  5. Recent Trends in the Use of Statistical Tests for Comparing Swarm and Evolutionary Computing Algorithms: Practical Guidelines and a Critical Review

    Authors: J. Carrasco, S. García, M. M. Rueda, S. Das, F. Herrera

    Abstract: A key aspect of the design of evolutionary and swarm intelligence algorithms is studying their performance. Statistical comparisons are also a crucial part which allows for reliable conclusions to be drawn. In the present paper we gather and examine the approaches taken from different perspectives to summarise the assumptions made by these statistical tests, the conclusions reached and the steps f…

    Submitted 21 February, 2020; originally announced February 2020.

    Comments: 52 pages, 10 figures, 19 tables

    Journal ref: SWEVO, Volume 54, May 2020, 100665

  6. arXiv:2001.05759  [pdf, other]

    cs.LG cs.DC stat.ML

    Smart Data driven Decision Trees Ensemble Methodology for Imbalanced Big Data

    Authors: Diego García-Gil, Salvador García, Ning Xiong, Francisco Herrera

    Abstract: Differences in data size per class, also known as imbalanced data distribution, have become a common problem affecting data quality. Big Data scenarios pose a new challenge to traditional imbalanced classification algorithms, since they are not prepared to work with such amounts of data. Split data strategies and lack of data in the minority class due to the use of the MapReduce paradigm have posed new…

    Submitted 3 September, 2021; v1 submitted 16 January, 2020; originally announced January 2020.

  7. arXiv:1812.05944  [pdf, other]

    cs.LG stat.ML

    A Tutorial on Distance Metric Learning: Mathematical Foundations, Algorithms, Experimental Analysis, Prospects and Challenges (with Appendices on Mathematical Background and Detailed Algorithms Explanation)

    Authors: Juan Luis Suárez-Díaz, Salvador García, Francisco Herrera

    Abstract: Distance metric learning is a branch of machine learning that aims to learn distances from the data, which enhances the performance of similarity-based algorithms. This tutorial provides a theoretical background and foundations on this topic and a comprehensive experimental analysis of the most-known algorithms. We start by describing the distance metric learning problem and its main mathematical…

    Submitted 19 August, 2020; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: 36 pages with appendices

  8. A snapshot on nonstandard supervised learning problems: taxonomy, relationships and methods

    Authors: David Charte, Francisco Charte, Salvador García, Francisco Herrera

    Abstract: Machine learning is a field which studies how machines can alter and adapt their behavior, improving their actions according to the information they are given. This field is subdivided into multiple areas, among which the best known are supervised learning (e.g. classification and regression) and unsupervised learning (e.g. clustering and association rules). Within supervised learning, most stud…

    Submitted 29 November, 2018; originally announced November 2018.

    MSC Class: 68T05; 68T10

    Journal ref: Charte, D., Charte, F., García, S. et al. Prog Artif Intell (2018)

  9. arXiv:1810.09733  [pdf, ps, other]

    cs.LG stat.ML

    OCAPIS: R package for Ordinal Classification And Preprocessing In Scala

    Authors: M. Cristina Heredia-Gómez, Salvador García, Pedro Antonio Gutiérrez, Francisco Herrera

    Abstract: Ordinal Data are those where a natural order exists between the labels. The classification and pre-processing of this type of data are attracting more and more interest in the area of machine learning, due to its presence in many common problems. Traditionally, ordinal classification problems have been approached as nominal problems. However, that implies not taking into account their natural order…

    Submitted 17 March, 2019; v1 submitted 23 October, 2018; originally announced October 2018.

    Comments: 16 pages

  10. arXiv:1810.06021  [pdf, ps, other]

    cs.DB cs.LG stat.ML

    DPASF: A Flink Library for Streaming Data preprocessing

    Authors: Alejandro Alcalde-Barros, Diego García-Gil, Salvador García, Francisco Herrera

    Abstract: Data preprocessing techniques are devoted to correcting or alleviating errors in data. Discretization and feature selection are two of the most extended data preprocessing techniques. Although we can find many proposals for static Big Data preprocessing, there is little research devoted to the continuous Big Data problem. Apache Flink is a recent and novel Big Data framework, following the MapReduce pa…

    Submitted 14 October, 2018; originally announced October 2018.

    Comments: 19 pages

  11. arXiv:1804.05774  [pdf, ps, other]

    cs.LG stat.ML

    BELIEF: A distance-based redundancy-proof feature selection method for Big Data

    Authors: Sergio Ramírez-Gallego, Salvador García, Ning Xiong, Francisco Herrera

    Abstract: With the advent of the Big Data era, data reduction methods are in high demand given their ability to simplify huge data and ease complex learning processes. Concretely, algorithms that are able to filter relevant dimensions from a set of millions are of huge importance. Although effective, these techniques suffer from the "scalability" curse as well. In this work, we propose a distributed feature w…

    Submitted 16 April, 2018; originally announced April 2018.

    Comments: 30 pages, 6 figures

  12. arXiv:1501.04222  [pdf, ps, other]

    stat.CO

    JavaNPST: Nonparametric Statistical Tests in Java

    Authors: J. Derrac, S. García, F. Herrera

    Abstract: Nonparametric statistical tests are useful procedures that can be applied in a wide range of situations, such as testing randomness or goodness of fit, one-sample, two-sample and multiple-sample analysis, association between bivariate samples or count data analysis. Their use is often preferred to parametric tests due to the fact that they require less restrictive assumptions about the population…

    Submitted 17 January, 2015; originally announced January 2015.

    Comments: 19 pages, 1 figure. Statistical Software Library for JAVA

  13. arXiv:1106.5834  [pdf, ps, other]

    math.ST stat.AP

    A method for generating realistic correlation matrices

    Authors: Johanna Hardin, Stephan Ramon Garcia, David Golan

    Abstract: Simulating sample correlation matrices is important in many areas of statistics. Approaches such as generating Gaussian data and finding their sample correlation matrix or generating random uniform $[-1,1]$ deviates as pairwise correlations both have drawbacks. We develop an algorithm for adding noise, in a highly controlled manner, to general correlation matrices. In many instances, our method yi…

    Submitted 6 December, 2013; v1 submitted 28 June, 2011; originally announced June 2011.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org) at http://dx.doi.org/10.1214/13-AOAS638

    Report number: IMS-AOAS-AOAS638

    Journal ref: Annals of Applied Statistics 2013, Vol. 7, No. 3, 1733-1762