Skip to main content

Showing 1–31 of 31 results for author: Soto, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.00930  [pdf, ps, other

    cs.DS cs.DM

    Inverse matroid optimization under subset constraints

    Authors: Kristóf Bérczi, Lydia Mirabel Mendoza-Cadena, José Soto

    Abstract: In the Inverse Matroid problem, we are given a matroid, a fixed basis $B$, and an initial weight function, and the goal is to minimally modify the weights -- measured by some function -- so that $B$ becomes a maximum-weight basis. The problem arises naturally in settings where one wishes to explain or enforce a given solution by minimally perturbing the input. We extend this classical problem by… ▽ More

    Submitted 2 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

    Comments: 20 pages

  2. arXiv:2505.18978  [pdf, other

    cs.CL

    AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models

    Authors: Miguel Angel Peñaloza Perez, Bruno Lopez Orozco, Jesus Tadeo Cruz Soto, Michelle Bruno Hernandez, Miguel Angel Alvarado Gonzalez, Sandra Malagon

    Abstract: Existing mathematical reasoning benchmarks are predominantly English only or translation-based, which can introduce semantic drift and mask languagespecific reasoning errors. To address this, we present AI4Math, a benchmark of 105 original university level math problems natively authored in Spanish. The dataset spans seven advanced domains (Algebra, Calculus, Geometry, Probability, Number Theory,… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 36 pages, 5 figures

    MSC Class: 68 ACM Class: I.2

  3. arXiv:2504.08646  [pdf, other

    cs.CV cs.RO

    MBE-ARI: A Multimodal Dataset Mapping Bi-directional Engagement in Animal-Robot Interaction

    Authors: Ian Noronha, Advait Prasad Jawaji, Juan Camilo Soto, Jiajun An, Yan Gu, Upinder Kaur

    Abstract: Animal-robot interaction (ARI) remains an unexplored challenge in robotics, as robots struggle to interpret the complex, multimodal communication cues of animals, such as body language, movement, and vocalizations. Unlike human-robot interaction, which benefits from established datasets and frameworks, animal-robot interaction lacks the foundational resources needed to facilitate meaningful bidire… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: Accepted to ICRA 2025

  4. arXiv:2411.12069  [pdf, ps, other

    cs.DS

    Matroid Secretary via Labeling Schemes

    Authors: Kristóf Bérczi, Vasilis Livanos, José Soto, Victor Verdugo

    Abstract: The Matroid Secretary Problem (MSP) is one of the most prominent settings for online resource allocation and optimal stopping. A decision-maker is presented with a ground set of elements $E$ revealed sequentially and in random order. Upon arrival, an irrevocable decision is made in a take-it-or-leave-it fashion, subject to a feasibility constraint on the set of selected elements captured by a matr… ▽ More

    Submitted 30 May, 2025; v1 submitted 18 November, 2024; originally announced November 2024.

    Comments: 33 pages, 3 figures

  5. arXiv:2410.12756  [pdf, ps, other

    cs.GT cs.DS

    Prophet Upper Bounds for Online Matching and Auctions

    Authors: José Soto, Victor Verdugo

    Abstract: In the online 2-bounded auction problem, we have a collection of items represented as nodes in a graph and bundles of size two represented by edges. Agents are presented sequentially, each with a random weight function over the bundles. The goal of the decision-maker is to find an allocation of bundles to agents of maximum weight so that every item is assigned at most once, i.e., the solution is a… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  6. arXiv:2405.17467  [pdf, other

    cs.LG cs.NE

    Sports center customer segmentation: a case study

    Authors: Juan Soto, Ramón Carmenaty, Miguel Lastra, Juan M. Fernández-Luna, José M. Benítez

    Abstract: Customer segmentation is a fundamental process to develop effective marketing strategies, personalize customer experience and boost their retention and loyalty. This problem has been widely addressed in the scientific literature, yet no definitive solution for every case is available. A specific case study characterized by several individualizing features is thoroughly analyzed and discussed in th… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2404.17214  [pdf, other

    cs.DS

    Set Selection with Uncertain Weights: Non-Adaptive Queries and Thresholds

    Authors: Christoph Dürr, Arturo Merino, José A. Soto, José Verschae

    Abstract: We study set selection problems where the weights are uncertain. Instead of its exact weight, only an uncertainty interval containing its true weight is available for each element. In some cases, some solutions are universally optimal; i.e., they are optimal for every weight that lies within the uncertainty intervals. However, it may be that no universal optimal solution exists, unless we are reve… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  8. arXiv:2311.00890  [pdf, other

    cs.DS

    Online Combinatorial Assignment in Independence Systems

    Authors: Javier Marinkovic, José A. Soto, Victor Verdugo

    Abstract: We consider an online multi-weighted generalization of several classic online optimization problems, called the online combinatorial assignment problem. We are given an independence system over a ground set of elements and agents that arrive online one by one. Upon arrival, each agent reveals a weight function over the elements of the ground set. If the independence system is given by the matching… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 31 pages

  9. arXiv:2306.11832  [pdf, other

    cs.IR cs.CL

    QuOTeS: Query-Oriented Technical Summarization

    Authors: Juan Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Axel J. Soto, Flavia P. Zanoto, Evangelos Milios

    Abstract: Abstract. When writing an academic paper, researchers often spend considerable time reviewing and summarizing papers to extract relevant citations and data to compose the Introduction and Related Work sections. To address this problem, we propose QuOTeS, an interactive system designed to retrieve sentences related to a summary of the research from a collection of potential references and hence ass… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted at ICDAR 2023

  10. A Survey on Transactional Stream Processing

    Authors: Shuhao Zhang, Juan Soto, Volker Markl

    Abstract: Transactional stream processing (TSP) strives to create a cohesive model that merges the advantages of both transactional and stream-oriented guarantees. Over the past decade, numerous endeavors have contributed to the evolution of TSP solutions, uncovering similarities and distinctions among them. Despite these advances, a universally accepted standard approach for integrating transactional funct… ▽ More

    Submitted 31 May, 2023; v1 submitted 21 August, 2022; originally announced August 2022.

    ACM Class: H.2.4; A.1

    Journal ref: The VLDB Journal 33, 451- 479 (2024)

  11. arXiv:2111.02234  [pdf, other

    cs.DS

    Approximation Algorithms for Vertex-Connectivity Augmentation on the Cycle

    Authors: Waldo Gálvez, Francisco Sanhueza-Matamala, José A. Soto

    Abstract: Given a $k$-vertex-connected graph $G$ and a set $S$ of extra edges (links), the goal of the $k$-vertex-connectivity augmentation problem is to find a set $S' \subseteq S$ of minimum size such that adding $S'$ to $G$ makes it $(k+1)$-vertex-connected. Unlike the edge-connectivity augmentation problem, research for the vertex-connectivity version has been sparse. In this work we present the first… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: Accepted at The 19th International Workshop on Approximation and Online Algorithms (WAOA 2021)

    MSC Class: 68W25 ACM Class: F.2.2

  12. arXiv:2109.06264  [pdf, other

    cs.CL

    Post-OCR Document Correction with large Ensembles of Character Sequence-to-Sequence Models

    Authors: Juan Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Evangelos Milios, Axel J. Soto

    Abstract: In this paper, we propose a novel method based on character sequence-to-sequence models to correct documents already processed with Optical Character Recognition (OCR) systems. The main contribution of this paper is a set of strategies to accurately process strings much longer than the ones used to train the sequence model while being sample- and resource-efficient, supported by thorough experimen… ▽ More

    Submitted 24 January, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

  13. arXiv:2104.02604  [pdf, other

    q-bio.BM cs.LG q-bio.QM

    Using Molecular Embeddings in QSAR Modeling: Does it Make a Difference?

    Authors: María Virginia Sabando, Ignacio Ponzoni, Evangelos E. Milios, Axel J. Soto

    Abstract: With the consolidation of deep learning in drug discovery, several novel algorithms for learning molecular representations have been proposed. Despite the interest of the community in developing new methods for learning molecular embeddings and their theoretical benefits, comparing molecular embeddings with each other and with traditional representations is not straightforward, which in turn hinde… ▽ More

    Submitted 28 July, 2021; v1 submitted 20 March, 2021; originally announced April 2021.

    Journal ref: Briefings in Bioinformatics, Volume 23, Issue 1, January 2022, bbab365

  14. arXiv:2011.06516  [pdf, other

    cs.GT cs.DS

    Sample-driven optimal stopping: From the secretary problem to the i.i.d. prophet inequality

    Authors: José Correa, Andrés Cristi, Boris Epstein, José Soto

    Abstract: We take a unifying approach to single selection optimal stopping problems with random arrival order and independent sampling of items. In the problem we consider, a decision maker (DM) initially gets to sample each of $N$ items independently with probability $p$, and can observe the relative rankings of these sampled items. Then, the DM faces the remaining items in an online fashion, observing the… ▽ More

    Submitted 9 August, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 44 pages, 1 figure

  15. arXiv:2008.13150  [pdf, other

    cs.GR

    ChemVA: Interactive Visual Analysis of Chemical Compound Similarity in Virtual Screening

    Authors: María Virginia Sabando, Pavol Ulbrich, Matías Selzer, Jan Byška, Jan Mičan, Ignacio Ponzoni, Axel J. Soto, María Luján Ganuza, Barbora Kozlíková

    Abstract: In the modern drug discovery process, medicinal chemists deal with the complexity of analysis of large ensembles of candidate molecules. Computational tools, such as dimensionality reduction (DR) and classification, are commonly used to efficiently process the multidimensional space of features. These underlying calculations often hinder interpretability of results and prevent experts from assessi… ▽ More

    Submitted 30 August, 2020; originally announced August 2020.

    Comments: Accepted for the IEEE VIS 2020 conference

  16. arXiv:2001.00155  [pdf

    eess.SP cs.LG stat.ML

    DeepBeat: A multi-task deep learning approach to assess signal quality and arrhythmia detection in wearable devices

    Authors: Jessica Torres Soto, Euan Ashley

    Abstract: Wearable devices enable theoretically continuous, longitudinal monitoring of physiological measurements like step count, energy expenditure, and heart rate. Although the classification of abnormal cardiac rhythms such as atrial fibrillation from wearable devices has great potential, commercial algorithms remain proprietary and tend to focus on heart rate variability derived from green spectrum LED… ▽ More

    Submitted 25 January, 2020; v1 submitted 1 January, 2020; originally announced January 2020.

  17. arXiv:1907.06001  [pdf, other

    cs.DS cs.GT

    The Two-Sided Game of Googol and Sample-Based Prophet Inequalities

    Authors: José Correa, Andrés Cristi, Boris Epstein, José A. Soto

    Abstract: The secretary problem or the game of Googol are classic models for online selection problems that have received significant attention in the last five decades. We consider a variant of the problem and explore its connections to data-driven online selection. Specifically, we are given $n$ cards with arbitrary non-negative numbers written on both sides. The cards are randomly placed on $n$ consecuti… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

  18. arXiv:1904.11668  [pdf, other

    cs.DS cs.DM

    The minimum cost query problem on matroids with uncertainty areas

    Authors: Arturo I. Merino, José A. Soto

    Abstract: We study the minimum weight basis problem on matroid when elements' weights are uncertain. For each element we only know a set of possible values (an uncertainty area) that contains its real weight. In some cases there exist bases that are uniformly optimal, that is, they are minimum weight bases for every possible weight function obeying the uncertainty areas. In other cases, computing such a bas… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: 20 pages. A preliminary version appears in ICALP 2019

  19. arXiv:1802.01997  [pdf, ps, other

    cs.DS

    Strong Algorithms for the Ordinal Matroid Secretary Problem

    Authors: José A. Soto, Abner Turkieltaub, Victor Verdugo

    Abstract: In the ordinal Matroid Secretary Problem (MSP), elements from a weighted matroid are presented in random order to an algorithm that must incrementally select a large weight independent set. However, the algorithm can only compare pairs of revealed elements without using its numerical value. An algorithm is $α$ probability-competitive if every element from the optimum appears with probability… ▽ More

    Submitted 6 February, 2018; originally announced February 2018.

    Comments: A preliminary version appeared at ACM-SIAM SODA 18

  20. arXiv:1705.06631  [pdf, other

    cs.DM math.CO

    Robust randomized matchings

    Authors: Jannik Matuschke, Martin Skutella, José A. Soto

    Abstract: The following game is played on a weighted graph: Alice selects a matching $M$ and Bob selects a number $k$. Alice's payoff is the ratio of the weight of the $k$ heaviest edges of $M$ to the maximum weight of a matching of size at most $k$. If $M$ guarantees a payoff of at least $α$ then it is called $α$-robust. In 2002, Hassin and Rubinstein gave an algorithm that returns a $1/\sqrt{2}$-robust ma… ▽ More

    Submitted 18 May, 2017; originally announced May 2017.

  21. arXiv:1702.01596  [pdf

    cs.DB

    A Survey of State Management in Big Data Processing Systems

    Authors: Quoc-Cuong To, Juan Soto, Volker Markl

    Abstract: State management and its use in diverse applications varies widely across big data processing systems. This is evident in both the research literature and existing systems, such as Apache Flink, Apache Samza, Apache Spark, and Apache Storm. Given the pivotal role that state management plays in various use cases, in this survey, we present some of the most important uses of state as an enabler, dis… ▽ More

    Submitted 1 August, 2018; v1 submitted 6 February, 2017; originally announced February 2017.

    Comments: 2 pages

  22. arXiv:1612.01829  [pdf, ps, other

    cs.DS

    Symmetry exploitation for Online Machine Covering with Bounded Migration

    Authors: Waldo Gálvez, José A. Soto, José Verschae

    Abstract: Online models that allow recourse are highly effective in situations where classical models are too pessimistic. One such problem is the online machine covering problem on identical machines. In this setting, jobs arrive one by one and must be assigned to machines with the objective of maximizing the minimum machine load. When a job arrives, we are allowed to reassign some jobs as long as their to… ▽ More

    Submitted 28 August, 2018; v1 submitted 6 December, 2016; originally announced December 2016.

    Comments: 26 pages, 3 figures; full version of ESA 2018 paper

    ACM Class: F.2.2

  23. arXiv:1411.2311  [pdf, other

    cs.CG cs.DS

    Independent sets and hitting sets of bicolored rectangular families

    Authors: José A. Soto, Claudio Telha

    Abstract: A bicolored rectangular family BRF is a collection of all axis-parallel rectangles contained in a given region Z of the plane formed by selecting a bottom-left corner from a set A and an upper-right corner from a set B. We prove that the maximum independent set and the minimum hitting set of a BRF have the same cardinality and devise polynomial time algorithms to compute both. As a direct conseque… ▽ More

    Submitted 9 November, 2014; originally announced November 2014.

    Comments: 36 pages, A preliminary version of this work appeared in IPCO 2011 under the name "Jump Number of Two-Directional Orthogonal Ray Graphs"

  24. arXiv:1310.1896  [pdf, other

    cs.DS cs.DM math.CO

    TSP Tours in Cubic Graphs: Beyond 4/3

    Authors: José R. Correa, Omar Larré, José A. Soto

    Abstract: After a sequence of improvements Boyd, Sitters, van der Ster, and Stougie proved that any 2-connected graph whose n vertices have degree 3, i.e., a cubic 2-connected graph, has a Hamiltonian tour of length at most (4/3)n, establishing in particular that the integrality gap of the subtour LP is at most 4/3 for cubic 2-connected graphs and matching the conjectured value of the famous 4/3 conjecture.… ▽ More

    Submitted 7 October, 2013; originally announced October 2013.

    Comments: 23 pages. A preliminary version appeared in ESA 2012

  25. arXiv:1309.6659  [pdf, other

    cs.CG

    Independent and Hitting Sets of Rectangles Intersecting a Diagonal Line : Algorithms and Complexity

    Authors: José R. Correa, Laurent Feuilloley, Pablo Pérez-Lantero, José A. Soto

    Abstract: Finding a maximum independent set (MIS) of a given fam- ily of axis-parallel rectangles is a basic problem in computational geom- etry and combinatorics. This problem has attracted significant atten- tion since the sixties, when Wegner conjectured that the corresponding duality gap, i.e., the maximum possible ratio between the maximum independent set and the minimum hitting set (MHS), is bounded b… ▽ More

    Submitted 3 January, 2014; v1 submitted 25 September, 2013; originally announced September 2013.

    Comments: 26 pages

  26. arXiv:1207.1333  [pdf, other

    cs.DS cs.DM cs.GT

    Advances on Matroid Secretary Problems: Free Order Model and Laminar Case

    Authors: Patrick Jaillet, José A. Soto, Rico Zenklusen

    Abstract: The most well-known conjecture in the context of matroid secretary problems claims the existence of a constant-factor approximation applicable to any matroid. Whereas this conjecture remains open, modified forms of it were shown to be true, when assuming that the assignment of weights to the secretaries is not adversarial but uniformly random (Soto [SODA 2011], Oveis Gharan and Vondrák [ESA 2011])… ▽ More

    Submitted 23 June, 2014; v1 submitted 5 July, 2012; originally announced July 2012.

  27. arXiv:1105.0474  [pdf, other

    math.CO cs.DM

    Generalizations and Variants of the Largest Non-crossing Matching Problem in Random Bipartite Graphs

    Authors: Marcos Kiwi, José A. Soto

    Abstract: We are interested in the statistics of the length of the longest increasing subsequence of 2-rowed lexicographically sorted arrays chosen according to distinct families of distributions D = (D_n)_n, and when n goes to infinity. This framework encompasses well studied problems such as the so called Longest Increasing Subsequence problem, the Longest Common Subsequence problem, problems concerning d… ▽ More

    Submitted 3 May, 2011; originally announced May 2011.

    Comments: 32 pages, 5 figures

    ACM Class: G.2.1

  28. arXiv:1102.3491  [pdf, ps, other

    cs.DS

    A simple PTAS for Weighted Matroid Matching on Strongly Base Orderable Matroids

    Authors: José A. Soto

    Abstract: We give a simple polynomial time approximation scheme for the weighted matroid matching problem on strongly base orderable matroids. We also show that even the unweighted version of this problem is NP-complete and not in oracle-coNP.

    Submitted 16 February, 2011; originally announced February 2011.

    Comments: 8 pages, 3 figures. To appear in LAGOS 2011

  29. arXiv:1007.2152  [pdf, ps, other

    cs.DS

    Matroid Secretary Problem in the Random Assignment Model

    Authors: José A. Soto

    Abstract: In the Matroid Secretary Problem, introduced by Babaioff et al. [SODA 2007], the elements of a given matroid are presented to an online algorithm in random order. When an element is revealed, the algorithm learns its weight and decides whether or not to select it under the restriction that the selected elements form an independent set in the matroid. The objective is to maximize the total weight… ▽ More

    Submitted 13 July, 2010; originally announced July 2010.

    Comments: 16 pages. Submitted to SODA 2011

  30. Symmetric Submodular Function Minimization Under Hereditary Family Constraints

    Authors: Michel X. Goemans, José A. Soto

    Abstract: We present an efficient algorithm to find non-empty minimizers of a symmetric submodular function over any family of sets closed under inclusion. This for example includes families defined by a cardinality constraint, a knapsack constraint, a matroid independence constraint, or any combination of such constraints. Our algorithm make $O(n^3)$ oracle calls to the submodular function where $n$ is the… ▽ More

    Submitted 13 July, 2010; originally announced July 2010.

    Comments: 13 pages, Submitted to SODA 2011

    Journal ref: SIAM J. Discrete Math., 27(2), 1123--1145. 2013

  31. arXiv:0910.0504  [pdf, other

    cs.DS

    Improved Analysis of a Max Cut Algorithm Based on Spectral Partitioning

    Authors: José Soto

    Abstract: Trevisan [SICOMP 2012] presented an algorithm for Max-Cut based on spectral partitioning techniques. This is the first algorithm for Max-Cut with an approximation guarantee strictly larger than 1/2 that is not based on semidefinite programming. Trevisan showed that its approximation ratio is of at least 0.531. In this paper we improve this bound up to 0.614247. We also define and extend this resul… ▽ More

    Submitted 2 December, 2014; v1 submitted 5 October, 2009; originally announced October 2009.

    Comments: 9 pages, 2 figures. Final version