Skip to main content

Showing 1–50 of 153 results for author: Campos, D

.
  1. arXiv:2505.17722  [pdf, ps, other

    physics.bio-ph cond-mat.soft cond-mat.stat-mech nlin.AO

    A Magnetic-like description of Oscillatory Behavior in Chemotactic Ants

    Authors: Rosa Flaquer-Galmés, Daniel Campos, Javier Cristín

    Abstract: We investigate the role of chemotaxis in the movement dynamics of Aphaenogaster Senilis ants. To do so, we design an experimental setup in which individual ants are exposed to a narrow pheromone trail to guide their motion. As expected, ants locate and navigate the trail by detecting chemical scents, exhibiting a characteristic zigzag pattern, moving at a nearly constant speed while oscillating pe… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 9 pages, 4 figures

  2. arXiv:2505.00263  [pdf, other

    cs.IR cs.CL

    EnronQA: Towards Personalized RAG over Private Documents

    Authors: Michael J. Ryan, Danmei Xu, Chris Nivera, Daniel Campos

    Abstract: Retrieval Augmented Generation (RAG) has become one of the most popular methods for bringing knowledge-intensive context to large language models (LLM) because of its ability to bring local context at inference time without the cost or data leakage risks associated with fine-tuning. A clear separation of private information from the LLM training has made RAG the basis for many enterprise LLM workl… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: 26 pages, 4 figures, 6 tables

  3. arXiv:2504.15205  [pdf, other

    cs.CL cs.AI cs.IR

    Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges

    Authors: Nandan Thakur, Ronak Pradeep, Shivani Upadhyay, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: Retrieval-augmented generation (RAG) enables large language models (LLMs) to generate answers with citations from source documents containing "ground truth", thereby reducing system hallucinations. A crucial factor in RAG evaluation is "support", whether the information in the cited documents supports the answer. To this end, we conducted a large-scale comparative study of 45 participant submissio… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Accepted at SIGIR 2025 (short)

  4. arXiv:2504.15068  [pdf, other

    cs.IR cs.CL

    The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models

    Authors: Ronak Pradeep, Nandan Thakur, Shivani Upadhyay, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: Large Language Models (LLMs) have significantly enhanced the capabilities of information access systems, especially with retrieval-augmented generation (RAG). Nevertheless, the evaluation of RAG systems remains a barrier to continued progress, a challenge we tackle in this work by proposing an automatic evaluation framework that is validated against human annotations. We believe that the nugget ev… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: To appear in SIGIR 2025. Significant updates and revisions to arXiv:2411.09607

  5. arXiv:2504.14903  [pdf, other

    cs.IR

    ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring

    Authors: Kaili Huang, Thejas Venkatesh, Uma Dingankar, Antonio Mallia, Daniel Campos, Jian Jiao, Christopher Potts, Matei Zaharia, Kwabena Boahen, Omar Khattab, Saarthak Sarup, Keshav Santhanam

    Abstract: We study serving retrieval models, specifically late interaction models like ColBERT, to many concurrent users at once and under a small budget, in which the index may not fit in memory. We present ColBERT-serve, a novel serving system that applies a memory-mapping strategy to the ColBERT index, reducing RAM usage by 90% and permitting its deployment on cheap servers, and incorporates a multi-stag… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Accepted by ECIR 2025

  6. arXiv:2504.05851  [pdf, other

    cs.SE

    Identifying and Replicating Code Patterns Driving Performance Regressions in Software Systems

    Authors: Denivan Campos, Luana Martins, Emanuela Guglielmi, Michele Tucci, Daniele Di Pompeo, Simone Scalabrino, Vittorio Cortellessa, Dario Di Nucci, Rocco Oliveto

    Abstract: Context: Performance regressions negatively impact execution time and memory usage of software systems. Nevertheless, there is a lack of systematic methods to evaluate the effectiveness of performance test suites. Performance mutation testing, which introduces intentional defects (mutants) to measure and enhance fault-detection capabilities, is promising but underexplored. A key challenge is under… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 9 pages, 22nd International Conference on Mining Software Repositories (MSR) - Registered Reports

  7. arXiv:2502.21178  [pdf

    cond-mat.stat-mech

    Prospection and dispersal in metapopulations: a perspective from opinion dynamics models

    Authors: Daniela Molas, Daniel Campos

    Abstract: Dispersal is often used by living beings to gather information from conspecifics, integrating it with personal experience to guide decision-making. This mechanism has only recently been studied experimentally, facilitated by advancements in tracking animal groups over extended periods. Such studies enable the analysis of the adaptive dynamics underlying sequential decisions and collective choices.… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: 17 pages, 5 figures

  8. arXiv:2501.11524  [pdf, other

    astro-ph.SR astro-ph.HE

    GOTO065054+593624: a 8.5 mag amplitude dwarf nova identified in real time via Kilonova Seekers

    Authors: T. L. Killestein, G. Ramsay, M. Kennedy, L. Kelsey, D. Steeghs, S. Littlefair, B. Godson, J. Lyman, M. Pursiainen, B. Warwick, C. Krawczyk, L. K. Nuttall, E. Wickens, S. D. Alexandrov, C. M. da Silva, R. Leadbeater, K. Ackley, M. J. Dyer, F. Jiménez-Ibarra, K. Ulaczyk, D. K. Galloway, V. S. Dhillon, P. O'Brien, K. Noysena, R. Kotak , et al. (40 additional authors not shown)

    Abstract: Dwarf novae are astrophysical laboratories for probing the nature of accretion, binary mass transfer, and binary evolution -- yet their diverse observational characteristics continue to challenge our theoretical understanding. We here present the discovery of, and subsequent observing campaign on GOTO065054+593624 (hereafter GOTO0650), a dwarf nova of the WZ Sge type, discovered in real-time by ci… ▽ More

    Submitted 8 May, 2025; v1 submitted 20 January, 2025; originally announced January 2025.

    Comments: 14 pages, 15 figures. Accepted to A&A

  9. arXiv:2412.14581  [pdf, other

    cs.CL

    CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation

    Authors: Youngwon Lee, Seung-won Hwang, Daniel Campos, Filip Graliński, Zhewei Yao, Yuxiong He

    Abstract: With the adoption of retrieval-augmented generation (RAG), large language models (LLMs) are expected to ground their generation to the retrieved contexts. Yet, this is hindered by position bias of LLMs, failing to evenly attend to all contexts. Previous work has addressed this by synthesizing contexts with perturbed positions of gold segment, creating a position-diversified train set. We extend th… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

  10. arXiv:2412.10684  [pdf, other

    cs.CL

    Inference Scaling for Bridging Retrieval and Augmented Generation

    Authors: Youngwon Lee, Seung-won Hwang, Daniel Campos, Filip Graliński, Zhewei Yao, Yuxiong He

    Abstract: Retrieval-augmented generation (RAG) has emerged as a popular approach to steering the output of a large language model (LLM) by incorporating retrieved contexts as inputs. However, existing work observed the generator bias, such that improving the retrieval results may negatively affect the outcome. In this work, we show such bias can be mitigated, from inference scaling, aggregating inference ca… ▽ More

    Submitted 14 December, 2024; originally announced December 2024.

  11. arXiv:2412.04506  [pdf, other

    cs.CL cs.IR cs.LG

    Arctic-Embed 2.0: Multilingual Retrieval Without Compromise

    Authors: Puxuan Yu, Luke Merrick, Gaurav Nuti, Daniel Campos

    Abstract: This paper presents the training methodology of Arctic-Embed 2.0, a set of open-source text embedding models built for accurate and efficient multilingual retrieval. While prior works have suffered from degraded English retrieval quality, Arctic-Embed 2.0 delivers competitive retrieval quality on multilingual and English-only benchmarks, and supports Matryoshka Representation Learning (MRL) for ef… ▽ More

    Submitted 13 December, 2024; v1 submitted 3 December, 2024; originally announced December 2024.

    Comments: 10 pages, 5 figures, 3 tables

  12. arXiv:2411.09607  [pdf, other

    cs.IR cs.CL

    Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework

    Authors: Ronak Pradeep, Nandan Thakur, Shivani Upadhyay, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: This report provides an initial look at partial results from the TREC 2024 Retrieval-Augmented Generation (RAG) Track. We have identified RAG evaluation as a barrier to continued progress in information access (and more broadly, natural language processing and artificial intelligence), and it is our hope that we can contribute to tackling the many challenges in this space. The central hypothesis w… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

  13. arXiv:2411.08275  [pdf, other

    cs.IR cs.CL

    A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look

    Authors: Shivani Upadhyay, Ronak Pradeep, Nandan Thakur, Daniel Campos, Nick Craswell, Ian Soboroff, Hoa Trang Dang, Jimmy Lin

    Abstract: The application of large language models to provide relevance assessments presents exciting opportunities to advance information retrieval, natural language processing, and beyond, but to date many unknowns remain. This paper reports on the results of a large-scale evaluation (the TREC 2024 RAG Track) where four different relevance assessment approaches were deployed in situ: the "standard" fully… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  14. arXiv:2411.04975  [pdf, ps, other

    cs.CL cs.AI cs.DC cs.LG

    SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications

    Authors: Gabriele Oliaro, Zhihao Jia, Daniel Campos, Aurick Qiao

    Abstract: Speculative decoding is widely adopted to reduce latency in large language model (LLM) inference by leveraging smaller draft models capable of handling diverse user tasks. However, emerging AI applications, such as LLM-based agents, present unique workload characteristics: instead of diverse independent requests, agentic frameworks typically submit repetitive inference requests, such as multi-agen… ▽ More

    Submitted 2 June, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

  15. arXiv:2411.03724  [pdf, other

    cs.CV

    Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage

    Authors: Claus D. Hansen, Thuy Hai Le, David Campos

    Abstract: This paper examines the use of computer vision algorithms to estimate aspects of the psychosocial work environment using CCTV footage. We present a proof of concept for a methodology that detects and tracks people in video footage and estimates interactions between customers and employees by estimating their poses and calculating the duration of their encounters. We propose a pipeline that combine… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: 11 pages, 9 figures, presented at IWOAR 9th International Workshop on Sensor-Based Activity Recognition and Artificial Intelligence, September 26-27, Potsdam, Germany

  16. arXiv:2410.16562  [pdf

    cs.CY

    Vernacularizing Taxonomies of Harm is Essential for Operationalizing Holistic AI Safety

    Authors: Wm. Matthew Kennedy, Daniel Vargas Campos

    Abstract: Operationalizing AI ethics and safety principles and frameworks is essential to realizing the potential benefits and mitigating potential harms caused by AI systems. To that end, actors across industry, academia, and regulatory bodies have created formal taxonomies of harm to support operationalization efforts. These include novel holistic methods that go beyond exclusive reliance on technical ben… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted to the Proceedings of the Conference on AI Ethics and Society (AIES), 2024

  17. arXiv:2409.06211  [pdf, other

    cs.LG cs.CL

    STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning

    Authors: Jaeseong Lee, seung-won hwang, Aurick Qiao, Daniel F Campos, Zhewei Yao, Yuxiong He

    Abstract: Mixture-of-experts (MoEs) have been adopted for reducing inference costs by sparsely activating experts in Large language models (LLMs). Despite this reduction, the massive number of experts in MoEs still makes them expensive to serve. In this paper, we study how to address this, by pruning MoEs. Among pruning methodologies, unstructured pruning has been known to achieve the highest performance fo… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

  18. arXiv:2409.02694  [pdf, ps, other

    gr-qc astro-ph.HE hep-th

    Gravitational Surface Tension as the Origin for the Black Hole Entropy

    Authors: S. D. Campos, R. H. Longaresi

    Abstract: In this work, we explore the thermodynamics of black holes using the Gouy-Stodola theorem, traditionally applied to mechanical systems relating entropy production to the difference between reversible and irreversible work. We model black holes as gravitational bubbles with surface tension defined at the event horizon, deriving the Bekenstein-Hawking entropy relation for non-rotating black holes. O… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: 16 pages, no figures

  19. arXiv:2408.08467  [pdf, other

    eess.SY

    A New Control Law for TS Fuzzy Models: Less Conservative LMI Conditions by Using Membership Functions Derivative

    Authors: Leonardo Amaral Mozelli, Victor Costa da Silva Campos

    Abstract: This note proposes a new type of Parallel Distributed Controller (PDC) for Takagi-Sugeno (TS) fuzzy models. Our idea consists of using two control terms based on state feedback, one composed of a convex combination of linear gains weighted by the normalized membership grade, as in traditional PDC, and the other composed of linear gains weighted by the time-derivatives of the membership functions.… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 20 pages, 4 figures

    MSC Class: 93C42; 93D15; 37B25

  20. arXiv:2407.16086  [pdf, ps, other

    math.PR

    Itô's Formula for Itô processes defined with respect to a cylindrical-martingale valued measure

    Authors: Santiago Cambronero, David Campos, C. A. Fonseca-Mora, Darío Mena

    Abstract: Using the theory of stochastic integration developed recently by the authors, in this paper we prove an Itô formula for Hilbert space-valued Itô processes defined with respect to a cylindrical-martingale valued measure. As part of our study, we develop some tools from stochastic analysis as are the predictable and optional quadratic variation of the stochastic integral, the continuous and purely d… ▽ More

    Submitted 15 December, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    MSC Class: 60H05; 60H15; 60B11; 60G48

  21. Managing O-RAN Networks: xApp Development from Zero to Hero

    Authors: Joao F. Santos, Alexandre Huff, Daniel Campos, Kleber V. Cardoso, Cristiano B. Both, Luiz A. DaSilva

    Abstract: The Open Radio Access Network (O-RAN) Alliance proposes an open architecture that disaggregates the RAN and supports executing custom control logic in near-real time from third-party applications, the xApps. Despite O-RAN's efforts, the creation of xApps remains a complex and time-consuming endeavor, aggravated by the sometimes fragmented, outdated, or deprecated documentation from the O-RAN Softw… ▽ More

    Submitted 5 February, 2025; v1 submitted 12 July, 2024; originally announced July 2024.

  22. arXiv:2406.16828  [pdf, other

    cs.IR cs.AI cs.CL

    Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track

    Authors: Ronak Pradeep, Nandan Thakur, Sahel Sharifymoghaddam, Eric Zhang, Ryan Nguyen, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: Did you try out the new Bing Search? Or maybe you fiddled around with Google AI~Overviews? These might sound familiar because the modern-day search stack has recently evolved to include retrieval-augmented generation (RAG) systems. They allow searching and incorporating real-time data into large language models (LLMs) to provide a well-informed, attributed, concise summary in contrast to the tradi… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  23. arXiv:2405.12366  [pdf, ps, other

    quant-ph cond-mat.mtrl-sci cond-mat.other

    Mimicking Negative Mass Properties

    Authors: S. D. Campos

    Abstract: In the present work, one analyzes two systems trying to obtain physical conditions where some properties attributed to negative mass can be mimicked by positive mass particles. The first one is the well-known 1/2-spin system described by the Dirac equation in the presence of an external electromagnetic field. Assuming some physical restrictions, one obtains that the use of $e\rightarrow-e$ can lea… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 12 pages, no figures

  24. arXiv:2405.07929  [pdf, ps, other

    math.AP math.CV math.FA

    On the Existence and Smoothness of the Navier-Stokes Equation I

    Authors: Brian David Vasquez Campos

    Abstract: In this paper, we give a sufficient condition to guarantee the existence of a smooth solution of the Navier-Stokes Equation with the nice decreasing properties at infinity. In this way, we prove the existence of smooth physically reasonable solutions to the Navier-Stokes problem. Additionally, we show the existence of a smooth curve of entire vector fields of order 2 that extends the solution to t… ▽ More

    Submitted 6 December, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  25. arXiv:2405.07767  [pdf, other

    cs.IR cs.AI

    Synthetic Test Collections for Retrieval Evaluation

    Authors: Hossein A. Rahmani, Nick Craswell, Emine Yilmaz, Bhaskar Mitra, Daniel Campos

    Abstract: Test collections play a vital role in evaluation of information retrieval (IR) systems. Obtaining a diverse set of user queries for test collection construction can be challenging, and acquiring relevance judgments, which indicate the appropriateness of retrieved documents to a query, is often costly and resource-intensive. Generating synthetic datasets using Large Language Models (LLMs) has recen… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: SIGIR 2024

  26. arXiv:2405.05374  [pdf, other

    cs.CL cs.AI cs.IR

    Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models

    Authors: Luke Merrick, Danmei Xu, Gaurav Nuti, Daniel Campos

    Abstract: This report describes the training dataset creation and recipe behind the family of \texttt{arctic-embed} text embedding models (a set of five models ranging from 22 to 334 million parameters with weights open-sourced under an Apache-2 license). At the time of their release, each model achieved state-of-the-art retrieval accuracy for models of their size on the MTEB Retrieval leaderboard, with the… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 17 pages, 11 Figures, 9 tables

  27. QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models -- Extended Version

    Authors: David Campos, Bin Yang, Tung Kieu, Miao Zhang, Chenjuan Guo, Christian S. Jensen

    Abstract: We are witnessing an increasing availability of streaming data that may contain valuable information on the underlying processes. It is thus attractive to be able to deploy machine learning models on edge devices near sensors such that decisions can be made instantaneously, rather than first having to transmit incoming data to servers. To enable deployment on edge devices with limited storage and… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 15 pages. An extended version of "QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models" accepted at PVLDB 2024

    Journal ref: Proceedings of the VLDB Endowment, 17, 11 (2024), 2708-2721

  28. A Semi-Lagrangian Approach for Time and Energy Path Planning Optimization in Static Flow Fields

    Authors: Víctor C. da S. Campos, Armando A. Neto, Douglas G. Macharet

    Abstract: Efficient path planning for autonomous mobile robots is a critical problem across numerous domains, where optimizing both time and energy consumption is paramount. This paper introduces a novel methodology that considers the dynamic influence of an environmental flow field and considers geometric constraints, including obstacles and forbidden zones, enriching the complexity of the planning problem… ▽ More

    Submitted 13 March, 2025; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 50 pages, accepted manuscript; Preprint submitted to Journal of the Franklin Institute (accepted manuscript)

  29. The Active Asteroids Citizen Science Program: Overview and First Results

    Authors: Colin Orion Chandler, Chadwick A. Trujillo, William J. Oldroyd, Jay K. Kueny, William A. Burris, Henry H. Hsieh, Jarod A. DeSpain, Nima Sedaghat, Scott S. Sheppard, Kennedy A. Farrell, David E. Trilling, Annika Gustafsson, Mark Jesus Mendoza Magbanua, Michele T. Mazzucato, Milton K. D. Bosch, Tiffany Shaw-Diaz, Virgilio Gonano, Al Lamperti, José A. da Silva Campos, Brian L. Goodwin, Ivan A. Terentev, Charles J. A. Dukes, Sam Deen

    Abstract: We present the Citizen Science program Active Asteroids and describe discoveries stemming from our ongoing project. Our NASA Partner program is hosted on the Zooniverse online platform and launched on 2021 August 31, with the goal of engaging the community in the search for active asteroids -- asteroids with comet-like tails or comae. We also set out to identify other unusual active solar system o… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 35 pages, 5 figures, 3 tables

  30. arXiv:2401.16849  [pdf, other

    cond-mat.stat-mech

    Intermittent random walks under stochastic resetting

    Authors: Rosa Flaquer-Galmés, Daniel Campos, Vicenç Méndez

    Abstract: We analyze a one-dimensional intermittent random walk on an unbounded domain in the presence of stochastic resetting. In this process, the walker alternates between local intensive search, diffusion, and rapid ballistic relocations in which it does not react to the target. We demonstrate that Poissonian resetting leads to the existence of a non-equilibrium steady state. We calculate the distributi… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  31. arXiv:2401.16295  [pdf, ps, other

    math.SP math.OA

    A Class of Matrix Schrödinger Bispectral Operators

    Authors: Brian D. Vasquez Campos

    Abstract: We prove the bispectrality of some class of matrix Schrödinger operators with polynomial potentials that satisfy a second-order matrix autonomous differential equation. The physical equation is constructed using the formal theory of the Laurent series and after that obtaining local solutions using estimations in the Frobenius norm. Furthermore, the characterization of the algebra of polynomial eig… ▽ More

    Submitted 1 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    MSC Class: 35A35 47Fxx 16Dxx

  32. arXiv:2401.01125  [pdf, other

    cond-mat.stat-mech

    First-passage time of a Brownian searcher with stochastic resetting to random positions

    Authors: Vicenç Mendez, Rosa Flaquer-Galmés, Daniel Campos

    Abstract: We study the effect of a resetting point randomly distributed around the origin on the mean first passage time of a Brownian searcher moving in one dimension. We compare the search efficiency with that corresponding to reset to the origin and find that the mean first passage time of the latter can be larger or smaller than the distributed case, depending on whether the resetting points are symmetr… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 13 pages, 15 figures

    MSC Class: 60G50; 60G05

  33. arXiv:2311.18647  [pdf, other

    hep-ex astro-ph.CO physics.ins-det

    Long-term temporal stability of the DarkSide-50 dark matter detector

    Authors: The DarkSide-50 Collaboration, :, P. Agnes, I. F. M. Albuquerque, T. Alexander, A. K. Alton, M. Ave, H. O. Back, G. Batignani, K. Biery, V. Bocci, W. M. Bonivento, B. Bottino, S. Bussino, M. Cadeddu, M. Cadoni, F. Calaprice, A. Caminata, M. D. Campos, N. Canci, M. Caravati, N. Cargioli, M. Cariello, M. Carlini, V. Cataudella , et al. (121 additional authors not shown)

    Abstract: The stability of a dark matter detector on the timescale of a few years is a key requirement due to the large exposure needed to achieve a competitive sensitivity. It is especially crucial to enable the detector to potentially detect any annual event rate modulation, an expected dark matter signature. In this work, we present the performance history of the DarkSide-50 dual-phase argon time project… ▽ More

    Submitted 22 May, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 13 pages, 5 figures

    Journal ref: JINST 19 P05057 (2024)

  34. arXiv:2311.17828  [pdf, other

    cond-mat.stat-mech

    Dynamic redundancy as a mechanism to optimize collective random searches

    Authors: Daniel Campos, Vicenç Méndez

    Abstract: We explore the case of a group of random walkers looking for a target randomly located in space, such that the number of walkers is not constant but new ones can join the search, or those that are active can abandon it, with constant rates $r_b$ and $r_d$, respectively. Exact analytical solutions are provided both for the fastest-first-passage time and for the collective search time required to re… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 15 pages, 6 figures

  35. arXiv:2311.07861  [pdf, other

    cs.IR cs.AI

    Overview of the TREC 2023 Product Product Search Track

    Authors: Daniel Campos, Surya Kallumadi, Corby Rosset, Cheng Xiang Zhai, Alessandro Magnani

    Abstract: This is the first year of the TREC Product search track. The focus this year was the creation of a reusable collection and evaluation of the impact of the use of metadata and multi-modal data on retrieval accuracy. This year we leverage the new product search corpus, which includes contextual metadata. Our analysis shows that in the product search domain, traditional retrieval systems are highly e… ▽ More

    Submitted 15 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 14 pages, 4 figures, 11 tables - TREC 2023

  36. arXiv:2308.10914  [pdf, ps, other

    math.FA math.CA

    Riesz spaces of signed charges on semi-rings

    Authors: Santiago Cambronero, David Campos, C. A. Fonseca-Mora, Darío Mena

    Abstract: A constructive definition of the supremum of a family of set functions is exploited in the context of Riesz spaces of signed measures and finitely additive functions (signed charges) on semi-rings. We explore applications, particularly to establish a Jordan decomposition for signed charges on semi-rings, whether the structure of Riesz space is present or not.

    Submitted 25 November, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

    MSC Class: 28A10; 28B05

  37. arXiv:2308.10374  [pdf, ps, other

    math.PR math.FA

    Cylindrical Martingale-Valued Measures, Stochastic Integration and SPDEs

    Authors: Santiago Cambronero, David Campos, C. A. Fonseca-Mora, Darío Mena

    Abstract: We develop a theory of Hilbert-space valued stochastic integration with respect to cylindrical martingale-valued measures. As part of our construction, we expand the concept of quadratic variation, introduced by Veraar and Yaroslavtsev (2016), to the case of cylindrical martingale-valued measures that are allowed to have discontinuous paths (this is carried out within the context of separable Bana… ▽ More

    Submitted 21 November, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: This is an updated version with new examples and remarks

    MSC Class: 60H05; 60H15; 60B11; 60G48

  38. arXiv:2308.01281  [pdf, other

    hep-th gr-qc math-ph

    Boundary conditions and infrared divergences

    Authors: Lissa de Souza Campos, Claudio Dappiaggi, Luca Sinibaldi

    Abstract: We review the procedure to construct quasi-free ground states, for real scalar fields whose dynamics is dictated by the Klein-Gordon equation, on standard static Lorentzian manifolds with a time-like boundary. We observe that, depending on the assigned boundary condition of Robin type, this procedure does not always lead to the existence of a suitable bi-distribution… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 11 pages, 2 figures

  39. arXiv:2307.14369  [pdf, ps, other

    cond-mat.stat-mech gr-qc hep-th

    On Negative Mass, Partition Function and Entropy

    Authors: S. D. Campos

    Abstract: This work examines some aspects related to the existence of negative mass. The requirement for the partition function to converge leads to two distinct approaches. Initially, convergence is achieved by assuming a negative absolute temperature, which results in an imaginary partition function and complex entropy. Subsequently, convergence is maintained by keeping the absolute temperature positive w… ▽ More

    Submitted 14 November, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: A version of this manuscript was accepted for publication in Mod. Phys. Lett. A

  40. Search for dark matter annual modulation with DarkSide-50

    Authors: The DarkSide-50 Collaboration, :, P. Agnes, I. F. M. Albuquerque, T. Alexander, A. K. Alton, M. Ave, H. O. Back, G. Batignani, K. Biery, V. Bocci, W. M. Bonivento, B. Bottino, S. Bussino, M. Cadeddu, M. Cadoni, F. Calaprice, A. Caminata, M. D. Campos, N. Canci, M. Caravati, N. Cargioli, M. Cariello, M. Carlini, V. Cataudella , et al. (121 additional authors not shown)

    Abstract: Dark matter induced event rate in an Earth-based detector is predicted to show an annual modulation as a result of the Earth's orbital motion around the Sun. We searched for this modulation signature using the ionization signal of the DarkSide-50 liquid argon time projection chamber. No significant signature compatible with dark matter is observed in the electron recoil equivalent energy range abo… ▽ More

    Submitted 22 November, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 10 pages, 5 figures

    Journal ref: Phys.Rev.D110, 102006 (2024)

  41. arXiv:2305.03431  [pdf, other

    cs.SE

    Hearing the voice of experts: Unveiling Stack Exchange communities' knowledge of test smells

    Authors: Luana Martins, Denivan Campos, Railana Santana, Joselito Mota Junior, Heitor Costa, Ivan Machado

    Abstract: Refactorings are transformations to improve the code design without changing overall functionality and observable behavior. During the refactoring process of smelly test code, practitioners may struggle to identify refactoring candidates and define and apply corrective strategies. This paper reports on an empirical study aimed at understanding how test smells and test refactorings are discussed on… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Preprint of the manuscript accepted for publication at CHASE 2023

  42. arXiv:2304.03401  [pdf, other

    cs.IR cs.AI cs.CL

    Noise-Robust Dense Retrieval via Contrastive Alignment Post Training

    Authors: Daniel Campos, ChengXiang Zhai, Alessandro Magnani

    Abstract: The success of contextual word representations and advances in neural information retrieval have made dense vector-based retrieval a standard approach for passage and document ranking. While effective and efficient, dual-encoders are brittle to variations in query distributions and noisy queries. Data augmentation can make models more robust but introduces overhead to training set generation and r… ▽ More

    Submitted 10 April, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 8 pages, 6 figures, 30 tables

  43. arXiv:2304.02721  [pdf, other

    cs.CL cs.AI

    To Asymmetry and Beyond: Structured Pruning of Sequence to Sequence Models for Improved Inference Efficiency

    Authors: Daniel Campos, ChengXiang Zhai

    Abstract: Sequence-to-sequence language models can be used to produce abstractive summaries which are coherent, relevant, and concise. Still, model sizes can make deployment in latency-sensitive or web-scale implementations difficult. This paper studies the relationship between model size, structured pruning, inference efficiency, and summarization accuracy on widely used summarization datasets. We show tha… ▽ More

    Submitted 12 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: SustaiNLP2023 @ ACL 2023,9 pages, 6 figures, 33 tables

  44. arXiv:2304.01016  [pdf, other

    cs.CL cs.AI cs.IR

    Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders

    Authors: Daniel Campos, Alessandro Magnani, ChengXiang Zhai

    Abstract: In this paper, we consider the problem of improving the inference latency of language model-based dense retrieval systems by introducing structural compression and model size asymmetry between the context and query encoders. First, we investigate the impact of pre and post-training compression on the MSMARCO, Natural Questions, TriviaQA, SQUAD, and SCIFACT, finding that asymmetry in the dual encod… ▽ More

    Submitted 1 June, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: SustaiNLP2023 @ ACL 2023, 8 pages, 4 figures, 30 tables

  45. arXiv:2304.00114  [pdf, other

    cs.IR cs.AI cs.CL

    Dense Sparse Retrieval: Using Sparse Language Models for Inference Efficient Dense Retrieval

    Authors: Daniel Campos, ChengXiang Zhai

    Abstract: Vector-based retrieval systems have become a common staple for academic and industrial search applications because they provide a simple and scalable way of extending the search to leverage contextual representations for documents and queries. As these vector-based systems rely on contextual language models, their usage commonly requires GPUs, which can be expensive and difficult to manage. Given… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

  46. arXiv:2303.17612  [pdf, other

    cs.CL cs.AI cs.LG

    oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes

    Authors: Daniel Campos, Alexandre Marques, Mark Kurtz, ChengXiang Zhai

    Abstract: In this paper, we introduce the range of oBERTa language models, an easy-to-use set of language models which allows Natural Language Processing (NLP) practitioners to obtain between 3.8 and 24.3 times faster models without expertise in model compression. Specifically, oBERTa extends existing work on pruning, knowledge distillation, and quantization and leverages frozen embeddings improves distilla… ▽ More

    Submitted 6 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: SustaiNLP2023 @ ACL 2023,9 pages, 2 figures, 45 tables

  47. arXiv:2302.12721  [pdf, other

    cs.LG cs.DB

    LightTS: Lightweight Time Series Classification with Adaptive Ensemble Distillation -- Extended Version

    Authors: David Campos, Miao Zhang, Bin Yang, Tung Kieu, Chenjuan Guo, Christian S. Jensen

    Abstract: Due to the sweeping digitalization of processes, increasingly vast amounts of time series data are being produced. Accurate classification of such time series facilitates decision making in multiple domains. State-of-the-art classification accuracy is often achieved by ensemble learning where results are synthesized from multiple base models. This characteristic implies that ensemble learning need… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 15 pages. An extended version of "LightTS: Lightweight Time Series Classification with Adaptive Ensemble Distillation" accepted at SIGMOD 2023

    Journal ref: Proceedings of the ACM on Management of Data 1, 2 (2023), 171:1-171:27

  48. Search for low mass dark matter in DarkSide-50: the bayesian network approach

    Authors: The DarkSide-50 Collaboration, :, P. Agnes, I. F. M. Albuquerque, T. Alexander, A. K. Alton, M. Ave, H. O. Back, G. Batignani, K. Biery, V. Bocci, W. M. Bonivento, B. Bottino, S. Bussino, M. Cadeddu, M. Cadoni, F. Calaprice, A. Caminata, M. D. Campos, N. Canci, M. Caravati, N. Cargioli, M. Cariello, M. Carlini, V. Cataudella , et al. (119 additional authors not shown)

    Abstract: We present a novel approach for the search of dark matter in the DarkSide-50 experiment, relying on Bayesian Networks. This method incorporates the detector response model into the likelihood function, explicitly maintaining the connection with the quantity of interest. No assumptions about the linearity of the problem or the shape of the probability distribution functions are required, and there… ▽ More

    Submitted 26 April, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: 24 pages, 12 figures, 1 table

    Journal ref: Eur. Phys. J. C 83, 322 (2023)

  49. arXiv:2211.15927  [pdf, ps, other

    cs.CL cs.LG

    Compressing Cross-Lingual Multi-Task Models at Qualtrics

    Authors: Daniel Campos, Daniel Perry, Samir Joshi, Yashmeet Gambhir, Wei Du, Zhengzheng Xing, Aaron Colak

    Abstract: Experience management is an emerging business area where organizations focus on understanding the feedback of customers and employees in order to improve their end-to-end experiences. This results in a unique set of machine learning problems to help understand how people feel, discover issues they care about, and find which actions need to be taken on data that are different in content and distrib… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: accepted to IAAI-23 (part of AAAI-23)

    ACM Class: I.2.7

  50. arXiv:2210.05755  [pdf

    physics.ed-ph

    Uma proposta metodologica para a aprendizagem: reflexao sobre as praticas pedagogicas da Estatistica ao elaborar os instrumentos de pesquisa sociais

    Authors: Manoel Benedito Nirdo da Silva Campos

    Abstract: Presents a differentiated teaching proposal that allows the student to be the agent in the construction of knowledge, overcoming the difficulties that Mathematics presents. Aiming to understand how the use of statistical tools can contribute to the improvement of the teaching-learning process and the construction of statistical knowledge, studied with students from the University Campus of Rondono… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: in Portuguese language. A methodological proposal for learning: reflection on the pedagogical practices of Statistics when developing social research instruments. Teaching statistics, Mathematical modeling, Pedagogical practices in higher education, Educational technology. Ensino de estatisticas, Modelagem matematica, Práticas pedagogicas no ensino superior, Tecnologia educacional

    MSC Class: 62-01

    Journal ref: Para de Minas, MG : Virtual Books, 2022. 164p