Skip to main content

Showing 1–50 of 219,461 results for author: A.

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.21552  [pdf, ps, other

    cs.CV cs.AI cs.LG cs.MM cs.RO

    Whole-Body Conditioned Egocentric Video Prediction

    Authors: Yutong Bai, Danny Tran, Amir Bar, Yann LeCun, Trevor Darrell, Jitendra Malik

    Abstract: We train models to Predict Ego-centric Video from human Actions (PEVA), given the past video and an action represented by the relative 3D body pose. By conditioning on kinematic pose trajectories, structured by the joint hierarchy of the body, our model learns to simulate how physical human actions shape the environment from a first-person point of view. We train an auto-regressive conditional dif… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Project Page: https://dannytran123.github.io/PEVA

  2. arXiv:2506.21549  [pdf, ps, other

    cs.CV

    SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark

    Authors: Alex Costanzino, Pierluigi Zama Ramirez, Luigi Lella, Matteo Ragaglia, Alessandro Oliva, Giuseppe Lisanti, Luigi Di Stefano

    Abstract: We propose SiM3D, the first benchmark considering the integration of multiview and multimodal information for comprehensive 3D anomaly detection and segmentation (ADS), where the task is to produce a voxel-based Anomaly Volume. Moreover, SiM3D focuses on a scenario of high interest in manufacturing: single-instance anomaly detection, where only one object, either real or synthetic, is available fo… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  3. arXiv:2506.21546  [pdf, ps, other

    cs.CV cs.AI cs.CL cs.LG

    HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation

    Authors: Xinzhuo Li, Adheesh Juvekar, Xingyou Liu, Muntasir Wahed, Kiet A. Nguyen, Ismini Lourentzou

    Abstract: Recent progress in vision-language segmentation has significantly advanced grounded visual understanding. However, these models often exhibit hallucinations by producing segmentation masks for objects not grounded in the image content or by incorrectly labeling irrelevant regions. Existing evaluation protocols for segmentation hallucination primarily focus on label or textual hallucinations withou… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Project webpage: https://plan-lab.github.io/hallusegbench/

  4. arXiv:2506.21543  [pdf, ps, other

    math.ST cs.IT math.PR

    Detecting weighted hidden cliques

    Authors: Urmisha Chatterjee, Karissa Huang, Ritabrata Karmakar, B. R. Vinay Kumar, Gábor Lugosi, Nandan Malhotra, Anirban Mandal, Maruf Alam Tarafdar

    Abstract: We study a generalization of the classical hidden clique problem to graphs with real-valued edge weights. Formally, we define a hypothesis testing problem. Under the null hypothesis, edges of a complete graph on $n$ vertices are associated with independent and identically distributed edge weights from a distribution $P$. Under the alternate hypothesis, $k$ vertices are chosen at random and the edg… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    MSC Class: 62F03

  5. arXiv:2506.21538  [pdf, ps, other

    cs.CV cs.IR cs.LG

    Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval

    Authors: Hani Alomari, Anushka Sivakumar, Andrew Zhang, Chris Thomas

    Abstract: Cross-modal image-text retrieval is challenging because of the diverse possible associations between content from different modalities. Traditional methods learn a single-vector embedding to represent semantics of each sample, but struggle to capture nuanced and diverse relationships that can exist across modalities. Set-based approaches, which represent each sample with multiple embeddings, offer… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Accepted at the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025 Main)

  6. arXiv:2506.21535  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Exploring the Design Space of 3D MLLMs for CT Report Generation

    Authors: Mohammed Baharoon, Jun Ma, Congyu Fang, Augustin Toma, Bo Wang

    Abstract: Multimodal Large Language Models (MLLMs) have emerged as a promising way to automate Radiology Report Generation (RRG). In this work, we systematically investigate the design space of 3D MLLMs, including visual input representation, projectors, Large Language Models (LLMs), and fine-tuning techniques for 3D CT report generation. We also introduce two knowledge-based report augmentation methods tha… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  7. arXiv:2506.21532  [pdf, ps, other

    cs.CL cs.AI cs.CY

    "What's Up, Doc?": Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets

    Authors: Akshay Paruchuri, Maryam Aziz, Rohit Vartak, Ayman Ali, Best Uchehara, Xin Liu, Ishan Chatterjee, Monica Agrawal

    Abstract: People are increasingly seeking healthcare information from large language models (LLMs) via interactive chatbots, yet the nature and inherent risks of these conversations remain largely unexplored. In this paper, we filter large-scale conversational AI datasets to achieve HealthChat-11K, a curated dataset of 11K real-world conversations composed of 25K user messages. We use HealthChat-11K and a c… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 25 pages, 6 figures, 4 tables, corresponds to initial HealthChat-11K dataset release

  8. arXiv:2506.21514  [pdf, ps, other

    cs.CV

    G$^{2}$D: Boosting Multimodal Learning with Gradient-Guided Distillation

    Authors: Mohammed Rakib, Arunkumar Bagavathi

    Abstract: Multimodal learning aims to leverage information from diverse data modalities to achieve more comprehensive performance. However, conventional multimodal models often suffer from modality imbalance, where one or a few modalities dominate model optimization, leading to suboptimal feature representation and underutilization of weak modalities. To address this challenge, we introduce Gradient-Guided… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Accepted at ICCV 2025

  9. arXiv:2506.21511  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Gaussian Invariant Markov Chain Monte Carlo

    Authors: Michalis K. Titsias, Angelos Alexopoulos, Siran Liu, Petros Dellaportas

    Abstract: We develop sampling methods, which consist of Gaussian invariant versions of random walk Metropolis (RWM), Metropolis adjusted Langevin algorithm (MALA) and second order Hessian or Manifold MALA. Unlike standard RWM and MALA we show that Gaussian invariant sampling can lead to ergodic estimators with improved statistical efficiency. This is due to a remarkable property of Gaussian invariance that… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 29, 2 figures

  10. arXiv:2506.21508  [pdf, ps, other

    cs.CL cs.AI cs.IR cs.LG

    skLEP: A Slovak General Language Understanding Benchmark

    Authors: Marek Šuppa, Andrej Ridzik, Daniel Hládek, Tomáš Javůrek, Viktória Ondrejová, Kristína Sásiková, Martin Tamajka, Marián Šimko

    Abstract: In this work, we introduce skLEP, the first comprehensive benchmark specifically designed for evaluating Slovak natural language understanding (NLU) models. We have compiled skLEP to encompass nine diverse tasks that span token-level, sentence-pair, and document-level challenges, thereby offering a thorough assessment of model capabilities. To create this benchmark, we curated new, original datase… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: ACL 2025 Findings

    MSC Class: 68T50 ACM Class: I.2.7

  11. arXiv:2506.21506  [pdf, ps, other

    cs.AI cs.CL

    Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

    Authors: Boyu Gou, Zanming Huang, Yuting Ning, Yu Gu, Michael Lin, Weijian Qi, Andrei Kopanev, Botao Yu, Bernal Jiménez Gutiérrez, Yiheng Shu, Chan Hee Song, Jiaman Wu, Shijie Chen, Hanane Nour Moussa, Tianshu Zhang, Jian Xie, Yifei Li, Tianci Xue, Zeyi Liao, Kai Zhang, Boyuan Zheng, Zhaowei Cai, Viktor Rozgic, Morteza Ziyadi, Huan Sun , et al. (1 additional authors not shown)

    Abstract: Agentic search such as Deep Research systems, where large language models autonomously browse the web, synthesize information, and return comprehensive citation-backed answers, represents a major shift in how users interact with web-scale information. While promising greater efficiency and cognitive offloading, the growing complexity and open-endedness of agentic search have outpaced existing eval… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Project Homepage: https://osu-nlp-group.github.io/Mind2Web2/

  12. arXiv:2506.21495  [pdf, ps, other

    cs.CL

    Bridging Offline and Online Reinforcement Learning for LLMs

    Authors: Jack Lanchantin, Angelica Chen, Janice Lan, Xian Li, Swarnadeep Saha, Tianlu Wang, Jing Xu, Ping Yu, Weizhe Yuan, Jason E Weston, Sainbayar Sukhbaatar, Ilia Kulikov

    Abstract: We investigate the effectiveness of reinforcement learning methods for finetuning large language models when transitioning from offline to semi-online to fully online regimes for both verifiable and non-verifiable tasks. Our experiments cover training on verifiable math as well as non-verifiable instruction following with a set of benchmark evaluations for both. Across these settings, we extensive… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  13. arXiv:2506.21490  [pdf, ps, other

    cs.AI cs.HC cs.MA

    Ad-Hoc Human-AI Coordination Challenge

    Authors: Tin Dizdarević, Ravi Hammond, Tobias Gessler, Anisoara Calinescu, Jonathan Cook, Matteo Gallici, Andrei Lupu, Jakob Nicolaus Foerster

    Abstract: Achieving seamless coordination between AI agents and humans is crucial for real-world applications, yet it remains a significant open challenge. Hanabi is a cooperative card game featuring imperfect information, constrained communication, theory of mind requirements, and coordinated action -- making it an ideal testbed for human-AI coordination. However, its use for human-AI interaction has been… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Published at ICML 2025

  14. arXiv:2506.21487  [pdf

    cs.AR

    OptGM: An Optimized Gate Merging Method to Mitigate NBTI in Digital Circuits

    Authors: Maryam Ghane, Amir M. Hajisadeghi, Hamid R. Zarandi

    Abstract: This paper presents OptGM, an optimized gate merging method designed to mitigate negative bias temperature instability (NBTI) in digital circuits. First, the proposed approach effectively identifies NBTI-critical internal nodes, defined as those with a signal probability exceeding a predefined threshold. Next, based on the proposed optimized algorithm, the sensitizer gate (which drives the critica… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  15. arXiv:2506.21476  [pdf, ps, other

    cs.CV

    Global and Local Entailment Learning for Natural World Imagery

    Authors: Srikumar Sastry, Aayush Dhakal, Eric Xing, Subash Khanal, Nathan Jacobs

    Abstract: Learning the hierarchical structure of data in vision-language models is a significant challenge. Previous works have attempted to address this challenge by employing entailment learning. However, these approaches fail to model the transitive nature of entailment explicitly, which establishes the relationship between order and semantics within a representation space. In this work, we introduce Rad… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Accepted at ICCV 2025

  16. arXiv:2506.21467  [pdf, ps, other

    cs.DC

    Efficient and Reuseable Cloud Configuration Search Using Discovery Spaces

    Authors: Michael Johnston, Burkhard Ringlein, Christoph Hagleitner, Alessandro Pomponio, Vassilis Vassiliadis, Christian Pinto, Srikumar Venugopal

    Abstract: Finding the optimal set of cloud resources to deploy a given workload at minimal cost while meeting a defined service level agreement is an active area of research. Combining tens of parameters applicable across a large selection of compute, storage, and services offered by cloud providers with similar numbers of application-specific parameters leads to configuration spaces with millions of deploy… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    ACM Class: C.4

  17. arXiv:2506.21463  [pdf, ps, other

    cs.CL cs.LG cs.SD eess.AS

    Aligning Spoken Dialogue Models from User Interactions

    Authors: Anne Wu, Laurent Mazaré, Neil Zeghidour, Alexandre Défossez

    Abstract: We propose a novel preference alignment framework for improving spoken dialogue models on real-time conversations from user interactions. Current preference learning methods primarily focus on text-based language models, and are not directly suited to the complexities of real-time speech interactions, with richer dynamics (e.g. interruption, interjection) and no explicit segmentation between speak… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Accepted at ICML 2025

  18. A Keyword-Based Technique to Evaluate Broad Question Answer Script

    Authors: Tamim Al Mahmud, Md Gulzar Hussain, Sumaiya Kabir, Hasnain Ahmad, Mahmudus Sobhan

    Abstract: Evaluation is the method of assessing and determining the educational system through various techniques such as verbal or viva-voice test, subjective or objective written test. This paper presents an efficient solution to evaluate the subjective answer script electronically. In this paper, we proposed and implemented an integrated system that examines and evaluates the written answer script. This… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: ACM Conference Proceedings (9 Pages)

    Journal ref: Proceedings of the 2020 9th International Conference on Software and Computer Applications, Pages 167 - 171, 2020

  19. arXiv:2506.21453  [pdf, ps, other

    cs.LG eess.SY math.OC

    Towards an Optimal Control Perspective of ResNet Training

    Authors: Jens Püttschneider, Simon Heilig, Asja Fischer, Timm Faulwasser

    Abstract: We propose a training formulation for ResNets reflecting an optimal control problem that is applicable for standard architectures and general loss functions. We suggest bridging both worlds via penalizing intermediate outputs of hidden states corresponding to stage cost terms in optimal control. For standard ResNets, we obtain intermediate outputs by propagating the state through the subsequent sk… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Accepted for presentation at the High-dimensional Learning Dynamics (HiLD) workshop at ICML 2025

  20. arXiv:2506.21451  [pdf

    cs.CV cs.LG

    A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario

    Authors: Cyrus Addy, Ajay Kumar Gurumadaiah, Yixiang Gao, Kwame Awuah-Offei

    Abstract: Underground mining operations face significant safety challenges that make emergency response capabilities crucial. While robots have shown promise in assisting with search and rescue operations, their effectiveness depends on reliable miner detection capabilities. Deep learning algorithms offer potential solutions for automated miner detection, but require comprehensive training datasets, which a… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  21. arXiv:2506.21446  [pdf, other

    cs.CV

    Controllable 3D Placement of Objects with Scene-Aware Diffusion Models

    Authors: Mohamed Omran, Dimitris Kalatzis, Jens Petersen, Amirhossein Habibian, Auke Wiggers

    Abstract: Image editing approaches have become more powerful and flexible with the advent of powerful text-conditioned generative models. However, placing objects in an environment with a precise location and orientation still remains a challenge, as this typically requires carefully crafted inpainting masks or prompts. In this work, we show that a carefully designed visual map, combined with coarse object… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  22. arXiv:2506.21444  [pdf, ps, other

    cs.CV

    Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation

    Authors: Sweta Banerjee, Viktoria Weiss, Taryn A. Donovan, Rutger A. Fick, Thomas Conrad, Jonas Ammeling, Nils Porsche, Robert Klopfleisch, Christopher Kaltenecker, Katharina Breininger, Marc Aubreville, Christof A. Bertram

    Abstract: Atypical mitoses mark a deviation in the cell division process that can be an independent prognostically relevant marker for tumor malignancy. However, their identification remains challenging due to low prevalence, at times subtle morphological differences from normal mitoses, low inter-rater agreement among pathologists, and class imbalance in datasets. Building on the Atypical Mitosis dataset f… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  23. arXiv:2506.21443  [pdf, ps, other

    cs.CL cs.AI

    Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection

    Authors: Ali Şenol, Garima Agrawal, Huan Liu

    Abstract: Detecting deceptive conversations on dynamic platforms is increasingly difficult due to evolving language patterns and Concept Drift (CD)-i.e., semantic or topical shifts that alter the context or intent of interactions over time. These shifts can obscure malicious intent or mimic normal dialogue, making accurate classification challenging. While Large Language Models (LLMs) show strong performanc… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  24. arXiv:2506.21440  [pdf, ps, other

    cs.SD cs.LG eess.AS eess.SP

    Learnable Adaptive Time-Frequency Representation via Differentiable Short-Time Fourier Transform

    Authors: Maxime Leiber, Yosra Marnissi, Axel Barrau, Sylvain Meignen, Laurent Massoulié

    Abstract: The short-time Fourier transform (STFT) is widely used for analyzing non-stationary signals. However, its performance is highly sensitive to its parameters, and manual or heuristic tuning often yields suboptimal results. To overcome this limitation, we propose a unified differentiable formulation of the STFT that enables gradient-based optimization of its parameters. This approach addresses the li… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: DSTFT, STFT, spectrogram, time-frequency, IEEE Transactions on Signal Processing, 10 pages

  25. arXiv:2506.21426  [pdf, ps, other

    physics.soc-ph cs.SI econ.GN physics.data-an q-fin.RM

    Evolution and determinants of firm-level systemic risk in local production networks

    Authors: Anna Mancini, Balázs Lengyel, Riccardo Di Clemente, Giulio Cimini

    Abstract: Recent crises like the COVID-19 pandemic and geopolitical tensions have exposed vulnerabilities and caused disruptions of supply chains, leading to product shortages, increased costs, and economic instability. This has prompted increasing efforts to assess systemic risk, namely the effects of firm disruptions on entire economies. However, the ability of firms to react to crises by rewiring their s… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 15 pages, 4 figures

  26. arXiv:2506.21411  [pdf, ps, other

    cs.LG

    Distributed Cross-Channel Hierarchical Aggregation for Foundation Models

    Authors: Aristeidis Tsaris, Isaac Lyngaas, John Lagregren, Mohamed Wahib, Larry York, Prasanna Balaprakash, Dan Lu, Feiyi Wang, Xiao Wang

    Abstract: Vision-based scientific foundation models hold significant promise for advancing scientific discovery and innovation. This potential stems from their ability to aggregate images from diverse sources such as varying physical groundings or data acquisition systems and to learn spatio-temporal correlations using transformer architectures. However, tokenizing and aggregating images can be compute-inte… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  27. arXiv:2506.21408  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference

    Authors: Colin Samplawski, Adam D. Cobb, Manoj Acharya, Ramneet Kaur, Susmit Jha

    Abstract: Despite their widespread use, large language models (LLMs) are known to hallucinate incorrect information and be poorly calibrated. This makes the uncertainty quantification of these models of critical importance, especially in high-stakes domains, such as autonomy and healthcare. Prior work has made Bayesian deep learning-based approaches to this problem more tractable by performing inference ove… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Accepted at UAI 2025

  28. arXiv:2506.21406  [pdf, ps, other

    cs.NI

    Flowcut Switching: High-Performance Adaptive Routing with In-Order Delivery Guarantees

    Authors: Tommaso Bonato, Daniele De Sensi, Salvatore Di Girolamo, Abdulla Bataineh, David Hewson, Duncan Roweth, Torsten Hoefler

    Abstract: Network latency severely impacts the performance of applications running on supercomputers. Adaptive routing algorithms route packets over different available paths to reduce latency and improve network utilization. However, if a switch routes packets belonging to the same network flow on different paths, they might arrive at the destination out-of-order due to differences in the latency of these… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  29. arXiv:2506.21374  [pdf, ps, other

    cs.LG cs.AI

    Pay Attention to Small Weights

    Authors: Chao Zhou, Tom Jacobs, Advait Gadhikar, Rebekka Burkholz

    Abstract: Finetuning large pretrained neural networks is known to be resource-intensive, both in terms of memory and computational cost. To mitigate this, a common approach is to restrict training to a subset of the model parameters. By analyzing the relationship between gradients and weights during finetuning, we observe a notable pattern: large gradients are often associated with small-magnitude weights.… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  30. arXiv:2506.21358  [pdf, ps, other

    cs.CV cs.RO

    ToosiCubix: Monocular 3D Cuboid Labeling via Vehicle Part Annotations

    Authors: Behrooz Nasihatkon, Hossein Resani, Amirreza Mehrzadian

    Abstract: Many existing methods for 3D cuboid annotation of vehicles rely on expensive and carefully calibrated camera-LiDAR or stereo setups, limiting their accessibility for large-scale data collection. We introduce ToosiCubix, a simple yet powerful approach for annotating ground-truth cuboids using only monocular images and intrinsic camera parameters. Our method requires only about 10 user clicks per ve… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  31. arXiv:2506.21357  [pdf, ps, other

    cs.CV

    CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations

    Authors: Julian Lorenz, Mrunmai Phatak, Robin Schön, Katja Ludwig, Nico Hörmann, Annemarie Friedrich, Rainer Lienhart

    Abstract: 2D scene graphs provide a structural and explainable framework for scene understanding. However, current work still struggles with the lack of accurate scene graph data. To overcome this data bottleneck, we present CoPa-SG, a synthetic scene graph dataset with highly precise ground truth and exhaustive relation annotations between all objects. Moreover, we introduce parametric and proto-relations,… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  32. Real-time Terrain Analysis for Off-road Autonomous Vehicles

    Authors: Edwina Lewis, Aditya Parameshwaran, Laura Redmond, Yue Wang

    Abstract: This research addresses critical autonomous vehicle control challenges arising from road roughness variation, which induces course deviations and potential loss of road contact during steering operations. We present a novel real-time road roughness estimation system employing Bayesian calibration methodology that processes axle accelerations to predict terrain roughness with quantifiable confidenc… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Journal ref: SAE Technical Papers 2025-01-8343

  33. arXiv:2506.21338  [pdf

    cs.LG cs.HC

    AGTCNet: A Graph-Temporal Approach for Principled Motor Imagery EEG Classification

    Authors: Galvin Brice S. Lim, Brian Godwin S. Lim, Argel A. Bandala, John Anthony C. Jose, Timothy Scott C. Chu, Edwin Sybingco

    Abstract: Brain-computer interface (BCI) technology utilizing electroencephalography (EEG) marks a transformative innovation, empowering motor-impaired individuals to engage with their environment on equal footing. Despite its promising potential, developing subject-invariant and session-invariant BCI systems remains a significant challenge due to the inherent complexity and variability of neural activity a… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  34. arXiv:2506.21337  [pdf, ps, other

    math.CO cs.DM

    Symmetry classes of Hamiltonian cycles

    Authors: Julia Baligacs, Sofia Brenner, Annette Lutz, Lena Volk

    Abstract: We initiate the study of Hamiltonian cycles up to symmetries of the underlying graph. Our focus lies on the extremal case of Hamiltonian-transitive graphs, i.e., Hamiltonian graphs where, for every pair of Hamiltonian cycles, there is a graph automorphism mapping one cycle to the other. This generalizes the extensively studied uniquely Hamiltonian graphs. In this paper, we show that Cayley graphs… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 27 pages, 13 figures

    MSC Class: 05C60; 05C38; 05C76; 68R05; 68R10

  35. Automatic Reviewers Assignment to a Research Paper Based on Allied References and Publications Weight

    Authors: Tamim Al Mahmud, B M Mainul Hossain, Dilshad Ara

    Abstract: Everyday, a vast stream of research documents is submitted to conferences, anthologies, journals, newsletters, annual reports, daily papers, and various periodicals. Many such publications use independent external specialists to review submissions. This process is called peer review, and the reviewers are called referees. However, it is not always possible to pick the best referee for reviewing. M… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: IEEE Conference Proceedings (5 Pages)

    Journal ref: 2018 4th International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 2018, pp. 1-5

  36. arXiv:2506.21330  [pdf, ps, other

    cs.CV cs.AI

    Holistic Surgical Phase Recognition with Hierarchical Input Dependent State Space Models

    Authors: Haoyang Wu, Tsun-Hsuan Wang, Mathias Lechner, Ramin Hasani, Jennifer A. Eckhoff, Paul Pak, Ozanan R. Meireles, Guy Rosman, Yutong Ban, Daniela Rus

    Abstract: Surgical workflow analysis is essential in robot-assisted surgeries, yet the long duration of such procedures poses significant challenges for comprehensive video analysis. Recent approaches have predominantly relied on transformer models; however, their quadratic attention mechanism restricts efficient processing of lengthy surgical videos. In this paper, we propose a novel hierarchical input-dep… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  37. arXiv:2506.21327  [pdf, ps, other

    cs.DC

    Enabling Bitcoin Smart Contracts on the Internet Computer

    Authors: Ryan Croote, Islam El-Ashi, Thomas Locher, Yvonne-Anne Pignolet

    Abstract: There is growing interest in providing programmatic access to the value locked in Bitcoin, which famously offers limited programmability itself. Various approaches have been put forth in recent years, with the vast majority of proposed mechanisms either building new functionality on top of Bitcoin or leveraging a bridging mechanism to enable smart contracts that make use of ``wrapped'' bitcoins on… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Published at ICDCS 2025, waiting for DOI

  38. arXiv:2506.21300  [pdf, ps, other

    cs.SE

    An object-centric core metamodel for IoT-enhanced event logs

    Authors: Yannis Bertrand, Christian Imenkamp, Lukas Malburg, Matthias Ehrendorfer, Marco Franceschetti, Joscha Grüger, Francesco Leotta, Jürgen Mangler, Ronny Seiger, Agnes Koschmider, Stefanie Rinderle-Ma, Barbara Weber, Estefania Serral

    Abstract: Advances in Internet-of-Things (IoT) technologies have prompted the integration of IoT devices with business processes (BPs) in many organizations across various sectors, such as manufacturing, healthcare and smart spaces. The proliferation of IoT devices leads to the generation of large amounts of IoT data providing a window on the physical context of BPs, which facilitates the discovery of new i… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  39. arXiv:2506.21298  [pdf, ps, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    Exploring Adapter Design Tradeoffs for Low Resource Music Generation

    Authors: Atharva Mehta, Shivam Chauhan, Monojit Choudhury

    Abstract: Fine-tuning large-scale music generation models, such as MusicGen and Mustango, is a computationally expensive process, often requiring updates to billions of parameters and, therefore, significant hardware resources. Parameter-Efficient Fine-Tuning (PEFT) techniques, particularly adapter-based methods, have emerged as a promising alternative, enabling adaptation with minimal trainable parameters… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 9 pages, 5 figures

  40. arXiv:2506.21297  [pdf, ps, other

    cs.SE cs.DC

    Exploring Micro Frontends: A Case Study Application in E-Commerce

    Authors: Ricardo Hideki Hangai Kojo, Luiz Fernando Corte Real, Renato Cordeiro Ferreira, Thatiane de Oliveira Rosa, Alfredo Goldman

    Abstract: In the micro frontends architectural style, the frontend is divided into smaller components, which can range from a simple button to an entire page. The goal is to improve scalability, resilience, and team independence, albeit at the cost of increased complexity and infrastructure demands. This paper seeks to understand when it is worth adopting micro frontends, particularly in the context of indu… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 11 pages, 2 figures (2 diagrams), submitted to the workshop AMP 2025

    ACM Class: D.2.11; D.2.13; D.2.7

  41. arXiv:2506.21288  [pdf, ps, other

    cs.CL cs.AI cs.IR cs.LG

    Small Encoders Can Rival Large Decoders in Detecting Groundedness

    Authors: Istabrak Abbes, Gabriele Prato, Quentin Fournier, Fernando Rodriguez, Alaa Boukhary, Adam Elwood, Sarath Chandar

    Abstract: Augmenting large language models (LLMs) with external context significantly improves their performance in natural language processing (NLP) tasks. However, LLMs struggle to answer queries reliably when the provided context lacks information, often resorting to ungrounded speculation or internal knowledge. Groundedness - generating responses strictly supported by the context - is essential for ensu… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  42. arXiv:2506.21287  [pdf, ps, other

    cs.CV

    HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation

    Authors: Diego Biagini, Nassir Navab, Azade Farshad

    Abstract: Surgical Video Synthesis has emerged as a promising research direction following the success of diffusion models in general-domain video generation. Although existing approaches achieve high-quality video generation, most are unconditional and fail to maintain consistency with surgical actions and phases, lacking the surgical understanding and fine-grained guidance necessary for factual simulation… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Accepted at MICCAI 2025

  43. arXiv:2506.21285  [pdf, ps, other

    cs.CL

    Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning

    Authors: Xin Xu, Tianhao Chen, Fan Zhang, Wanlong Liu, Pengxiang Li, Ajay Kumar Jaiswal, Yuchen Yan, Jishan Hu, Yang Wang, Hao Chen, Shiwei Liu, Shizhe Diao, Can Yang, Lu Yin

    Abstract: While slow-thinking large language models (LLMs) exhibit reflection-like reasoning, commonly referred to as the "aha moment:, their ability to generate informative critiques and refine prior solutions remains limited. In this paper, we introduce Double-Checker, a principled framework designed to enhance the reasoning capabilities of slow-thinking LLMs by fostering explicit self-critique and iterat… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 10 pages

  44. arXiv:2506.21281  [pdf, ps, other

    cs.DM

    Playing Snake on a Graph

    Authors: Denise Graafsma, Bodo Manthey, Alexander Skopalik

    Abstract: Snake is a classic computer game, which has been around for decades. Based on this game, we study the game of Snake on arbitrary undirected graphs. A snake forms a simple path that has to move to an apple while avoiding colliding with itself. When the snake reaches the apple, it grows longer, and a new apple appears. A graph on which the snake has a strategy to keep eating apples until it covers a… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  45. arXiv:2506.21274  [pdf, ps, other

    cs.CL

    Cat and Mouse -- Can Fake Text Generation Outpace Detector Systems?

    Authors: Andrea McGlinchey, Peter J Barclay

    Abstract: Large language models can produce convincing "fake text" in domains such as academic writing, product reviews, and political news. Many approaches have been investigated for the detection of artificially generated text. While this may seem to presage an endless "arms race", we note that newer LLMs use ever more parameters, training data, and energy, while relatively simple classifiers demonstrate… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: (Submitted for publication)

  46. arXiv:2506.21266  [pdf, ps, other

    cs.SE cs.CY

    KOALA: a Configurable Tool for Collecting IDE Data When Solving Programming Tasks

    Authors: Daniil Karol, Elizaveta Artser, Ilya Vlasov, Yaroslav Golubev, Hieke Keuning, Anastasiia Birillo

    Abstract: Collecting data of students solving programming tasks is incredibly valuable for researchers and educators. It allows verifying that the students correctly apply the features and concepts they are taught, or finding students' misconceptions. However, existing data collection tools have limitations, e.g., no control over the granularity of the collected code, not collecting the specific events of t… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Accepted to CompEd'25, 7 pages, 4 figures

  47. arXiv:2506.21240  [pdf, ps, other

    cs.LG

    Zero-Shot Learning for Obsolescence Risk Forecasting

    Authors: Elie Saad, Aya Mrabah, Mariem Besbes, Marc Zolghadri, Victor Czmil, Claude Baron, Vincent Bourgeois

    Abstract: Component obsolescence poses significant challenges in industries reliant on electronic components, causing increased costs and disruptions in the security and availability of systems. Accurate obsolescence risk prediction is essential but hindered by a lack of reliable data. This paper proposes a novel approach to forecasting obsolescence risk using zero-shot learning (ZSL) with large language mo… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  48. arXiv:2506.21220  [pdf, ps, other

    cs.LG cs.CL

    Complexity-aware fine-tuning

    Authors: Andrey Goncharov, Daniil Vyazhev, Petr Sychev, Edvard Khalafyan, Alexey Zaytsev

    Abstract: General-purpose Large Language Models (LLMs) are frequently fine-tuned through supervised fine-tuning (SFT) to enhance performance in specific domains. Better results can be achieved by distilling the chain-of-thought of a larger model at the cost of numerous expensive calls and a much greater amount of data. We propose a novel blueprint for efficient fine-tuning that uses reasoning only for compl… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  49. arXiv:2506.21216  [pdf, ps, other

    cs.DS cs.DM

    Edge Clique Partition and Cover Beyond Independence

    Authors: Fedor V. Fomin, Petr A. Golovach, Danil Sagunov, Kirill Simonov

    Abstract: Covering and partitioning the edges of a graph into cliques are classical problems at the intersection of combinatorial optimization and graph theory, having been studied through a range of algorithmic and complexity-theoretic lenses. Despite the well-known fixed-parameter tractability of these problems when parameterized by the total number of cliques, such a parameterization often fails to be me… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: An extended abstract of this paper appears in the proceedings of ESA 2025

  50. arXiv:2506.21209  [pdf, ps, other

    cs.CV cs.AI

    BitMark for Infinity: Watermarking Bitwise Autoregressive Image Generative Models

    Authors: Louis Kerner, Michel Meintz, Bihe Zhao, Franziska Boenisch, Adam Dziedzic

    Abstract: State-of-the-art text-to-image models like Infinity generate photorealistic images at an unprecedented speed. These models operate in a bitwise autoregressive manner over a discrete set of tokens that is practically infinite in size. However, their impressive generative power comes with a growing risk: as their outputs increasingly populate the Internet, they are likely to be scraped and reused as… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.