
Showing 1–50 of 347 results for author: Lee, E

  1. arXiv:2506.18337  [pdf, ps, other]

    cs.CL

    TranslationCorrect: A Unified Framework for Machine Translation Post-Editing with Predictive Error Assistance

    Authors: Syed Mekael Wasti, Shou-Yi Hung, Christopher Collins, En-Shiun Annie Lee

    Abstract: Machine translation (MT) post-editing and research data collection often rely on inefficient, disconnected workflows. We introduce TranslationCorrect, an integrated framework designed to streamline these tasks. TranslationCorrect combines MT generation using models like NLLB, automated error prediction using models like XCOMET or LLM APIs (providing detailed reasoning), and an intuitive post-editi…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Preprint

  2. LegiGPT: Party Politics and Transport Policy with Large Language Model

    Authors: Hyunsoo Yun, Eun Hak Lee

    Abstract: Given the significant influence of lawmakers' political ideologies on legislative decision-making, analyzing their impact on transportation-related policymaking is of critical importance. This study introduces a novel framework that integrates a large language model (LLM) with explainable artificial intelligence (XAI) to analyze transportation-related legislative proposals. Legislative bill data f…

    Submitted 27 June, 2025; v1 submitted 19 June, 2025; originally announced June 2025.

    Comments: Updated title to match published version. Added DOI and journal reference to PDF

    Journal ref: Transport Policy, 2025

  3. arXiv:2506.01789  [pdf, ps, other]

    cs.LG cs.AI cs.CL cs.CV eess.AS

    Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

    Authors: Genta Indra Winata, David Anugraha, Emmy Liu, Alham Fikri Aji, Shou-Yi Hung, Aditya Parashar, Patrick Amadeus Irawan, Ruochen Zhang, Zheng-Xin Yong, Jan Christian Blaise Cruz, Niklas Muennighoff, Seungone Kim, Hanyang Zhao, Sudipta Kar, Kezia Erina Suryoraharjo, M. Farid Adilazuarda, En-Shiun Annie Lee, Ayu Purwarianti, Derry Tanti Wijaya, Monojit Choudhury

    Abstract: High-quality datasets are fundamental to training and evaluating machine learning models, yet their creation, especially with accurate human annotations, remains a significant challenge. Many dataset paper submissions lack originality, diversity, or rigorous quality control, and these shortcomings are often overlooked during peer review. Submissions also frequently omit essential details about datas…

    Submitted 3 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: Preprint

  4. arXiv:2506.00662  [pdf]

    q-bio.GN cs.LG

    Uncertainty-Aware Genomic Classification of Alzheimer's Disease: A Transformer-Based Ensemble Approach with Monte Carlo Dropout

    Authors: Taeho Jo, Eun Hye Lee, Alzheimer's Disease Sequencing Project

    Abstract: INTRODUCTION: Alzheimer's disease (AD) is genetically complex, complicating robust classification from genomic data. METHODS: We developed a transformer-based ensemble model (TrUE-Net) using Monte Carlo Dropout for uncertainty estimation in AD classification from whole-genome sequencing (WGS). We combined a transformer that preserves single-nucleotide polymorphism (SNP) sequence structure with a c…

    Submitted 31 May, 2025; originally announced June 2025.

  5. arXiv:2506.00481  [pdf, other]

    cs.CL cs.AI

    PVP: An Image Dataset for Personalized Visual Persuasion with Persuasion Strategies, Viewer Characteristics, and Persuasiveness Ratings

    Authors: Junseo Kim, Jongwook Han, Dongmin Choi, Jongwook Yoon, Eun-Ju Lee, Yohan Jo

    Abstract: Visual persuasion, which uses visual elements to influence cognition and behaviors, is crucial in fields such as advertising and political communication. With recent advancements in artificial intelligence, there is growing potential to develop persuasive systems that automatically generate persuasive images tailored to individuals. However, a significant bottleneck in this area is the lack of com…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: ACL 2025 Main. Code and dataset are released at: https://github.com/holi-lab/PVP_Personalized_Visual_Persuasion

  6. arXiv:2505.22677  [pdf, ps, other]

    cs.CV

    Using Cross-Domain Detection Loss to Infer Multi-Scale Information for Improved Tiny Head Tracking

    Authors: Jisu Kim, Alex Mattingly, Eung-Joo Lee, Benjamin S. Riggan

    Abstract: Head detection and tracking are essential for downstream tasks, but current methods often require large computational budgets, which increase latencies and tie up resources (e.g., processors, memory, and bandwidth). To address this, we propose a framework to enhance tiny head detection and tracking by optimizing the balance between performance and efficiency. Our framework integrates (1) a cross-…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: To appear at IEEE International Conference on Automatic Face and Gesture 2025 (FG2025)

  7. arXiv:2505.21939  [pdf, ps, other]

    cs.DS

    Improved Approximation Algorithms for Chromatic and Pseudometric-Weighted Correlation Clustering

    Authors: Dahoon Lee, Chenglin Fan, Euiwoong Lee

    Abstract: Correlation Clustering (CC) is a foundational problem in unsupervised learning that models binary similarity relations using labeled graphs. While classical CC has been widely studied, many real-world applications involve more nuanced relationships, either multi-class categorical interactions or varying confidence levels in edge labels. To address these, two natural generalizations have been propo…

    Submitted 27 May, 2025; originally announced May 2025.

  8. arXiv:2505.21919  [pdf, ps, other]

    cs.ET cs.AI cs.DC

    Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference

    Authors: Yue Zhu, Hao Yu, Chen Wang, Zhuoran Liu, Eun Kyung Lee

    Abstract: The increasing adoption of large language models (LLMs) with extended context windows necessitates efficient Key-Value Cache (KVC) management to optimize inference performance. Inference workloads like Retrieval-Augmented Generation (RAG) and agents exhibit high cache reusability, making efficient caching critical to reducing redundancy and improving speed. We analyze real-world KVC access pattern…

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: This paper has been accepted at IEEE Cloud 2025 as WIP paper. The final version will appear in IEEE Xplore

  9. arXiv:2505.08854  [pdf, ps, other]

    cs.CV cs.AI cs.RO

    Generative AI for Autonomous Driving: Frontiers and Opportunities

    Authors: Yuping Wang, Shuo Xing, Cui Can, Renjie Li, Hongyuan Hua, Kexin Tian, Zhaobin Mo, Xiangbo Gao, Keshu Wu, Sulong Zhou, Hengxu You, Juntong Peng, Junge Zhang, Zehao Wang, Rui Song, Mingxuan Yan, Walter Zimmer, Xingcheng Zhou, Peiran Li, Zhaohan Lu, Chia-Ju Chen, Yue Huang, Ryan A. Rossi, Lichao Sun, Hongkai Yu , et al. (22 additional authors not shown)

    Abstract: Generative Artificial Intelligence (GenAI) constitutes a transformative technological wave that reconfigures industries through its unparalleled capabilities for content creation, reasoning, planning, and multimodal understanding. This revolutionary force offers the most promising path yet toward solving one of engineering's grandest challenges: achieving reliable, fully autonomous driving, partic…

    Submitted 13 May, 2025; originally announced May 2025.

  10. arXiv:2505.03777  [pdf, other]

    cs.LG

    MolMole: Molecule Mining from Scientific Literature

    Authors: LG AI Research, Sehyun Chun, Jiye Kim, Ahra Jo, Yeonsik Jo, Seungyul Oh, Seungjun Lee, Kwangrok Ryoo, Jongmin Lee, Seung Hwan Kim, Byung Jun Kang, Soonyoung Lee, Jun Ha Park, Chanwoo Moon, Jiwon Ham, Haein Lee, Heejae Han, Jaeseung Byun, Soojong Do, Minju Ha, Dongyun Kim, Kyunghoon Bae, Woohyung Lim, Edward Hwayoung Lee, Yongmin Park , et al. (9 additional authors not shown)

    Abstract: The extraction of molecular structures and reaction data from scientific documents is challenging due to their varied, unstructured chemical formats and complex document layouts. To address this, we introduce MolMole, a vision-based deep learning framework that unifies molecule detection, reaction diagram parsing, and optical chemical structure recognition (OCSR) into a single pipeline for automat…

    Submitted 7 May, 2025; v1 submitted 30 April, 2025; originally announced May 2025.

    Comments: 15 pages, 12 figures

  11. arXiv:2505.03770  [pdf, other]

    cs.AI

    Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind

    Authors: Mouad Abrini, Omri Abend, Dina Acklin, Henny Admoni, Gregor Aichinger, Nitay Alon, Zahra Ashktorab, Ashish Atreja, Moises Auron, Alexander Aufreiter, Raghav Awasthi, Soumya Banerjee, Joe M. Barnby, Rhea Basappa, Severin Bergsmann, Djallel Bouneffouf, Patrick Callaghan, Marc Cavazza, Thierry Chaminade, Sonia Chernova, Mohamed Chetouan, Moumita Choudhury, Axel Cleeremans, Jacek B. Cywinski, Fabio Cuzzolin , et al. (83 additional authors not shown)

    Abstract: This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2025 in Philadelphia US on 3rd March 2025. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community.

    Submitted 28 April, 2025; originally announced May 2025.

    Comments: workshop proceedings

  12. arXiv:2505.03062  [pdf, other]

    cs.SE

    Testing SSD Firmware with State Data-Aware Fuzzing: Accelerating Coverage in Nondeterministic I/O Environments

    Authors: Gangho Yoon, Eunseok Lee

    Abstract: Solid-State Drive (SSD) firmware manages complex internal states, including flash memory maintenance. Due to nondeterministic I/O operations, traditional testing methods struggle to rapidly achieve coverage of firmware code areas that require extensive I/O accumulation. To address this challenge, we propose a state data-aware fuzzing approach that leverages SSD firmware's internal state to guide i…

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 6 pages, 3 figures. This paper has been accepted at the 29th International Conference on Evaluation and Assessment in Software Engineering (EASE 2025)

    ACM Class: D.2.5; D.2.4

  13. arXiv:2505.01364  [pdf]

    cs.CV

    Monitoring morphometric drift in lifelong learning segmentation of the spinal cord

    Authors: Enamundram Naga Karthik, Sandrine Bédard, Jan Valošek, Christoph S. Aigner, Elise Bannier, Josef Bednařík, Virginie Callot, Anna Combes, Armin Curt, Gergely David, Falk Eippert, Lynn Farner, Michael G Fehlings, Patrick Freund, Tobias Granberg, Cristina Granziera, RHSCIR Network Imaging Group, Ulrike Horn, Tomáš Horák, Suzanne Humphreys, Markus Hupp, Anne Kerbrat, Nawal Kinany, Shannon Kolind, Petr Kudlička , et al. (31 additional authors not shown)

    Abstract: Morphometric measures derived from spinal cord segmentations can serve as diagnostic and prognostic biomarkers in neurological diseases and injuries affecting the spinal cord. While automatic segmentation methods that are robust to a wide variety of contrasts and pathologies have been developed over the past few years, whether their predictions are stable as the model is updated using new datasets has not…

    Submitted 2 May, 2025; originally announced May 2025.

  14. arXiv:2505.01015  [pdf, ps, other]

    cs.CL cs.AI

    Value Portrait: Assessing Language Models' Values through Psychometrically and Ecologically Valid Items

    Authors: Jongwook Han, Dongmin Choi, Woojung Song, Eun-Ju Lee, Yohan Jo

    Abstract: The importance of benchmarks for assessing the values of language models has become pronounced due to the growing need for more authentic, human-aligned responses. However, existing benchmarks rely on human or machine annotations that are vulnerable to value-related biases. Furthermore, the tested scenarios often diverge from real-world contexts in which models are commonly used to generate text and…

    Submitted 11 June, 2025; v1 submitted 2 May, 2025; originally announced May 2025.

    Comments: This paper has been accepted for publication at ACL 2025

    ACM Class: I.2.7

  15. arXiv:2504.20027  [pdf, ps, other]

    cs.DS

    All-Subsets Important Separators with Applications to Sample Sets, Balanced Separators and Vertex Sparsifiers in Directed Graphs

    Authors: Aditya Anand, Euiwoong Lee, Jason Li, Thatchaphol Saranurak

    Abstract: Given a directed graph $G$ with $n$ vertices and $m$ edges, a parameter $k$ and two disjoint subsets $S,T \subseteq V(G)$, we show that the number of all-subsets important separators, which is the number of $A$-$B$ important vertex separators of size at most $k$ over all $A \subseteq S$ and $B \subseteq T$, is at most $\beta(|S|, |T|, k) = 4^k {|S| \choose \leq k} {|T| \choose \leq 2k}$, where…

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: Abstract shortened

  16. arXiv:2504.19051  [pdf, ps, other]

    cs.DS

    Min-CSPs on Complete Instances II: Polylogarithmic Approximation for Min-NAE-3-SAT

    Authors: Aditya Anand, Euiwoong Lee, Davide Mazzali, Amatya Sharma

    Abstract: This paper studies complete $k$-Constraint Satisfaction Problems (CSPs), where an $n$-variable instance has exactly one nontrivial constraint for each subset of $k$ variables, i.e., it has $\binom{n}{k}$ constraints. A recent work started a systematic study of complete $k$-CSPs [Anand, Lee, Sharma, SODA'25], and showed a quasi-polynomial time algorithm that decides if there is an assignment satisf…

    Submitted 26 April, 2025; originally announced April 2025.

  17. arXiv:2504.12060  [pdf, ps, other]

    cs.DS

    Static to Dynamic Correlation Clustering

    Authors: Nairen Cao, Vincent Cohen-Addad, Euiwoong Lee, Shi Li, David Rasmussen Lolck, Alantha Newman, Mikkel Thorup, Lukas Vogl, Shuyi Yan, Hanwen Zhang

    Abstract: Correlation clustering is a well-studied problem, first proposed by Bansal, Blum, and Chawla [BBC04]. The input is an unweighted, undirected graph. The problem is to cluster the vertices so as to minimize the number of edges between vertices in different clusters and missing edges between vertices inside the same cluster. This problem has wide application in data mining and machine learning. W…

    Submitted 22 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

  18. arXiv:2504.06201  [pdf]

    quant-ph cs.CE

    Quantum Annealing for Combinatorial Optimization: A Benchmarking Study

    Authors: Seongmin Kim, Sang-Woo Ahn, In-Saeng Suh, Alexander W. Dowling, Eungkyu Lee, Tengfei Luo

    Abstract: Quantum annealing (QA) has the potential to significantly improve solution quality and reduce time complexity in solving combinatorial optimization problems compared to classical optimization methods. However, due to the limited number of qubits and their connectivity, the QA hardware did not show such an advantage over classical methods in past benchmarking studies. Recent advancements in QA with…

    Submitted 8 April, 2025; originally announced April 2025.

  19. arXiv:2504.04553  [pdf, other]

    cs.HC

    Chain of Understanding: Supporting Code Understanding with Large Language Models

    Authors: Jie Gao, Yue Xue, Xiaofei Xie, SoeMin Thant, Erika Lee

    Abstract: Code auditing demands a robust understanding of codebases - an especially challenging task for end-user developers with limited expertise. To address this, we conducted formative interviews with experienced auditors and identified a Chain-of-Understanding approach, in which Large Language Models (LLMs) guide developers through hierarchical code comprehension - from high-level overviews to specific…

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: 15 pages, 11 figures, 3 tables

  20. arXiv:2504.04224  [pdf, other]

    cs.SE eess.SY

    Exploration of Approaches for Robustness and Safety in a Low Code Open Environment for Factory Automation

    Authors: Gustavo Quiros A., Yi Peng Zhu, Tao Cui, Shaokai Lin, Marten Lohstroh, Edward A. Lee

    Abstract: This report is a compilation of technical knowledge and concepts that were produced by the authors and additional contributors in the context of the collaboration projects "Abstraction Requirements for Language of Choice in Industrial Automation" (FY21-22) and "Approaches for Robust and Safe Low-Code" (FY23-24) from Siemens Technology and the University of California, Berkeley. The primary objecti…

    Submitted 5 April, 2025; originally announced April 2025.

    Comments: 15 pages, 4 figures, technical report

  21. arXiv:2504.03888  [pdf, other]

    cs.HC cs.AI

    Investigating Affective Use and Emotional Well-being on ChatGPT

    Authors: Jason Phang, Michael Lampe, Lama Ahmad, Sandhini Agarwal, Cathy Mengying Fang, Auren R. Liu, Valdemar Danry, Eunhae Lee, Samantha W. T. Chan, Pat Pataranutaporn, Pattie Maes

    Abstract: As AI chatbots see increased adoption and integration into everyday life, questions have been raised about the potential impact of human-like or anthropomorphic AI on users. In this work, we investigate the extent to which interactions with ChatGPT (with a focus on Advanced Voice Mode) may impact users' emotional well-being, behaviors and experiences through two parallel studies. To study the affe…

    Submitted 4 April, 2025; originally announced April 2025.

  22. arXiv:2504.03758  [pdf, other]

    cs.CY cs.CV cs.GR

    Improved visual-information-driven model for crowd simulation and its modular application

    Authors: Xuanwen Liang, Jiayu Chen, Eric Wai Ming Lee, Wei Xie

    Abstract: Data-driven crowd simulation models offer advantages in enhancing the accuracy and realism of simulations, and improving their generalizability is essential for promoting application. Current data-driven approaches are primarily designed for a single scenario, with very few models validated across more than two scenarios. It is still an open question to develop data-driven crowd simulation models…

    Submitted 11 April, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

  23. arXiv:2504.01153  [pdf, other]

    cs.HC cs.AI cs.LG

    Catch Me if You Search: When Contextual Web Search Results Affect the Detection of Hallucinations

    Authors: Mahjabin Nahar, Eun-Ju Lee, Jin Won Park, Dongwon Lee

    Abstract: While we increasingly rely on large language models (LLMs) for various tasks, these models are known to produce inaccurate content or 'hallucinations' with potentially disastrous consequences. The recent integration of web search results into LLMs prompts the question of whether people utilize them to verify the generated content, thereby accurately detecting hallucinations. An online experiment (…

    Submitted 6 May, 2025; v1 submitted 1 April, 2025; originally announced April 2025.

  24. arXiv:2504.01141  [pdf, other]

    cs.DC

    A Preliminary Model of Coordination-free Consistency

    Authors: Shulu Li, Edward A. Lee

    Abstract: Building consistent distributed systems has largely depended on complex coordination strategies that are not only tricky to implement, but also take a toll on performance as they require nodes to wait for coordination messages. In this paper, we explore the conditions under which no coordination is required to guarantee consistency. We present a simple and succinct theoretical model for distribute…

    Submitted 1 April, 2025; originally announced April 2025.

    ACM Class: F.0; C.2.4

  25. arXiv:2503.22582  [pdf, other]

    cs.CL

    Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation

    Authors: Sarubi Thillainathan, Songchen Yuan, En-Shiun Annie Lee, Sanath Jayasena, Surangika Ranathunga

    Abstract: Fine-tuning multilingual sequence-to-sequence large language models (msLLMs) has shown promise in developing neural machine translation (NMT) systems for low-resource languages (LRLs). However, conventional single-stage fine-tuning methods struggle in extremely low-resource NMT settings, where training data is very limited. This paper contributes to artificial intelligence by proposing two approac…

    Submitted 28 March, 2025; originally announced March 2025.

  26. Solving the Correlation Cluster LP in Sublinear Time

    Authors: Nairen Cao, Vincent Cohen-Addad, Shi Li, Euiwoong Lee, David Rasmussen Lolck, Alantha Newman, Mikkel Thorup, Lukas Vogl, Shuyi Yan, Hanwen Zhang

    Abstract: Correlation Clustering is a fundamental and widely-studied problem in unsupervised learning and data mining. The input is a graph and the goal is to construct a clustering minimizing the number of inter-cluster edges plus the number of missing intra-cluster edges. CCL+24 introduced the cluster LP for Correlation Clustering, which they argued captures the problem much more succinctly than previou…

    Submitted 31 March, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

  27. arXiv:2503.20020  [pdf, other]

    cs.RO

    Gemini Robotics: Bringing AI into the Physical World

    Authors: Gemini Robotics Team, Saminda Abeyruwan, Joshua Ainslie, Jean-Baptiste Alayrac, Montserrat Gonzalez Arenas, Travis Armstrong, Ashwin Balakrishna, Robert Baruch, Maria Bauza, Michiel Blokzijl, Steven Bohez, Konstantinos Bousmalis, Anthony Brohan, Thomas Buschmann, Arunkumar Byravan, Serkan Cabi, Ken Caluwaerts, Federico Casarini, Oscar Chang, Jose Enrique Chen, Xi Chen, Hao-Tien Lewis Chiang, Krzysztof Choromanski, David D'Ambrosio, Sudeep Dasari , et al. (93 additional authors not shown)

    Abstract: Recent advancements in large multimodal models have led to the emergence of remarkable generalist capabilities in digital domains, yet their translation to physical agents such as robots remains a significant challenge. This report introduces a new family of AI models purposefully designed for robotics and built upon the foundation of Gemini 2.0. We present Gemini Robotics, an advanced Vision-Lang…

    Submitted 25 March, 2025; originally announced March 2025.

  28. arXiv:2503.17473  [pdf, other]

    cs.HC

    How AI and Human Behaviors Shape Psychosocial Effects of Chatbot Use: A Longitudinal Randomized Controlled Study

    Authors: Cathy Mengying Fang, Auren R. Liu, Valdemar Danry, Eunhae Lee, Samantha W. T. Chan, Pat Pataranutaporn, Pattie Maes, Jason Phang, Michael Lampe, Lama Ahmad, Sandhini Agarwal

    Abstract: AI chatbots, especially those with voice capabilities, have become increasingly human-like, with more users seeking emotional support and companionship from them. Concerns are rising about how such interactions might impact users' loneliness and socialization with real people. We conducted a four-week randomized, controlled, IRB-approved experiment (n=981, >300K messages) to investigate how AI cha…

    Submitted 21 March, 2025; originally announced March 2025.

  29. arXiv:2503.16461  [pdf, other]

    cs.HC cs.AI

    Rank-O-ToM: Unlocking Emotional Nuance Ranking to Enhance Affective Theory-of-Mind

    Authors: JiHyun Kim, JuneHyoung Kwon, MiHyeon Kim, Eunju Lee, YoungBin Kim

    Abstract: Facial Expression Recognition (FER) plays a foundational role in enabling AI systems to interpret emotional nuances, a critical aspect of affective Theory of Mind (ToM). However, existing models often struggle with poor calibration and a limited capacity to capture emotional intensity and complexity. To address this, we propose Ranking the Emotional Nuance for Theory of Mind (Rank-O-ToM), a framew…

    Submitted 24 February, 2025; originally announced March 2025.

    Comments: Accepted to AAAI 2025 Theory of Mind for AI (ToM4AI) Workshop (Spotlight) JiHyun Kim, JuneHyoung Kwon, MiHyeon Kim, and Eunju Lee contributed equally as co-first authors. YoungBin Kim is the corresponding author

  30. arXiv:2503.15897  [pdf, other]

    cs.CV cs.LG

    Learning 3D Scene Analogies with Neural Contextual Scene Maps

    Authors: Junho Kim, Gwangtak Bae, Eun Sun Lee, Young Min Kim

    Abstract: Understanding scene contexts is crucial for machines to perform tasks and adapt prior knowledge in unseen or noisy 3D environments. As data-driven learning is intractable to comprehensively encapsulate diverse ranges of layouts and open spaces, we propose teaching machines to identify relational commonalities in 3D spaces. Instead of focusing on point-wise or object-wise representations, we introd…

    Submitted 20 March, 2025; originally announced March 2025.

  31. arXiv:2503.15377  [pdf]

    cs.DC

    Genomic data processing with GenomeFlow

    Authors: Junseok Park, Eduardo A. Maury, Changhoon Oh, Donghoon Shin, Danielle Denisko, Eunjung Alice Lee

    Abstract: Advances in genome sequencing technologies generate massive amounts of sequence data that are increasingly analyzed and shared through public repositories. On-demand infrastructure services on cloud computing platforms enable the processing of such large-scale genomic sequence data in distributed processing environments with a significant reduction in analysis time. However, parallel processing on…

    Submitted 19 March, 2025; originally announced March 2025.

  32. arXiv:2503.13834  [pdf, other]

    cs.CV

    See-Saw Modality Balance: See Gradient, and Sew Impaired Vision-Language Balance to Mitigate Dominant Modality Bias

    Authors: JuneHyoung Kwon, MiHyeon Kim, Eunju Lee, Juhwan Choi, YoungBin Kim

    Abstract: Vision-language (VL) models have demonstrated strong performance across various tasks. However, these models often rely on a specific modality for predictions, leading to "dominant modality bias." This bias significantly hurts performance, especially when one modality is impaired. In this study, we analyze model behavior under dominant modality bias and theoretically show that unaligned gradients…

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: Accepted to NAACL 2025 Main

  33. arXiv:2503.12524  [pdf, other]

    cs.CL cs.AI

    EXAONE Deep: Reasoning Enhanced Language Models

    Authors: LG AI Research, Kyunghoon Bae, Eunbi Choi, Kibong Choi, Stanley Jungkyu Choi, Yemuk Choi, Seokhee Hong, Junwon Hwang, Hyojin Jeon, Kijeong Jeon, Gerrard Jeongwon Jo, Hyunjik Jo, Jiyeon Jung, Hyosang Kim, Joonkee Kim, Seonghwan Kim, Soyeon Kim, Sunkyoung Kim, Yireun Kim, Yongil Kim, Youchul Kim, Edward Hwayoung Lee, Haeju Lee, Honglak Lee, Jinsik Lee , et al. (7 additional authors not shown)

    Abstract: We present EXAONE Deep series, which exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks. We train our models mainly on the reasoning-specialized dataset that incorporates long streams of thought processes. Evaluation results show that our smaller models, EXAONE Deep 2.4B and 7.8B, outperform other models of comparable size, while the largest model, EXAO…

    Submitted 19 March, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2412.04862, arXiv:2408.03541

  34. arXiv:2503.11691  [pdf]

    cs.ET cond-mat.mes-hall

    Direct-Write Printed Contacts to Layered and 2D Materials

    Authors: Sharadh Jois, Erica Lee, Philip Li, Tsegereda Esatu, Jason Fleischer, Edwin Quinn, Genda Gu, Vadym Kulichenko, Luis Balicas, Son T. Le, Samuel W. LaGasse, Aubrey T. Hanbicki, Adam L. Friedman

    Abstract: Advancements in fabrication methods have shaped new computing device technologies. Among these methods, depositing electrical contacts to the channel material is fundamental to device characterization. Novel layered and two-dimensional (2D) materials are promising for next-generation computing electronic channel materials. Direct-write printing of conductive inks is introduced as a surprisingly ef…

    Submitted 10 April, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

  35. arXiv:2503.10972  [pdf, ps, other]

    cs.DS

    A $(2+\varepsilon)$-Approximation Algorithm for Metric $k$-Median

    Authors: Vincent Cohen-Addad, Fabrizio Grandoni, Euiwoong Lee, Chris Schwiegelshohn, Ola Svensson

    Abstract: In the classical NP-hard metric $k$-median problem, we are given a set of $n$ clients and centers with metric distances between them, along with an integer parameter $k\geq 1$. The objective is to select a subset of $k$ open centers that minimizes the total distance from each client to its closest open center. In their seminal work, Jain, Mahdian, Markakis, Saberi, and Vazirani presented the Gre…

    Submitted 13 March, 2025; originally announced March 2025.

  36. arXiv:2503.08311  [pdf, other]

    cs.DC cs.LG

    Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference

    Authors: Pol G. Recasens, Ferran Agullo, Yue Zhu, Chen Wang, Eun Kyung Lee, Olivier Tardieu, Jordi Torres, Josep Ll. Berral

    Abstract: Large language models have been widely adopted across different tasks, but their auto-regressive generation nature often leads to inefficient resource utilization during inference. While batching is commonly used to increase throughput, performance gains plateau beyond a certain batch size, especially with smaller models, a phenomenon that existing literature typically explains as a shift to the c…

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: Pol G. Recasens, Ferran Agullo: equal contribution

  37. arXiv:2503.03207  [pdf, other]

    cs.PL

    PolyVer: A Compositional Approach for Polyglot System Modeling and Verification

    Authors: Pei-Wei Chen, Shaokai Lin, Adwait Godbole, Ramneet Singh, Elizabeth Polgreen, Edward A. Lee, Sanjit A. Seshia

    Abstract: Several software systems are polyglot; that is, they comprise programs implemented in a combination of programming languages. Verifiers that directly run on mainstream programming languages are currently customized for single languages. Thus, to verify polyglot systems, one usually translates them into a common verification language or formalism on which the verifier runs. In this paper, we presen…

    Submitted 12 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: 27 pages, 8 figures; acknowledgements added, typos fixed

  38. arXiv:2502.12959  [pdf, other]

    cs.CL cs.AI

    AlignFreeze: Navigating the Impact of Realignment on the Layers of Multilingual Models Across Diverse Languages

    Authors: Steve Bakos, Félix Gaschi, David Guzmán, Riddhi More, Kelly Chutong Li, En-Shiun Annie Lee

    Abstract: Realignment techniques are often employed to enhance cross-lingual transfer in multilingual language models; still, they can sometimes degrade performance in languages that differ significantly from the fine-tuned source language. This paper introduces AlignFreeze, a method that freezes either the layers' lower half or upper half during realignment. Through controlled experiments on 4 tasks, 3 mod…

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 24 pages, 2 figures, to be published in Proceedings of NAACL 2025

  39. arXiv:2502.10460  [pdf, other]

    cs.LG

    SenDaL: An Effective and Efficient Calibration Framework of Low-Cost Sensors for Daily Life

    Authors: Seokho Ahn, Hyungjin Kim, Euijong Lee, Young-Duk Seo

    Abstract: The collection of accurate and noise-free data is a crucial part of Internet of Things (IoT)-controlled environments. However, the data collected from various sensors in daily life often suffer from inaccuracies. Additionally, IoT-controlled devices with low-cost sensors lack sufficient hardware resources to employ conventional deep-learning models. To overcome this limitation, we propose sensors…

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: Accepted by IEEE IoTJ

  40. arXiv:2502.09814  [pdf, other]

    cs.CL

    INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages

    Authors: Hao Yu, Jesujoba O. Alabi, Andiswa Bukula, Jian Yun Zhuang, En-Shiun Annie Lee, Tadesse Kebede Guge, Israel Abebe Azime, Happy Buzaaba, Blessing Kudzaishe Sibanda, Godson K. Kalipe, Jonathan Mukiibi, Salomon Kabongo Kabenamualu, Mmasibidi Setaka, Lolwethu Ndolela, Nkiruka Odu, Rooweither Mabuya, Shamsuddeen Hassan Muhammad, Salomey Osei, Sokhar Samb, Juliet W. Murage, Dietrich Klakow, David Ifeoluwa Adelani

    Abstract: Slot-filling and intent detection are well-established tasks in Conversational AI. However, current large-scale benchmarks for these tasks often exclude evaluations of low-resource languages and rely on translations from English benchmarks, thereby predominantly reflecting Western-centric concepts. In this paper, we introduce Injongo -- a multicultural, open-source benchmark dataset for 16 African…

    Submitted 13 February, 2025; originally announced February 2025.

  41. HyGEN: Regularizing Negative Hyperedge Generation for Accurate Hyperedge Prediction

    Authors: Song Kyung Yu, Da Eun Lee, Yunyong Ko, Sang-Wook Kim

    Abstract: Hyperedge prediction is a fundamental task to predict future high-order relations based on the observed network structure. Existing hyperedge prediction methods, however, suffer from the data sparsity problem. To alleviate this problem, negative sampling methods can be used, which leverage non-existing hyperedges as contrastive information for model training. However, the following important chall…

    Submitted 18 February, 2025; v1 submitted 9 February, 2025; originally announced February 2025.

    Comments: 5 pages, 4 figures, 3 tables, the Web Conference (WWW) 2025

  42. arXiv:2501.18105  [pdf, other]

    cs.DS

    Facility Location on High-dimensional Euclidean Spaces

    Authors: Euiwoong Lee, Kijun Shin

    Abstract: Recent years have seen great progress in the approximability of fundamental clustering and facility location problems on high-dimensional Euclidean spaces, including $k$-Means and $k$-Median. While they admit strictly better approximation ratios than their general metric versions, their approximation ratios are still higher than the hardness ratios for general metrics, leaving the possibility that…

    Submitted 29 January, 2025; originally announced January 2025.

    Comments: ITCS '25

  43. arXiv:2501.17260  [pdf, other]

    cs.CV cs.AI cs.LG

    ViT-2SPN: Vision Transformer-based Dual-Stream Self-Supervised Pretraining Networks for Retinal OCT Classification

    Authors: Mohammadreza Saraei, Igor Kozak, Eung-Joo Lee

    Abstract: Optical Coherence Tomography (OCT) is a non-invasive imaging modality essential for diagnosing various eye diseases. Despite its clinical significance, developing OCT-based diagnostic tools faces challenges, such as limited public datasets, sparse annotations, and privacy concerns. Although deep learning has made progress in automating OCT analysis, these challenges remain unresolved. To address t…

    Submitted 28 January, 2025; originally announced January 2025.

  44. arXiv:2501.16724  [pdf, other]

    cs.CV

    B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing

    Authors: Yoojin Jang, Junsu Kim, Hayeon Kim, Eun-ki Lee, Eun-sol Kim, Seungryul Baek, Jaejun Yoo

    Abstract: Human-object interaction (HOI) is an essential problem in artificial intelligence (AI) which aims to understand the visual world that involves complex relationships between humans and objects. However, current benchmarks such as HICO-DET face the following limitations: (1) severe class imbalance and (2) varying number of train and test sets for certain classes. These issues can potentially lead to…

    Submitted 28 January, 2025; originally announced January 2025.

  45. arXiv:2501.16346  [pdf, other]

    cs.LG cs.AI

    Self-supervised Graph Transformer with Contrastive Learning for Brain Connectivity Analysis towards Improving Autism Detection

    Authors: Yicheng Leng, Syed Muhammad Anwar, Islem Rekik, Sen He, Eung-Joo Lee

    Abstract: Functional Magnetic Resonance Imaging (fMRI) provides useful insights into brain function during both task and rest. Representing fMRI data using correlation matrices is found to be a reliable method of analyzing the inherent connectivity of the brain in the resting and active states. Graph Neural Networks (GNNs) have been widely used for brain network analysis due to their inherent explainabil…

    Submitted 18 January, 2025; originally announced January 2025.

  46. A Framework for Mining Collectively-Behaving Bots in MMORPGs

    Authors: Hyunsoo Kim, Jun Hee Kim, Jaeman Son, Jihoon Song, Eunjo Lee

    Abstract: In MMORPGs (Massively Multiplayer Online Role-Playing Games), abnormal players (bots) using unauthorized automated programs to carry out pre-defined behaviors systematically and repeatedly are commonly observed. Bots usually engage in these activities to gain in-game money, which they eventually trade for real money outside the game. Such abusive activities negatively impact the in-game experience…

    Submitted 1 July, 2025; v1 submitted 15 January, 2025; originally announced January 2025.

    Journal ref: Published in: Proceedings of the International Conference on Pattern Recognition (ICPR 2024)

  47. arXiv:2501.09802  [pdf]

    cs.CR

    W3ID: A Quantum Computing-Secure Digital Identity System Redefining Standards for Web3 and Digital Twins

    Authors: Joseph Yun, Eli Lifton, Eunseo Lee, Yohan Yun, Abigail Song, Joshua Lee, Cristian Jimenez-Bert, Benedict Song, Yejun Lee, Alex Seo, Sijung Yun

    Abstract: The rapid advancements in quantum computing present significant threats to existing encryption standards and internet security. Simultaneously, the advent of Web 3.0 marks a transformative era in internet history, emphasizing enhanced data security, decentralization, and user ownership. This white paper introduces the W3ID, an abbreviation of Web3 standard meeting universal digital ID, which is a…

    Submitted 16 January, 2025; originally announced January 2025.

  48. arXiv:2412.19522  [pdf, other]

    cs.CL

    Exploiting Domain-Specific Parallel Data on Multilingual Language Models for Low-resource Language Translation

    Authors: Surangika Ranathungaa, Shravan Nayak, Shih-Ting Cindy Huang, Yanke Mao, Tong Su, Yun-Hsiang Ray Chan, Songchen Yuan, Anthony Rinaldi, Annie En-Shiun Lee

    Abstract: Neural Machine Translation (NMT) systems built on multilingual sequence-to-sequence Language Models (msLMs) fail to deliver expected results when the amount of parallel data for a language, as well as the language's representation in the model, is limited. This restricts the capabilities of domain-specific NMT systems for low-resource languages (LRLs). As a solution, parallel data from auxiliary d…

    Submitted 27 December, 2024; originally announced December 2024.

  49. arXiv:2412.13558  [pdf, other]

    eess.IV cs.CL cs.CV cs.LG

    Read Like a Radiologist: Efficient Vision-Language Model for 3D Medical Imaging Interpretation

    Authors: Changsun Lee, Sangjoon Park, Cheong-Il Shin, Woo Hee Choi, Hyun Jeong Park, Jeong Eun Lee, Jong Chul Ye

    Abstract: Recent medical vision-language models (VLMs) have shown promise in 2D medical image interpretation. However, extending them to 3D medical imaging has been challenging due to computational complexities and data scarcity. Although a few recent VLMs specified for 3D medical imaging have emerged, all are limited to learning volumetric representation of a 3D medical image as a set of sub-volumetric feat…

    Submitted 18 December, 2024; originally announced December 2024.

  50. arXiv:2412.04862  [pdf, other]

    cs.CL

    EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

    Authors: LG AI Research, Soyoung An, Kyunghoon Bae, Eunbi Choi, Kibong Choi, Stanley Jungkyu Choi, Seokhee Hong, Junwon Hwang, Hyojin Jeon, Gerrard Jeongwon Jo, Hyunjik Jo, Jiyeon Jung, Yountae Jung, Hyosang Kim, Joonkee Kim, Seonghwan Kim, Soyeon Kim, Sunkyoung Kim, Yireun Kim, Yongil Kim, Youchul Kim, Edward Hwayoung Lee, Haeju Lee, Honglak Lee, Jinsik Lee, et al. (8 additional authors not shown)

    Abstract: This technical report introduces the EXAONE 3.5 instruction-tuned language models, developed and released by LG AI Research. The EXAONE 3.5 language models are offered in three configurations: 32B, 7.8B, and 2.4B. These models feature several standout capabilities: 1) exceptional instruction following capabilities in real-world scenarios, achieving the highest scores across seven benchmarks, 2) ou…

    Submitted 9 December, 2024; v1 submitted 6 December, 2024; originally announced December 2024.

    Comments: arXiv admin note: text overlap with arXiv:2408.03541