Skip to main content

Showing 1–50 of 285 results for author: Lee, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3278 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2507.05415  [pdf, ps, other

    cs.CR

    Layered, Overlapping, and Inconsistent: A Large-Scale Analysis of the Multiple Privacy Policies and Controls of U.S. Banks

    Authors: Lu Xian, Van Tran, Lauren Lee, Meera Kumar, Yichen Zhang, Florian Schaub

    Abstract: Privacy policies are often complex. An exception is the two-page standardized notice that U.S. financial institutions must provide under the Gramm-Leach-Bliley Act (GLBA). However, banks now operate websites, mobile apps, and other services that involve complex data sharing practices that require additional privacy notices and do-not-sell opt-outs. We conducted a large-scale analysis of how U.S. b… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: Accepted for publication in CCS 2025. This is a pre-publication version

  3. arXiv:2507.02193  [pdf

    q-bio.TO cs.CE

    A Multi-Scale Finite Element Method for Investigating Fiber Remodeling in Hypertrophic Cardiomyopathy

    Authors: Mohammad Mehri, Kenneth S. Campbell, Lik Chuan Lee, Jonathan F. Wenk

    Abstract: A significant hallmark of hypertrophic cardiomyopathy (HCM) is fiber disarray, which is associated with various cardiac events such as heart failure. Quantifying fiber disarray remains critical for understanding the disease s complex pathophysiology. This study investigates the role of heterogeneous HCM-induced cellular abnormalities in the development of fiber disarray and their subsequent impact… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

  4. arXiv:2506.11650  [pdf, ps, other

    cs.RO cs.AI

    Robot Context Protocol (RCP): A Runtime-Agnostic Interface for Agent-Aware Robot Control

    Authors: Lambert Lee, Joshua Lau

    Abstract: The Robot Context Protocol (RCP) is a lightweight, middleware-agnostic communication protocol designed to simplify the complexity of robotic systems and enable seamless interaction between robots, users, and autonomous agents. RCP provides a unified and semantically meaningful interface that decouples client-facing operations from backend implementations, supporting a wide range of deployment envi… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  5. arXiv:2506.08987  [pdf, ps, other

    cs.CE

    Rapid cardiac activation prediction for cardiac resynchronization therapy planning using geometric deep learning

    Authors: Ehsan Naghavi, Haifeng Wang, Vahid Ziaei Rad, Julius Guccione, Ghassan Kassab, Vishnu Boddeti, Seungik Baek, Lik-Chuan Lee

    Abstract: Cardiac resynchronization therapy (CRT) is a common intervention for patients with dyssynchronous heart failure, yet approximately one-third of recipients fail to respond due to suboptimal lead placement. Identifying optimal pacing sites remains challenging, largely due to patient-specific anatomical variability and the limitations of current individualized planning strategies. In a step towards c… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  6. arXiv:2506.08917  [pdf, ps, other

    quant-ph cs.AI

    Quantum Adiabatic Generation of Human-Like Passwords

    Authors: Sascha Mücke, Raoul Heese, Thore Gerlach, David Biesner, Loong Kuan Lee, Nico Piatkowski

    Abstract: Generative Artificial Intelligence (GenAI) for Natural Language Processing (NLP) is the predominant AI technology to date. An important perspective for Quantum Computing (QC) is the question whether QC has the potential to reduce the vast resource requirements for training and operating GenAI models. While large-scale generative NLP tasks are currently out of reach for practical quantum computers,… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 9 pages, 4 figures

  7. arXiv:2506.03941  [pdf, ps, other

    cs.CL cs.AI cs.CY physics.soc-ph

    Hanging in the Balance: Pivotal Moments in Crisis Counseling Conversations

    Authors: Vivian Nguyen, Lillian Lee, Cristian Danescu-Niculescu-Mizil

    Abstract: During a conversation, there can come certain moments where its outcome hangs in the balance. In these pivotal moments, how one responds can put the conversation on substantially different trajectories leading to significantly different outcomes. Systems that can detect when such moments arise could assist conversationalists in domains with highly consequential outcomes, such as mental health cris… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: To appear in the Proceedings of ACL 2025. Code and demo available in ConvoKit (convokit.cornell.edu)

  8. arXiv:2506.03373  [pdf, ps, other

    cs.CV cs.AI

    A Foundation Model for Spatial Proteomics

    Authors: Muhammad Shaban, Yuzhou Chang, Huaying Qiu, Yao Yu Yeo, Andrew H. Song, Guillaume Jaume, Yuchen Wang, Luca L. Weishaupt, Tong Ding, Anurag Vaidya, Abdallah Lamane, Daniel Shao, Mohammed Zidane, Yunhao Bai, Paige McCallum, Shuli Luo, Wenrui Wu, Yang Wang, Precious Cramer, Chi Ngai Chan, Pierre Stephan, Johanna Schaffenrath, Jia Le Lee, Hendrik A. Michel, Caiwei Tian , et al. (35 additional authors not shown)

    Abstract: Foundation models have begun to transform image analysis by acting as pretrained generalist backbones that can be adapted to many tasks even when post-training data are limited, yet their impact on spatial proteomics, imaging that maps proteins at single-cell resolution, remains limited. Here, we introduce KRONOS, a foundation model built for spatial proteomics. KRONOS was trained in a self-superv… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  9. arXiv:2505.11769  [pdf, ps, other

    cs.CV

    Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Boosting Off-Road Segmentation via Photometric Distortion and Exponential Moving Average

    Authors: Wonjune Kim, Lae-kyoung Lee, Su-Yong An

    Abstract: We report on the application of a high-capacity semantic segmentation pipeline to the GOOSE 2D Semantic Segmentation Challenge for unstructured off-road environments. Using a FlashInternImage-B backbone together with a UPerNet decoder, we adapt established techniques, rather than designing new ones, to the distinctive conditions of off-road scenes. Our training recipe couples strong photometric di… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Winners of the GOOSE 2D Semantic Segmentation Challenge at the IEEE ICRA Workshop on Field Robotics 2025

  10. arXiv:2505.01364  [pdf

    cs.CV

    Monitoring morphometric drift in lifelong learning segmentation of the spinal cord

    Authors: Enamundram Naga Karthik, Sandrine Bédard, Jan Valošek, Christoph S. Aigner, Elise Bannier, Josef Bednařík, Virginie Callot, Anna Combes, Armin Curt, Gergely David, Falk Eippert, Lynn Farner, Michael G Fehlings, Patrick Freund, Tobias Granberg, Cristina Granziera, RHSCIR Network Imaging Group, Ulrike Horn, Tomáš Horák, Suzanne Humphreys, Markus Hupp, Anne Kerbrat, Nawal Kinany, Shannon Kolind, Petr Kudlička , et al. (31 additional authors not shown)

    Abstract: Morphometric measures derived from spinal cord segmentations can serve as diagnostic and prognostic biomarkers in neurological diseases and injuries affecting the spinal cord. While robust, automatic segmentation methods to a wide variety of contrasts and pathologies have been developed over the past few years, whether their predictions are stable as the model is updated using new datasets has not… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  11. arXiv:2504.21477  [pdf, other

    cs.HC

    A Comprehensive Survey of Electrical Stimulation Haptic Feedback in Human-Computer Interaction

    Authors: Simin Yang, Xian Wang, Yang Li, Lik-Hang Lee, Tristan Camille Braud, Pan Hui

    Abstract: Haptic perception and feedback play a pivotal role in interactive experiences, forming an essential component of human-computer interaction (HCI). In recent years, the field of haptic interaction has witnessed significant advancements, particularly in the area of electrical haptic feedback, driving innovation across various domains. To gain a comprehensive understanding of the current state of res… ▽ More

    Submitted 7 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

    Comments: 23 pages, 7 figures

  12. arXiv:2504.13845  [pdf, other

    cs.HC

    Towards Enhanced Learning through Presence: A Systematic Review of Presence in Virtual Reality Across Tasks and Disciplines

    Authors: Zheng Wei, Junxiang Liao, Lik-Hang Lee, Huamin Qu, Xian Xu

    Abstract: The rising interest in Virtual Reality (VR) technology has sparked a desire to create immersive learning platforms capable of handling various tasks across environments. Through immersive interfaces, users can engage deeply with virtual environments, enhancing both learning outcomes and task performance. In fields such as education, engineering, and collaboration, presence has emerged as a critica… ▽ More

    Submitted 8 February, 2025; originally announced April 2025.

  13. arXiv:2504.12419  [pdf, other

    cs.LG math.OC quant-ph

    Standardization of Multi-Objective QUBOs

    Authors: Loong Kuan Lee, Thore Thassilo Gerlach, Nico Piatkowski

    Abstract: Multi-objective optimization involving Quadratic Unconstrained Binary Optimization (QUBO) problems arises in various domains. A fundamental challenge in this context is the effective balancing of multiple objectives, each potentially operating on very different scales. This imbalance introduces complications such as the selection of appropriate weights when scalarizing multiple objectives into a s… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 7 pages, 3 figures

  14. arXiv:2504.00174  [pdf, other

    cs.LG cs.AI

    MetaCLBench: Meta Continual Learning Benchmark on Resource-Constrained Edge Devices

    Authors: Sijia Li, Young D. Kwon, Lik-Hang Lee, Pan Hui

    Abstract: Meta-Continual Learning (Meta-CL) has emerged as a promising approach to minimize manual labeling efforts and system resource requirements by enabling Continual Learning (CL) with limited labeled samples. However, while existing methods have shown success in image-based tasks, their effectiveness remains unexplored for sequential time-series data from sensor systems, particularly audio inputs. To… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  15. arXiv:2502.17657  [pdf, other

    stat.AP cs.AI

    StatLLM: A Dataset for Evaluating the Performance of Large Language Models in Statistical Analysis

    Authors: Xinyi Song, Lina Lee, Kexin Xie, Xueying Liu, Xinwei Deng, Yili Hong

    Abstract: The coding capabilities of large language models (LLMs) have opened up new opportunities for automatic statistical analysis in machine learning and data science. However, before their widespread adoption, it is crucial to assess the accuracy of code generated by LLMs. A major challenge in this evaluation lies in the absence of a benchmark dataset for statistical code (e.g., SAS and R). To fill in… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: 25 pages, 7 figures

  16. arXiv:2502.16021  [pdf, ps, other

    cs.DS cs.LG

    Learning Neural Networks with Distribution Shift: Efficiently Certifiable Guarantees

    Authors: Gautam Chandrasekaran, Adam R. Klivans, Lin Lin Lee, Konstantinos Stavropoulos

    Abstract: We give the first provably efficient algorithms for learning neural networks with distribution shift. We work in the Testable Learning with Distribution Shift framework (TDS learning) of Klivans et al. (2024), where the learner receives labeled examples from a training distribution and unlabeled examples from a test distribution and must either output a hypothesis with low test error or reject if… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: To appear in The Thirteenth International Conference on Learning Representations (ICLR 2025) 38 pages

  17. arXiv:2502.13117  [pdf, other

    stat.AP cs.AI

    Performance Evaluation of Large Language Models in Statistical Programming

    Authors: Xinyi Song, Kexin Xie, Lina Lee, Ruizhe Chen, Jared M. Clark, Hao He, Haoran He, Jie Min, Xinlei Zhang, Simin Zheng, Zhiyang Zhang, Xinwei Deng, Yili Hong

    Abstract: The programming capabilities of large language models (LLMs) have revolutionized automatic code generation and opened new avenues for automatic statistical analysis. However, the validity and quality of these generated codes need to be systematically evaluated before they can be widely adopted. Despite their growing prominence, a comprehensive evaluation of statistical code generated by LLMs remai… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 27 pages, 8 figures

  18. arXiv:2502.12944  [pdf, other

    cs.LG

    Performance of Zero-Shot Time Series Foundation Models on Cloud Data

    Authors: William Toner, Thomas L. Lee, Artjom Joosen, Rajkarn Singh, Martin Asenov

    Abstract: Time series foundation models (FMs) have emerged as a popular paradigm for zero-shot multi-domain forecasting. FMs are trained on numerous diverse datasets and claim to be effective forecasters across multiple different time series domains, including cloud data. In this work we investigate this claim, exploring the effectiveness of FMs on cloud data. We demonstrate that many well-known FMs fail to… ▽ More

    Submitted 19 May, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: 5 pages, presented at the "I Can't Believe It's Not Better" workshop at ICLR 2025

  19. arXiv:2502.12920  [pdf, other

    cs.LG stat.ML

    Lightweight Online Adaption for Time Series Foundation Model Forecasts

    Authors: Thomas L. Lee, William Toner, Rajkarn Singh, Artjom Joosen, Martin Asenov

    Abstract: Foundation models (FMs) have emerged as a promising approach for time series forecasting. While effective, FMs typically remain fixed during deployment due to the high computational costs of learning them online. Consequently, deployed FMs fail to adapt their forecasts to current data characteristics, despite the availability of online feedback from newly arriving data. This raises the question of… ▽ More

    Submitted 26 March, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: 8 pages, Preprint

  20. arXiv:2502.11451  [pdf, other

    cs.CL

    From Personas to Talks: Revisiting the Impact of Personas on LLM-Synthesized Emotional Support Conversations

    Authors: Shenghan Wu, Yang Deng, Yimo Zhu, Wynne Hsu, Mong Li Lee

    Abstract: The rapid advancement of Large Language Models (LLMs) has revolutionized the generation of emotional support conversations (ESC), offering scalable solutions with reduced costs and enhanced data privacy. This paper explores the role of personas in the creation of ESC by LLMs. Our research utilizes established psychological frameworks to measure and infuse persona traits into LLMs, which then gener… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  21. arXiv:2502.03660  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.LG

    Energy & Force Regression on DFT Trajectories is Not Enough for Universal Machine Learning Interatomic Potentials

    Authors: Santiago Miret, Kin Long Kelvin Lee, Carmelo Gonzales, Sajid Mannan, N. M. Anoop Krishnan

    Abstract: Universal Machine Learning Interactomic Potentials (MLIPs) enable accelerated simulations for materials discovery. However, current research efforts fail to impactfully utilize MLIPs due to: 1. Overreliance on Density Functional Theory (DFT) for MLIP training data creation; 2. MLIPs' inability to reliably and accurately perform large-scale molecular dynamics (MD) simulations for diverse materials;… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  22. arXiv:2502.03638  [pdf, other

    cond-mat.mtrl-sci cs.LG

    SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models

    Authors: Daniel Levy, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Qiang Zhu, Kin Long Kelvin Lee, Mikhail Galkin, Santiago Miret, Siamak Ravanbakhsh

    Abstract: Generating novel crystalline materials has the potential to lead to advancements in fields such as electronics, energy storage, and catalysis. The defining characteristic of crystals is their symmetry, which plays a central role in determining their physical properties. However, existing crystal generation methods either fail to generate materials that display the symmetries of real-world crystals… ▽ More

    Submitted 23 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: 24 pages, 10 figures, International Conference on Learning Representations (ICLR) 2025

  23. TeamPortal: Exploring Virtual Reality Collaboration Through Shared and Manipulating Parallel Views

    Authors: Xian Wang, Luyao Shen, Lei Chen, Mingming Fan, Lik-Hang Lee

    Abstract: Virtual Reality (VR) offers a unique collaborative experience, with parallel views playing a pivotal role in Collaborative Virtual Environments by supporting the transfer and delivery of items. Sharing and manipulating partners' views provides users with a broader perspective that helps them identify the targets and partner actions. We proposed TeamPortal accordingly and conducted two user studies… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

  24. arXiv:2501.14728  [pdf, other

    cs.MM cs.CL cs.CV cs.CY

    Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection

    Authors: Zehong Yan, Peng Qi, Wynne Hsu, Mong Li Lee

    Abstract: While large generative artificial intelligence (GenAI) models have achieved significant success, they also raise growing concerns about online information security due to their potential misuse for generating deceptive content. Out-of-context (OOC) multimodal misinformation detection, which often retrieves Web evidence to identify the repurposing of images in false contexts, faces the issue of rea… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 12 pages, 11 figures

  25. arXiv:2501.14568  [pdf, ps, other

    cs.AI quant-ph

    Hybrid Quantum-Classical Multi-Agent Pathfinding

    Authors: Thore Gerlach, Loong Kuan Lee, Frédéric Barbaresco, Nico Piatkowski

    Abstract: Multi-Agent Path Finding (MAPF) focuses on determining conflict-free paths for multiple agents navigating through a shared space to reach specified goal locations. This problem becomes computationally challenging, particularly when handling large numbers of agents, as frequently encountered in practical applications like coordinating autonomous vehicles. Quantum Computing (QC) is a promising candi… ▽ More

    Submitted 9 July, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 11 pages, accepted at ICML 2025

  26. TAACKIT: Track Annotation and Analytics with Continuous Knowledge Integration Tool

    Authors: Lily Lee, Julian Fontes, Andrew Weinert, Laura Schomacker, Daniel Stabile, Jonathan Hou

    Abstract: Machine learning (ML) is a powerful tool for efficiently analyzing data, detecting patterns, and forecasting trends across various domains such as text, audio, and images. The availability of annotation tools to generate reliably annotated data is crucial for advances in ML applications. In the domain of geospatial tracks, the lack of such tools to annotate and validate data impedes rapid and acce… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Journal ref: AIxDKE 2024

  27. arXiv:2412.12540  [pdf, other

    cs.LG physics.chem-ph

    Stiefel Flow Matching for Moment-Constrained Structure Elucidation

    Authors: Austin Cheng, Alston Lo, Kin Long Kelvin Lee, Santiago Miret, Alán Aspuru-Guzik

    Abstract: Molecular structure elucidation is a fundamental step in understanding chemical phenomena, with applications in identifying molecules in natural products, lab syntheses, forensic samples, and the interstellar medium. We consider the task of predicting a molecule's all-atom 3D structure given only its molecular formula and moments of inertia, motivated by the ability of rotational spectroscopy to m… ▽ More

    Submitted 2 March, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: ICLR 2025

  28. arXiv:2411.18623  [pdf, other

    cs.CV

    Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

    Authors: Yueru Jia, Jiaming Liu, Sixiang Chen, Chenyang Gu, Zhilue Wang, Longzan Luo, Lily Lee, Pengwei Wang, Zhongyuan Wang, Renrui Zhang, Shanghang Zhang

    Abstract: 3D geometric information is essential for manipulation tasks, as robots need to perceive the 3D environment, reason about spatial relationships, and interact with intricate spatial configurations. Recent research has increasingly focused on the explicit extraction of 3D features, while still facing challenges such as the lack of large-scale robotic 3D data and the potential loss of spatial geometr… ▽ More

    Submitted 14 December, 2024; v1 submitted 27 November, 2024; originally announced November 2024.

  29. arXiv:2411.11191  [pdf, other

    quant-ph cond-mat.mtrl-sci cs.LG

    Accelerating Quantum Emitter Characterization with Latent Neural Ordinary Differential Equations

    Authors: Andrew H. Proppe, Kin Long Kelvin Lee, Weiwei Sun, Chantalle J. Krajewska, Oliver Tye, Moungi G. Bawendi

    Abstract: Deep neural network models can be used to learn complex dynamics from data and reconstruct sparse or noisy signals, thereby accelerating and augmenting experimental measurements. Evaluating the quantum optical properties of solid-state single-photon emitters is a time-consuming task that typically requires interferometric photon correlation experiments, such as Photon correlation Fourier spectrosc… ▽ More

    Submitted 17 November, 2024; originally announced November 2024.

  30. arXiv:2411.05146  [pdf

    cs.HC

    Break Times: Virtual Reality Art Therapy

    Authors: Yi Rou Yap, Yun Li Lee

    Abstract: This paper presents a Virtual Reality (VR) art therapy known as "Break Times" which aims to enhance students' mental well-being and foster creative expression. The proposed "Break Times" application mimics the art therapy sessions in the VR environment design. Pilot user acceptance test with 10 participants showed a notable reduction in stress levels, with 50% reporting normal stress levels post-i… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Part of proceedings of 6th International Conference AsiaHaptics 2024

  31. arXiv:2411.05133  [pdf

    cs.HC

    Innovative Weight Simulation in Virtual Reality Cube Games: A Pseudo-Haptic Approach

    Authors: Woan Ning Lim, Edric Yi Junn Leong, Yun Li Lee, Kian Meng Yap

    Abstract: This paper presents an innovative pseudo-haptic model for weight simulation in virtual reality (VR) environments. By integrating visual feedback with voluntary exerted force through a passive haptic glove, the model creates haptic illusions of weight perception. Two VR cube games were developed to evaluate the model's effectiveness. The first game assesses participants' ability to discriminate rel… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Part of proceedings of 6th International Conference AsiaHaptics 2024

  32. arXiv:2410.17632  [pdf, other

    cs.CL cs.AI

    LMLPA: Language Model Linguistic Personality Assessment

    Authors: Jingyao Zheng, Xian Wang, Simo Hosio, Xiaoxian Xu, Lik-Hang Lee

    Abstract: Large Language Models (LLMs) are increasingly used in everyday life and research. One of the most common use cases is conversational interactions, enabled by the language generation capabilities of LLMs. Just as between two humans, a conversation between an LLM-powered entity and a human depends on the personality of the conversants. However, measuring the personality of a given LLM is currently a… ▽ More

    Submitted 11 November, 2024; v1 submitted 23 October, 2024; originally announced October 2024.

    ACM Class: I.2

  33. arXiv:2410.14964  [pdf, other

    cs.CL

    ChronoFact: Timeline-based Temporal Fact Verification

    Authors: Anab Maulana Barik, Wynne Hsu, Mong Li Lee

    Abstract: Temporal claims, often riddled with inaccuracies, are a significant challenge in the digital misinformation landscape. Fact-checking systems that can accurately verify such claims are crucial for combating misinformation. Current systems struggle with the complexities of evaluating the accuracy of these claims, especially when they include multiple, overlapping, or recurring events. We introduce a… ▽ More

    Submitted 14 May, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

  34. arXiv:2410.08131  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Deconstructing equivariant representations in molecular systems

    Authors: Kin Long Kelvin Lee, Mikhail Galkin, Santiago Miret

    Abstract: Recent equivariant models have shown significant progress in not just chemical property prediction, but as surrogates for dynamical simulations of molecules and materials. Many of the top performing models in this category are built within the framework of tensor products, which preserves equivariance by restricting interactions and transformations to those that are allowed by symmetry selection r… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted in the Findings track at the AI4Mat workshop, NeurIPS 2024 Vancouver, BC

  35. arXiv:2410.07147  [pdf, other

    cs.CL cs.AI cs.CY

    Taking a turn for the better: Conversation redirection throughout the course of mental-health therapy

    Authors: Vivian Nguyen, Sang Min Jung, Lillian Lee, Thomas D. Hull, Cristian Danescu-Niculescu-Mizil

    Abstract: Mental-health therapy involves a complex conversation flow in which patients and therapists continuously negotiate what should be talked about next. For example, therapists might try to shift the conversation's direction to keep the therapeutic process on track and avoid stagnation, or patients might push the discussion towards issues they want to focus on. How do such patient and therapist redi… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: To appear in the Proceedings of EMNLP (Findings) 2024. Code available at https://convokit.cornell.edu

  36. arXiv:2408.14608  [pdf, other

    cs.LG stat.ML

    Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold

    Authors: Lazar Atanackovic, Xi Zhang, Brandon Amos, Mathieu Blanchette, Leo J. Lee, Yoshua Bengio, Alexander Tong, Kirill Neklyudov

    Abstract: Numerous biological and physical processes can be modeled as systems of interacting entities evolving continuously over time, e.g. the dynamics of communicating cells or physical particles. Learning the dynamics of such systems is essential for predicting the temporal evolution of populations across novel samples and unseen environments. Flow-based models allow for learning these dynamics at the p… ▽ More

    Submitted 3 March, 2025; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: Accepted to ICLR 2025

  37. arXiv:2408.12080  [pdf, other

    eess.SP cs.AI cs.NI

    Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless Positioning

    Authors: Max J. L. Lee, Ju Lin, Li-Ta Hsu

    Abstract: We propose a feasibility study for real-time automated data standardization leveraging Large Language Models (LLMs) to enhance seamless positioning systems in IoT environments. By integrating and standardizing heterogeneous sensor data from smartphones, IoT devices, and dedicated systems such as Ultra-Wideband (UWB), our study ensures data compatibility and improves positioning accuracy using the… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: Accepted at IPIN 2024. To be published in IEEE Xplore

  38. MetaDragonBoat: Exploring Paddling Techniques of Virtual Dragon Boating in a Metaverse Campus

    Authors: Wei He, Xiang Li, Shengtian Xu, Yuzheng Chen, Chan-In Sio, Ge Lin Kan, Lik-Hang Lee

    Abstract: The preservation of cultural heritage, as mandated by the United Nations Sustainable Development Goals (SDGs), is integral to sustainable urban development. This paper focuses on the Dragon Boat Festival, a prominent event in Chinese cultural heritage, and proposes leveraging Virtual Reality (VR), to enhance its preservation and accessibility. Traditionally, participation in the festival's dragon… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 10 pages, accepted at ACM MM 2024

  39. arXiv:2407.16296  [pdf, other

    quant-ph cs.AI

    Quantum Computing for Climate Resilience and Sustainability Challenges

    Authors: Kin Tung Michael Ho, Kuan-Cheng Chen, Lily Lee, Felix Burt, Shang Yu, Po-Heng, Lee

    Abstract: The escalating impacts of climate change and the increasing demand for sustainable development and natural resource management necessitate innovative technological solutions. Quantum computing (QC) has emerged as a promising tool with the potential to revolutionize these critical areas. This review explores the application of quantum machine learning and optimization techniques for climate change… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  40. arXiv:2407.15291  [pdf, other

    cs.IR

    Evidence-Based Temporal Fact Verification

    Authors: Anab Maulana Barik, Wynne Hsu, Mong Li Lee

    Abstract: Automated fact verification plays an essential role in fostering trust in the digital space. Despite the growing interest, the verification of temporal facts has not received much attention in the community. Temporal fact verification brings new challenges where cues of the temporal information need to be extracted and temporal reasoning involving various temporal aspects of the text must be appli… ▽ More

    Submitted 18 August, 2024; v1 submitted 21 July, 2024; originally announced July 2024.

  41. arXiv:2407.00925  [pdf, other

    cs.MM

    SIDQL: An Efficient Keyframe Extraction and Motion Reconstruction Framework in Motion Capture

    Authors: Xuling Zhang, Ziru Zhang, Yuyang Wang, Lik-hang Lee, Pan Hui

    Abstract: Metaverse, which integrates the virtual and physical worlds, has emerged as an innovative paradigm for changing people's lifestyles. Motion capture has become a reliable approach to achieve seamless synchronization of the movements between avatars and human beings, which plays an important role in diverse Metaverse applications. However, due to the continuous growth of data, current communication… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  42. arXiv:2406.15391  [pdf

    cs.CY cs.AI cs.ET

    Examining the Legal Status of Digital Assets as Property: A Comparative Analysis of Jurisdictional Approaches

    Authors: Luke Lee

    Abstract: This paper examines the complex legal landscape surrounding digital assets, analysing how they are defined and regulated as property across various jurisdictions. As digital assets such as cryptocurrencies and non-fungible tokens (NFTs) increasingly integrate with global economies, their intangible nature presents unique challenges to traditional property law concepts, necessitating a re-evaluatio… ▽ More

    Submitted 26 April, 2024; originally announced June 2024.

    Comments: 16 pages

  43. arXiv:2406.11886  [pdf, other

    cs.LG cs.AI cs.CE q-fin.CP

    Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns

    Authors: Haoren Zhu, Pengfei Zhao, Wilfred Siu Hung NG, Dik Lun Lee

    Abstract: Financial assets exhibit complex dependency structures, which are crucial for investors to create diversified portfolios to mitigate risk in volatile financial markets. To explore the financial asset dependencies dynamics, we propose a novel approach that models the dependencies of assets as an Asset Dependency Matrix (ADM) and treats the ADM sequences as image sequences. This allows us to leverag… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  44. arXiv:2405.18047  [pdf, other

    cs.LG cs.AI cs.DC

    2BP: 2-Stage Backpropagation

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings

    Abstract: As Deep Neural Networks (DNNs) grow in size and complexity, they often exceed the memory capacity of a single accelerator, necessitating the sharding of model parameters across multiple accelerators. Pipeline parallelism is a commonly used sharding strategy for training large DNNs. However, current implementations of pipeline parallelism are being unintentionally bottlenecked by the automatic diff… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  45. Cross-Domain Feature Augmentation for Domain Generalization

    Authors: Yingnan Liu, Yingtian Zou, Rui Qiao, Fusheng Liu, Mong Li Lee, Wynne Hsu

    Abstract: Domain generalization aims to develop models that are robust to distribution shifts. Existing methods focus on learning invariance across domains to enhance model robustness, and data augmentation has been widely used to learn invariant predictors, with most methods performing augmentation in the input space. However, augmentation in the input space has limited diversity whereas in the feature spa… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted to the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024); Code is available at https://github.com/NancyQuris/XDomainMix

  46. arXiv:2404.14687  [pdf, other

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon , et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  47. arXiv:2404.11898  [pdf

    cs.AI

    Enhancing Financial Inclusion and Regulatory Challenges: A Critical Analysis of Digital Banks and Alternative Lenders Through Digital Platforms, Machine Learning, and Large Language Models Integration

    Authors: Luke Lee

    Abstract: This paper explores the dual impact of digital banks and alternative lenders on financial inclusion and the regulatory challenges posed by their business models. It discusses the integration of digital platforms, machine learning (ML), and Large Language Models (LLMs) in enhancing financial services accessibility for underserved populations. Through a detailed analysis of operational frameworks an… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 17 pages

  48. arXiv:2404.10536  [pdf, ps, other

    cs.DC

    Benchmarking Machine Learning Applications on Heterogeneous Architecture using Reframe

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings, Michele Weiland

    Abstract: With the rapid increase in machine learning workloads performed on HPC systems, it is beneficial to regularly perform machine learning specific benchmarks to monitor performance and identify issues. Furthermore, as part of the Edinburgh International Data Facility, EPCC currently hosts a wide range of machine learning accelerators including Nvidia GPUs, the Graphcore Bow Pod64 and Cerebras CS-2, w… ▽ More

    Submitted 25 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Author accepted version of paper in the PERMAVOST workshop at the 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC 24)

  49. arXiv:2404.06466  [pdf, other

    cs.LG stat.ML

    Hyperparameter Selection in Continual Learning

    Authors: Thomas L. Lee, Sigrid Passano Hellan, Linus Ericsson, Elliot J. Crowley, Amos Storkey

    Abstract: In continual learning (CL) -- where a learner trains on a stream of data -- standard hyperparameter optimisation (HPO) cannot be applied, as a learner does not have access to all of the data at the same time. This has prompted the development of CL-specific HPO frameworks. The most popular way to tune hyperparameters in CL is to repeatedly train over the whole data stream with different hyperparam… ▽ More

    Submitted 14 March, 2025; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Preprint, 16 pages

  50. arXiv:2404.03575  [pdf, other

    cs.CV

    DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

    Authors: Haoran Li, Haolin Shi, Wenli Zhang, Wenjun Wu, Yong Liao, Lin Wang, Lik-hang Lee, Pengyuan Zhou

    Abstract: Text-to-3D scene generation holds immense potential for the gaming, film, and architecture sectors. Despite significant progress, existing methods struggle with maintaining high quality, consistency, and editing flexibility. In this paper, we propose DreamScene, a 3D Gaussian-based novel text-to-3D scene generation framework, to tackle the aforementioned three challenges mainly via two strategies.… ▽ More

    Submitted 19 July, 2024; v1 submitted 4 April, 2024; originally announced April 2024.