Skip to main content

Showing 1–50 of 149 results for author: Wong, E

.
  1. arXiv:2506.13734  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Instruction Following by Boosting Attention of Large Language Models

    Authors: Vitoria Guardieiro, Adam Stein, Avishree Khare, Eric Wong

    Abstract: Controlling the generation of large language models (LLMs) remains a central challenge to ensure their safe and reliable deployment. While prompt engineering and finetuning are common approaches, recent work has explored latent steering, a lightweight technique that alters LLM internal activations to guide generation. However, subsequent studies revealed latent steering's effectiveness to be limit… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  2. arXiv:2506.06414  [pdf, other

    cs.CR cs.AI

    Benchmarking Misuse Mitigation Against Covert Adversaries

    Authors: Davis Brown, Mahdi Sabbaghi, Luze Sun, Alexander Robey, George J. Pappas, Eric Wong, Hamed Hassani

    Abstract: Existing language model safety evaluations focus on overt attacks and low-stakes tasks. Realistic attackers can subvert current safeguards by requesting help on small, benign-seeming tasks across many independent queries. Because individual queries do not appear harmful, the attack is hard to {detect}. However, when combined, these fragments uplift misuse by helping the attacker complete hard and… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  3. arXiv:2505.24874  [pdf, ps, other

    cs.LG

    The Road to Generalizable Neuro-Symbolic Learning Should be Paved with Foundation Models

    Authors: Adam Stein, Aaditya Naik, Neelay Velingker, Mayur Naik, Eric Wong

    Abstract: Neuro-symbolic learning was proposed to address challenges with training neural networks for complex reasoning tasks with the added benefits of interpretability, reliability, and efficiency. Neuro-symbolic learning methods traditionally train neural models in conjunction with symbolic programs, but they face significant challenges that limit them to simplistic problems. On the other hand, purely-n… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: 19 pages, 11 figures

  4. Data Mining-Based Techniques for Software Fault Localization

    Authors: Peggy Cellier, Mireille Ducassé, Sébastien Ferré, Olivier Ridoux, W. Eric Wong

    Abstract: This chapter illustrates the basic concepts of fault localization using a data mining technique. It utilizes the Trityp program to illustrate the general method. Formal concept analysis and association rule are two well-known methods for symbolic data mining. In their original inception, they both consider data in the form of an object-attribute table. In their original inception, they both consid… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Journal ref: Handbook of Software Fault Localization, 1, Wiley, Chapitre 7, 2023, Handbook of Software Fault Localization: Foundations and Advances, 9781119291824

  5. arXiv:2505.18082  [pdf, ps, other

    cs.LG

    An Iterative Framework for Generative Backmapping of Coarse Grained Proteins

    Authors: Georgios Kementzidis, Erin Wong, John Nicholson, Ruichen Xu, Yuefan Deng

    Abstract: The techniques of data-driven backmapping from coarse-grained (CG) to fine-grained (FG) representation often struggle with accuracy, unstable training, and physical realism, especially when applied to complex systems such as proteins. In this work, we introduce a novel iterative framework by using conditional Variational Autoencoders and graph-based neural networks, specifically designed to tackle… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 17 pages, 8 figures. For associated code repositories, see: CGVAE: https://github.com/wwang2/CoarseGrainingVAE GenZProT: https://github.com/learningmatter-mit/GenZProt See also arXiv:2201.12176 and arXiv:2303.01569 for related methods

  6. arXiv:2505.13366  [pdf, ps, other

    quant-ph

    A Game-Theoretic Quantum Algorithm for Solving Magic Squares

    Authors: Sarah Chehade, Andrea Delgado, Elaine Wong

    Abstract: Variational quantum algorithms (VQAs) offer a promising near-term approach to finding optimal quantum strategies for playing non-local games. These games test quantum correlations beyond classical limits and enable entanglement verification. In this work, we present a variational framework for the Magic Square Game (MSG), a two-player non-local game with perfect quantum advantage. We construct a v… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Submitted for review

  7. arXiv:2504.13787  [pdf, other

    cs.LG cs.AI

    Probabilistic Stability Guarantees for Feature Attributions

    Authors: Helen Jin, Anton Xue, Weiqiu You, Surbhi Goel, Eric Wong

    Abstract: Stability guarantees have emerged as a principled way to evaluate feature attributions, but existing certification methods rely on heavily smoothed classifiers and often produce conservative guarantees. To address these limitations, we introduce soft stability and propose a simple, model-agnostic, sample-efficient stability certification algorithm (SCA) that yields non-trivial and interpretable gu… ▽ More

    Submitted 17 May, 2025; v1 submitted 18 April, 2025; originally announced April 2025.

  8. arXiv:2503.24123  [pdf, other

    cs.LG

    CTSketch: Compositional Tensor Sketching for Scalable Neurosymbolic Learning

    Authors: Seewon Choi, Alaia Solko-Breslin, Rajeev Alur, Eric Wong

    Abstract: Many computational tasks benefit from being formulated as the composition of neural networks followed by a discrete symbolic program. The goal of neurosymbolic learning is to train the neural networks using only end-to-end input-output labels of the composite. We introduce CTSketch, a novel, scalable neurosymbolic learning algorithm. CTSketch uses two techniques to improve the scalability of neuro… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

    Comments: 15 pages, 6 figures

  9. arXiv:2503.08600  [pdf, other

    cs.CL

    NSF-SciFy: Mining the NSF Awards Database for Scientific Claims

    Authors: Delip Rao, Weiqiu You, Eric Wong, Chris Callison-Burch

    Abstract: We present NSF-SciFy, a large-scale dataset for scientific claim extraction derived from the National Science Foundation (NSF) awards database, comprising over 400K grant abstracts spanning five decades. While previous datasets relied on published literature, we leverage grant abstracts which offer a unique advantage: they capture claims at an earlier stage in the research lifecycle before publica… ▽ More

    Submitted 15 March, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

    Comments: 11 pages, 3 figures, 6 tables

  10. arXiv:2503.01986  [pdf, other

    cs.CL cs.AI cs.LG

    Adaptively profiling models with task elicitation

    Authors: Davis Brown, Prithvi Balehannina, Helen Jin, Shreya Havaldar, Hamed Hassani, Eric Wong

    Abstract: Language model evaluations often fail to characterize consequential failure modes, forcing experts to inspect outputs and build new benchmarks. We introduce task elicitation, a method that automatically builds new evaluations to profile model behavior. Task elicitation finds hundreds of natural-language tasks -- an order of magnitude more than prior work -- where frontier models exhibit systematic… ▽ More

    Submitted 20 May, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  11. arXiv:2503.01787  [pdf, other

    quant-ph cs.DC

    Building a Software Stack for Quantum-HPC Integration

    Authors: Amir Shehata, Peter Groszkowski, Thomas Naughton, Murali Gopalakrishnan Meena, Elaine Wong, Daniel Claudino, Rafael Ferreira da Silvaa, Thomas Beck

    Abstract: This paper presents a comprehensive software stack architecture for integrating quantum computing (QC) capabilities with High-Performance Computing (HPC) environments. While quantum computers show promise as specialized accelerators for scientific computing, their effective integration with classical HPC systems presents significant technical challenges. We propose a hardware-agnostic software fra… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 14 pages, 14 figures

  12. arXiv:2502.20598  [pdf, other

    cs.NI cs.AI

    Scalable Coordinated Learning for H2M/R Applications over Optical Access Networks (Invited)

    Authors: Sourav Mondal, Elaine Wong

    Abstract: One of the primary research interests adhering to next-generation fiber-wireless access networks is human-to-machine/robot (H2M/R) collaborative communications facilitating Industry 5.0. This paper discusses scalable H2M/R communications across large geographical distances that also allow rapid onboarding of new machines/robots as $\sim72\%$ training time is saved through global-local coordinated… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: This article is accepted for publication in 29th Opto-Electronics and Communications Conference 2024 (OECC2024). Copyright @ IEEE

  13. arXiv:2502.13966  [pdf, other

    cs.SE cs.LG

    Where's the Bug? Attention Probing for Scalable Fault Localization

    Authors: Adam Stein, Arthur Wayne, Aaditya Naik, Mayur Naik, Eric Wong

    Abstract: Ensuring code correctness remains a challenging problem even as large language models (LLMs) become increasingly capable at code-related tasks. While LLM-based program repair systems can propose bug fixes using only a user's bug report, their effectiveness is fundamentally limited by their ability to perform fault localization (FL), a challenging problem for both humans and LLMs. Existing FL appro… ▽ More

    Submitted 19 February, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: 14 pages, 5 figures

  14. The current cratering rate on the regular satellites of Jupiter, Saturn, and Uranus

    Authors: R. Brasser, E. W. Wong, S. C. Werner

    Abstract: We aim to compute the impact rates for objects with a diameter of 1 km onto the regular satellites of Jupiter, Saturn and Uranus using our latest dynamical simulations of the evolution of outer solar system coupled with the best estimates of the current population of objects beyond Neptune and their size-frequency distribution. We use the outcome of the last 3.5~Gyr of evolution of the outer solar… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

    Comments: In revision with Astronomy and Astrophysics

    Journal ref: A&A 695, A276 (2025)

  15. arXiv:2501.09176  [pdf, ps, other

    math.PR

    Existence of Periodic and Stationary Solutions to Distribution-Dependent SDEs

    Authors: Wei Sun, Ethan Wong

    Abstract: We investigate the periodic and stationary solutions of distribution-dependent stochastic differential equations. While generally, the semigroups associated with the equations are nonlinear, we show that the methods of weak convergence and Lyapunov functions can be combined to give efficient criteria for the existence of periodic and stationary solutions. Concrete examples are presented to illustr… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

    MSC Class: 60H10; 37A50

  16. arXiv:2412.16720  [pdf, other

    cs.AI

    OpenAI o1 System Card

    Authors: OpenAI, :, Aaron Jaech, Adam Kalai, Adam Lerer, Adam Richardson, Ahmed El-Kishky, Aiden Low, Alec Helyar, Aleksander Madry, Alex Beutel, Alex Carney, Alex Iftimie, Alex Karpenko, Alex Tachard Passos, Alexander Neitz, Alexander Prokofiev, Alexander Wei, Allison Tam, Ally Bennett, Ananya Kumar, Andre Saraiva, Andrea Vallone, Andrew Duberstein, Andrew Kondrich , et al. (238 additional authors not shown)

    Abstract: The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. These advanced reasoning capabilities provide new avenues for improving the safety and robustness of our models. In particular, our models can reason about our safety policies in context when responding to potentially unsafe prompts, through deliberative alignment. This leads to state-of-the-ar… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

  17. arXiv:2412.15058  [pdf, other

    cs.CV cs.LG eess.IV

    MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance

    Authors: Hallee E. Wong, Jose Javier Gonzalez Ortiz, John Guttag, Adrian V. Dalca

    Abstract: Medical researchers and clinicians often need to perform novel segmentation tasks on a set of related images. Existing methods for segmenting a new dataset are either interactive, requiring substantial human effort for each image, or require an existing set of manually labeled images. We introduce a system, MultiverSeg, that enables practitioners to rapidly segment an entire new dataset without re… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: Project Website: https://multiverseg.csail.mit.edu Keywords: interactive segmentation, in-context learning, medical image analysis, biomedical imaging, image annotation, visual prompting

  18. arXiv:2412.00760  [pdf, other

    eess.AS cs.AI cs.CL cs.ET cs.LG

    Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment

    Authors: Firdavs Nasriddinov, Rafal Kocielnik, Arushi Gupta, Cherine Yang, Elyssa Wong, Anima Anandkumar, Andrew Hung

    Abstract: This work introduces the first framework for reconstructing surgical dialogue from unstructured real-world recordings, which is crucial for characterizing teaching tasks. In surgical training, the formative verbal feedback that trainers provide to trainees during live surgeries is crucial for ensuring safety, correcting behavior immediately, and facilitating long-term skill acquisition. However, a… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

    Comments: Accepted as a proceedings paper at Machine Learning for Health 2024

    MSC Class: 68T50; 68U99; 68T99 ACM Class: I.2; I.2.7; I.5.4; J.3; K.3.1

  19. arXiv:2411.17101  [pdf

    cs.SE

    Software Fault Localization Based on Multi-objective Feature Fusion and Deep Learning

    Authors: Xiaolei Hu, Dongcheng Li, W. Eric Wong, Ya Zou

    Abstract: Software fault localization remains challenging due to limited feature diversity and low precision in traditional methods. This paper proposes a novel approach that integrates multi-objective optimization with deep learning models to improve both accuracy and efficiency in fault localization (FL). By framing feature selection as a multi-objective optimization problem (MOP), we extract and fuse thr… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

  20. arXiv:2411.10919  [pdf, other

    cs.LG cs.AI cs.CV

    Multi-Modal Self-Supervised Learning for Surgical Feedback Effectiveness Assessment

    Authors: Arushi Gupta, Rafal Kocielnik, Jiayun Wang, Firdavs Nasriddinov, Cherine Yang, Elyssa Wong, Anima Anandkumar, Andrew Hung

    Abstract: During surgical training, real-time feedback from trainers to trainees is important for preventing errors and enhancing long-term skill acquisition. Accurately predicting the effectiveness of this feedback, specifically whether it leads to a change in trainee behavior, is crucial for developing methods for improving surgical training and education. However, relying on human annotations to assess f… ▽ More

    Submitted 16 November, 2024; originally announced November 2024.

    Comments: Accepted as a spotlight proceedings paper at Machine Learning for Health 2024

    MSC Class: 68T07; 68T45; 68U10; 92C50 ACM Class: I.2; I.2.10; I.5.4; I.4.7; J.3; K.3.1

  21. arXiv:2411.02372  [pdf, other

    cs.CV cs.LG

    Learning General-Purpose Biomedical Volume Representations using Randomized Synthesis

    Authors: Neel Dey, Benjamin Billot, Hallee E. Wong, Clinton J. Wang, Mengwei Ren, P. Ellen Grant, Adrian V. Dalca, Polina Golland

    Abstract: Current volumetric biomedical foundation models struggle to generalize as public 3D datasets are small and do not cover the broad diversity of medical procedures, conditions, anatomical regions, and imaging protocols. We address this by creating a representation learning method that instead anticipates strong domain shifts at training time itself. We first propose a data engine that synthesizes hi… ▽ More

    Submitted 2 March, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: ICLR 2025: International Conference on Learning Representations. Code and model weights available at https://github.com/neel-dey/anatomix. Keywords: synthetic data, representation learning, medical image analysis, image registration, image segmentation

  22. arXiv:2411.01033  [pdf

    cs.SE

    Many-Objective Search-Based Coverage-Guided Automatic Test Generation for Deep Neural Networks

    Authors: Dongcheng Li, W. Eric Wong, Hu Liu, Man Zhao

    Abstract: To ensure the reliability of DNN systems and address the test generation problem for neural networks, this paper proposes a fuzzing test generation technique based on many-objective optimization algorithms. Traditional fuzz testing employs random search, leading to lower testing efficiency and tends to generate numerous invalid test cases. By utilizing many-objective optimization techniques, effec… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  23. arXiv:2410.24178  [pdf, other

    cs.LG

    AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties

    Authors: Xiayan Ji, Anton Xue, Eric Wong, Oleg Sokolsky, Insup Lee

    Abstract: Anomaly detection is widely used for identifying critical errors and suspicious behaviors, but current methods lack interpretability. We leverage common properties of existing methods and recent advances in generative models to introduce counterfactual explanations for anomaly detection. Given an input, we generate its counterfactual as a diffusion-based repair that shows what a non-anomalous vers… ▽ More

    Submitted 31 October, 2024; originally announced October 2024.

  24. arXiv:2410.21276  [pdf, other

    cs.CL cs.AI cs.CV cs.CY cs.LG cs.SD eess.AS

    GPT-4o System Card

    Authors: OpenAI, :, Aaron Hurst, Adam Lerer, Adam P. Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, Aleksander Mądry, Alex Baker-Whitcomb, Alex Beutel, Alex Borzunov, Alex Carney, Alex Chow, Alex Kirillov, Alex Nichol, Alex Paino, Alex Renzin, Alex Tachard Passos, Alexander Kirillov, Alexi Christakis , et al. (395 additional authors not shown)

    Abstract: GPT-4o is an autoregressive omni model that accepts as input any combination of text, audio, image, and video, and generates any combination of text, audio, and image outputs. It's trained end-to-end across text, vision, and audio, meaning all inputs and outputs are processed by the same neural network. GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 mil… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  25. arXiv:2410.19173  [pdf, other

    quant-ph math.PR

    Probabilistic Representation of Commutative Quantum Circuit Models

    Authors: Richard Yu, Jorge Ramirez, Elaine Wong

    Abstract: In commuting parametric quantum circuits, the Fourier series of the pairwise fidelity can be expressed as the characteristic function of random variables. Furthermore, expressiveness can be cast as the recurrence probability of a random walk on a lattice. This construction has been successfully applied to the group composed only of Pauli-Z rotations, and we generalize this probabilistic strategy t… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

  26. arXiv:2410.10046  [pdf

    cs.SE

    A Hybrid Sampling and Multi-Objective Optimization Approach for Enhanced Software Defect Prediction

    Authors: Jie Zhang, Dongcheng Li, W. Eric Wong, Shengrong Wang

    Abstract: Accurate early prediction of software defects is essential to maintain software quality and reduce maintenance costs. However, the field of software defect prediction (SDP) faces challenges such as class imbalances, high-dimensional feature spaces, and suboptimal prediction accuracy. To mitigate these challenges, this paper introduces a novel SDP framework that integrates hybrid sampling technique… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

  27. arXiv:2410.03348  [pdf, ps, other

    cs.LG

    Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning

    Authors: Aaditya Naik, Jason Liu, Claire Wang, Amish Sethi, Saikat Dutta, Mayur Naik, Eric Wong

    Abstract: Neurosymbolic learning enables the integration of symbolic reasoning with deep learning but faces significant challenges in scaling to complex symbolic programs, large datasets, or both. We introduce DOLPHIN, a framework that tackles these challenges by supporting neurosymbolic programs in Python, executing complex symbolic reasoning on the CPU while vectorizing probabilistic computations and grad… ▽ More

    Submitted 28 May, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

  28. arXiv:2410.00282  [pdf

    cs.SE

    Smart Contract Vulnerability Detection based on Static Analysis and Multi-Objective Search

    Authors: Dongcheng Li, W. Eric Wong, Xiaodan Wang, Sean Pan, Liang-Seng Koh

    Abstract: This paper introduces a method for detecting vulnerabilities in smart contracts using static analysis and a multi-objective optimization algorithm. We focus on four types of vulnerabilities: reentrancy, call stack overflow, integer overflow, and timestamp dependencies. Initially, smart contracts are compiled into an abstract syntax tree to analyze relationships between contracts and functions, inc… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

  29. arXiv:2409.13684  [pdf, other

    cs.LG cs.AI

    The FIX Benchmark: Extracting Features Interpretable to eXperts

    Authors: Helen Jin, Shreya Havaldar, Chaehyeon Kim, Anton Xue, Weiqiu You, Helen Qu, Marco Gatti, Daniel A Hashimoto, Bhuvnesh Jain, Amin Madani, Masao Sako, Lyle Ungar, Eric Wong

    Abstract: Feature-based methods are commonly used to explain model predictions, but these methods often implicitly assume that interpretable features are readily available. However, this is often not the case for high-dimensional data, and it can be hard even for domain experts to mathematically specify which features are important. Can we instead automatically extract collections or groups of features that… ▽ More

    Submitted 23 December, 2024; v1 submitted 20 September, 2024; originally announced September 2024.

  30. arXiv:2407.00075  [pdf, other

    cs.AI cs.CL cs.CR cs.LG

    Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

    Authors: Anton Xue, Avishree Khare, Rajeev Alur, Surbhi Goel, Eric Wong

    Abstract: We study how to subvert large language models (LLMs) from following prompt-specified rules. We first formalize rule-following as inference in propositional Horn logic, a mathematical system in which rules have the form "if $P$ and $Q$, then $R$" for some propositions $P$, $Q$, and $R$. Next, we prove that although small transformers can faithfully follow such rules, maliciously crafted prompts can… ▽ More

    Submitted 28 February, 2025; v1 submitted 21 June, 2024; originally announced July 2024.

  31. arXiv:2406.18534  [pdf, other

    cs.CL cs.LG

    Towards Compositionality in Concept Learning

    Authors: Adam Stein, Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong

    Abstract: Concept-based interpretability methods offer a lens into the internals of foundation models by decomposing their embeddings into high-level concepts. These concept representations are most useful when they are compositional, meaning that the individual concepts compose to explain the full sample. We show that existing unsupervised concept extraction methods find concepts which are not compositiona… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. 26 pages, 10 figures

  32. arXiv:2406.10952  [pdf, other

    cs.CL

    Avoiding Copyright Infringement via Large Language Model Unlearning

    Authors: Guangyao Dou, Zheyuan Liu, Qing Lyu, Kaize Ding, Eric Wong

    Abstract: Pre-trained Large Language Models (LLMs) have demonstrated remarkable capabilities but also pose risks by learning and generating copyrighted material, leading to significant legal and ethical concerns. In real-world scenarios, model owners need to continuously address copyright infringement as new requests for content removal emerge at different time points. This leads to the need for sequential… ▽ More

    Submitted 10 February, 2025; v1 submitted 16 June, 2024; originally announced June 2024.

  33. arXiv:2406.06246  [pdf, other

    cs.LG

    Data-Efficient Learning with Neural Programs

    Authors: Alaia Solko-Breslin, Seewon Choi, Ziyang Li, Neelay Velingker, Rajeev Alur, Mayur Naik, Eric Wong

    Abstract: Many computational tasks can be naturally expressed as a composition of a DNN followed by a program written in a traditional programming language or an API call to an LLM. We call such composites "neural programs" and focus on the problem of learning the DNN parameters when the training data consist of end-to-end input-output labels for the composite. When the program is written in a differentiabl… ▽ More

    Submitted 31 October, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  34. Rethinking Programming Paradigms in the QC-HPC Context

    Authors: Silvina Caino-Lores, Daniel Claudino, Eugene Dumitrescu, Travis S. Humble, Sonia Lopez Alarcon, Elaine Wong

    Abstract: Programming for today's quantum computers is making significant strides toward modern workflows compatible with high performance computing (HPC), but fundamental challenges still remain in the integration of these vastly different technologies. Quantum computing (QC) programming languages share some common ground, as well as their emerging runtimes and algorithmic modalities. In this short paper,… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Journal ref: WAMTA 2024: Proceedings of the Workshop on Asynchronous Many-Task Systems and Applications. Lecture Notes in Computer Science, Vol 14626, Pages 84-91, Springer, Cham

  35. arXiv:2406.00611  [pdf, other

    cs.LG stat.ME

    DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation

    Authors: Yinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J Getzen, Qi Long, Mayur Naik, Ravi B Parikh, Eric Wong

    Abstract: Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. 22 pages, 5 figures

  36. arXiv:2405.06692  [pdf, ps, other

    cs.CL

    Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis Models

    Authors: Ethan Parker Wong, Faten M'hiri

    Abstract: Inspired by the 'Bias Considerations in Bilingual Natural Language Processing' report by Statistics Canada, this study delves into potential biases in multilingual sentiment analysis between English and French. Given a 50-50 dataset of French and English, we aim to determine if there exists a language bias and explore how the incorporation of more diverse datasets in the future might affect the eq… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Undergraduate Research Project

  37. Practice-informed Patterns for Organising Large Groups in Distributed Mixed Reality Collaboration

    Authors: Emily Wong, Juan Sánchez Esquivel, Jens Emil Grønbæk, Germán Leiva, Eduardo Velloso

    Abstract: Collaborating across dissimilar, distributed spaces presents numerous challenges for computer-aided spatial communication. Mixed reality (MR) can blend selected surfaces, allowing collaborators to work in blended f-formations (facing formations), even when their workstations are physically misaligned. Since collaboration often involves more than just participant pairs, this research examines how w… ▽ More

    Submitted 9 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Journal ref: CHI '24, Proceedings of the CHI Conference on Human Factors in Computing Systems, May 11-16 2024, Honolulu, HI, USA

  38. Expressiveness of Commutative Quantum Circuits: A Probabilistic Approach

    Authors: Jorge M. Ramirez, Elaine Wong, Caio Alves, Sarah Chehade, Ryan Bennink

    Abstract: This study investigates the frame potential and expressiveness of commutative quantum circuits. Based on the Fourier series representation of these circuits, we express quantum expectation and pairwise fidelity as characteristic functions of random variables, and expressiveness as the recurrence probability of a random walk on a lattice. A central outcome of our work includes formulas to approxima… ▽ More

    Submitted 2 December, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Journal ref: IEEE Transactions on Quantum Engineering, Vol. 5, Pages 1-15 (2024)

  39. arXiv:2404.14299  [pdf, other

    quant-ph cs.SE

    A Cross-Platform Execution Engine for the Quantum Intermediate Representation

    Authors: Elaine Wong, Vicente Leyton Ortega, Daniel Claudino, Seth Johnson, Sharmin Afrose, Meenambika Gowrishankar, Anthony M. Cabrera, Travis S. Humble

    Abstract: Hybrid languages like the Quantum Intermediate Representation (QIR) are essential for programming systems that mix quantum and conventional computing models, while execution of these programs is often deferred to a system-specific implementation. Here, we describe and demonstrate the QIR Execution Engine (QIR-EE) for parsing, interpreting, and executing QIR across multiple hardware platforms. QIR-… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  40. arXiv:2404.01318  [pdf, other

    cs.CR cs.LG

    JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

    Authors: Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramer, Hamed Hassani, Eric Wong

    Abstract: Jailbreak attacks cause large language models (LLMs) to generate harmful, unethical, or otherwise objectionable content. Evaluating these attacks presents a number of challenges, which the current collection of benchmarks and evaluation techniques do not adequately address. First, there is no clear standard of practice regarding jailbreaking evaluation. Second, existing works compute costs and suc… ▽ More

    Submitted 31 October, 2024; v1 submitted 27 March, 2024; originally announced April 2024.

    Comments: The camera-ready version of JailbreakBench v1.0 (accepted at NeurIPS 2024 Datasets and Benchmarks Track): more attack artifacts, more test-time defenses, a more accurate jailbreak judge (Llama-3-70B with a custom prompt), a larger dataset of human preferences for selecting a jailbreak judge (300 examples), an over-refusal evaluation dataset, a semantic refusal judge based on Llama-3-8B

  41. arXiv:2402.16192  [pdf, other

    cs.CL

    Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing

    Authors: Jiabao Ji, Bairu Hou, Alexander Robey, George J. Pappas, Hamed Hassani, Yang Zhang, Eric Wong, Shiyu Chang

    Abstract: Aligned large language models (LLMs) are vulnerable to jailbreaking attacks, which bypass the safeguards of targeted LLMs and fool them into generating objectionable content. While initial defenses show promise against token-based threat models, there do not exist defenses that provide robustness against semantic attacks and avoid unfavorable trade-offs between robustness and nominal performance.… ▽ More

    Submitted 28 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: 37 pages

  42. arXiv:2401.13650  [pdf, other

    eess.IV cs.CV

    Tyche: Stochastic In-Context Learning for Medical Image Segmentation

    Authors: Marianne Rakic, Hallee E. Wong, Jose Javier Gonzalez Ortiz, Beth Cimini, John Guttag, Adrian V. Dalca

    Abstract: Existing learning-based solutions to medical image segmentation have two important shortcomings. First, for most new segmentation task, a new model has to be trained or fine-tuned. This requires extensive resources and machine learning expertise, and is therefore often infeasible for medical researchers and clinicians. Second, most existing segmentation methods produce a single deterministic segme… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  43. arXiv:2312.07381  [pdf, other

    cs.CV eess.IV

    ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image

    Authors: Hallee E. Wong, Marianne Rakic, John Guttag, Adrian V. Dalca

    Abstract: Biomedical image segmentation is a crucial part of both scientific research and clinical care. With enough labelled data, deep learning models can be trained to accurately automate specific biomedical image segmentation tasks. However, manually segmenting images to create training data is highly labor intensive and requires domain expertise. We present \emph{ScribblePrompt}, a flexible neural netw… ▽ More

    Submitted 16 July, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted by ECCV 2024. Project Website: https://scribbleprompt.csail.mit.edu Keywords: Interactive Segmentation, Medical Imaging, Segment Anything Model, SAM, Scribble Annotations, Prompt

  44. arXiv:2312.05716  [pdf, other

    cs.CV

    Initialization Matters for Adversarial Transfer Learning

    Authors: Andong Hua, Jindong Gu, Zhiyu Xue, Nicholas Carlini, Eric Wong, Yao Qin

    Abstract: With the prevalence of the Pretraining-Finetuning paradigm in transfer learning, the robustness of downstream tasks has become a critical concern. In this work, we delve into adversarial robustness in transfer learning and reveal the critical role of initialization, including both the pretrained model and the linear head. First, we discover the necessity of an adversarially robust pretrained model… ▽ More

    Submitted 30 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  45. arXiv:2312.03231  [pdf, other

    cs.LG cs.AI cs.CV cs.HC eess.AS

    Deep Multimodal Fusion for Surgical Feedback Classification

    Authors: Rafal Kocielnik, Elyssa Y. Wong, Timothy N. Chu, Lydia Lin, De-An Huang, Jiayun Wang, Anima Anandkumar, Andrew J. Hung

    Abstract: Quantification of real-time informal feedback delivered by an experienced surgeon to a trainee during surgery is important for skill improvements in surgical training. Such feedback in the live operating room is inherently multimodal, consisting of verbal conversations (e.g., questions and answers) as well as non-verbal elements (e.g., through visual cues like pointing to anatomic elements). In th… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Journal ref: Published in Proceedings of Machine Learning for Health 2024

  46. arXiv:2310.16316  [pdf, other

    cs.LG cs.AI

    Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups

    Authors: Weiqiu You, Helen Qu, Marco Gatti, Bhuvnesh Jain, Eric Wong

    Abstract: Self-attributing neural networks (SANNs) present a potential path towards interpretable models for high-dimensional problems, but often face significant trade-offs in performance. In this work, we formally prove a lower bound on errors of per-feature SANNs, whereas group-based SANNs can achieve zero error and thus high performance. Motivated by these insights, we propose Sum-of-Parts (SOP), a fram… ▽ More

    Submitted 14 February, 2025; v1 submitted 24 October, 2023; originally announced October 2023.

  47. arXiv:2310.12508  [pdf, other

    cs.LG cs.AI

    SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation

    Authors: Chongyu Fan, Jiancheng Liu, Yihua Zhang, Eric Wong, Dennis Wei, Sijia Liu

    Abstract: With evolving data regulations, machine unlearning (MU) has become an important tool for fostering trust and safety in today's AI models. However, existing MU methods focusing on data and/or weight perspectives often suffer limitations in unlearning accuracy, stability, and cross-domain applicability. To address these challenges, we introduce the concept of 'weight saliency' for MU, drawing parall… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024 as a Spotlight paper

  48. arXiv:2310.08419  [pdf, other

    cs.LG cs.AI

    Jailbreaking Black Box Large Language Models in Twenty Queries

    Authors: Patrick Chao, Alexander Robey, Edgar Dobriban, Hamed Hassani, George J. Pappas, Eric Wong

    Abstract: There is growing interest in ensuring that large language models (LLMs) align with human values. However, the alignment of such models is vulnerable to adversarial jailbreaks, which coax LLMs into overriding their safety guardrails. The identification of these vulnerabilities is therefore instrumental in understanding inherent weaknesses and preventing future misuse. To this end, we propose Prompt… ▽ More

    Submitted 18 July, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  49. arXiv:2310.07135  [pdf, other

    cs.CL

    Comparing Styles across Languages: A Cross-Cultural Exploration of Politeness

    Authors: Shreya Havaldar, Matthew Pressimone, Eric Wong, Lyle Ungar

    Abstract: Understanding how styles differ across languages is advantageous for training both humans and computers to generate culturally appropriate text. We introduce an explanation framework to extract stylistic differences from multilingual LMs and compare styles across languages. Our framework (1) generates comprehensive style lexica in any language and (2) consolidates feature importances from LMs into… ▽ More

    Submitted 26 March, 2025; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  50. arXiv:2310.03684  [pdf, other

    cs.LG cs.AI stat.ML

    SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks

    Authors: Alexander Robey, Eric Wong, Hamed Hassani, George J. Pappas

    Abstract: Despite efforts to align large language models (LLMs) with human intentions, widely-used LLMs such as GPT, Llama, and Claude are susceptible to jailbreaking attacks, wherein an adversary fools a targeted LLM into generating objectionable content. To address this vulnerability, we propose SmoothLLM, the first algorithm designed to mitigate jailbreaking attacks. Based on our finding that adversarial… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.