Skip to main content

Showing 1–23 of 23 results for author: Diffenderfer, J

.
  1. arXiv:2505.01912  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.AI

    BOOM: Benchmarking Out-Of-distribution Molecular Property Predictions of Machine Learning Models

    Authors: Evan R. Antoniuk, Shehtab Zaman, Tal Ben-Nun, Peggy Li, James Diffenderfer, Busra Demirci, Obadiah Smolenski, Tim Hsu, Anna M. Hiszpanski, Kenneth Chiu, Bhavya Kailkhura, Brian Van Essen

    Abstract: Advances in deep learning and generative modeling have driven interest in data-driven molecule discovery pipelines, whereby machine learning (ML) models are used to filter and design novel molecules without requiring prohibitively expensive first-principles simulations. Although the discovery of novel molecules that extend the boundaries of known chemistry requires accurate out-of-distribution (OO… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

  2. arXiv:2504.10185  [pdf, other

    cs.CL cs.AI cs.LG

    LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current Benchmarks

    Authors: Soumyadeep Pal, Changsheng Wang, James Diffenderfer, Bhavya Kailkhura, Sijia Liu

    Abstract: Large language model unlearning has become a critical challenge in ensuring safety and controlled model behavior by removing undesired data-model influences from the pretrained model while preserving general utility. Significant recent efforts have been dedicated to developing LLM unlearning benchmarks such as WMDP (Weapons of Mass Destruction Proxy) and MUSE (Machine Unlearning Six-way Evaluation… ▽ More

    Submitted 16 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

  3. arXiv:2503.18929  [pdf, other

    cs.LG

    Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training

    Authors: Brian R. Bartoldson, Siddarth Venkatraman, James Diffenderfer, Moksh Jain, Tal Ben-Nun, Seanie Lee, Minsu Kim, Johan Obando-Ceron, Yoshua Bengio, Bhavya Kailkhura

    Abstract: Reinforcement learning (RL) is a critical component of large language model (LLM) post-training. However, existing on-policy algorithms used for post-training are inherently incompatible with the use of experience replay buffers, which can be populated scalably by distributed off-policy actors to enhance exploration as compute increases. We propose efficiently obtaining this benefit of replay buff… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  4. arXiv:2503.10602  [pdf, other

    cs.CV cs.AI cs.CL

    TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

    Authors: Jinhao Duan, Fei Kong, Hao Cheng, James Diffenderfer, Bhavya Kailkhura, Lichao Sun, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu

    Abstract: Object Hallucination (OH) has been acknowledged as one of the major trustworthy challenges in Large Vision-Language Models (LVLMs). Recent advancements in Large Language Models (LLMs) indicate that internal states, such as hidden states, encode the "overall truthfulness" of generated responses. However, it remains under-explored how internal states in LVLMs function and whether they could serve as… ▽ More

    Submitted 21 March, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: 15 pages, 9 figures, the first two authors contributed equally

  5. arXiv:2404.18239  [pdf, other

    cs.LG cs.CL

    SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning

    Authors: Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu

    Abstract: Large Language Models (LLMs) have highlighted the necessity of effective unlearning mechanisms to comply with data regulations and ethical AI practices. LLM unlearning aims at removing undesired data influences and associated model capabilities without compromising utility beyond the scope of unlearning. While interest in studying LLM unlearning is growing, the impact of the optimizer choice for L… ▽ More

    Submitted 24 June, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  6. arXiv:2404.11766  [pdf, other

    cs.LG math.NA math.OC

    End-to-End Mesh Optimization of a Hybrid Deep Learning Black-Box PDE Solver

    Authors: Shaocong Ma, James Diffenderfer, Bhavya Kailkhura, Yi Zhou

    Abstract: Deep learning has been widely applied to solve partial differential equations (PDEs) in computational fluid dynamics. Recent research proposed a PDE correction framework that leverages deep learning to correct the solution obtained by a PDE solver on a coarse mesh. However, end-to-end training of such a PDE correction model over both solver-dependent parameters such as mesh parameters and neural n… ▽ More

    Submitted 28 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  7. arXiv:2404.09349  [pdf, other

    cs.LG cs.CR cs.CV

    Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies

    Authors: Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris, Bhavya Kailkhura

    Abstract: This paper revisits the simple, long-studied, yet still unsolved problem of making image classifiers robust to imperceptible perturbations. Taking CIFAR10 as an example, SOTA clean accuracy is about $100$%, but SOTA robustness to $\ell_{\infty}$-norm bounded perturbations barely exceeds $70$%. To understand this gap, we analyze how model size, dataset size, and synthetic data quality affect robust… ▽ More

    Submitted 10 July, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: ICML 2024

  8. arXiv:2403.15447  [pdf, other

    cs.CL cs.AI

    Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

    Authors: Junyuan Hong, Jinhao Duan, Chenhui Zhang, Zhangheng Li, Chulin Xie, Kelsey Lieberman, James Diffenderfer, Brian Bartoldson, Ajay Jaiswal, Kaidi Xu, Bhavya Kailkhura, Dan Hendrycks, Dawn Song, Zhangyang Wang, Bo Li

    Abstract: Compressing high-capability Large Language Models (LLMs) has emerged as a favored strategy for resource-efficient inferences. While state-of-the-art (SoTA) compression methods boast impressive advancements in preserving benign task performance, the potential risks of compression in terms of safety and trustworthiness have been largely neglected. This study conducts the first, thorough evaluation o… ▽ More

    Submitted 4 June, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted to ICML'24

  9. arXiv:2402.12348  [pdf, other

    cs.CL cs.AI cs.LG

    GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

    Authors: Jinhao Duan, Renming Zhang, James Diffenderfer, Bhavya Kailkhura, Lichao Sun, Elias Stengel-Eskin, Mohit Bansal, Tianlong Chen, Kaidi Xu

    Abstract: As Large Language Models (LLMs) are integrated into critical real-world applications, their strategic and logical reasoning abilities are increasingly crucial. This paper evaluates LLMs' reasoning abilities in competitive environments through game-theoretic tasks, e.g., board and card games that require pure logic and strategic reasoning to compete with opponents. We first propose GTBench, a langu… ▽ More

    Submitted 10 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 26 pages; the first two authors contributed equally; GTBench HF Leaderboard: https://huggingface.co/spaces/GTBench/GTBench

  10. arXiv:2312.06900  [pdf, other

    cs.CV

    When Bio-Inspired Computing meets Deep Learning: Low-Latency, Accurate, & Energy-Efficient Spiking Neural Networks from Artificial Neural Networks

    Authors: Gourav Datta, Zeyu Liu, James Diffenderfer, Bhavya Kailkhura, Peter A. Beerel

    Abstract: Bio-inspired Spiking Neural Networks (SNN) are now demonstrating comparable accuracy to intricate convolutional neural networks (CNN), all while delivering remarkable energy and latency efficiency when deployed on neuromorphic hardware. In particular, ANN-to-SNN conversion has recently gained significant traction in developing deep SNNs with close to state-of-the-art (SOTA) test accuracy on comple… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Under review

  11. arXiv:2310.02025  [pdf, other

    cs.LG

    DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training

    Authors: Aochuan Chen, Yimeng Zhang, Jinghan Jia, James Diffenderfer, Jiancheng Liu, Konstantinos Parasyris, Yihua Zhang, Zheng Zhang, Bhavya Kailkhura, Sijia Liu

    Abstract: Zeroth-order (ZO) optimization has become a popular technique for solving machine learning (ML) problems when first-order (FO) information is difficult or impossible to obtain. However, the scalability of ZO optimization remains an open problem: Its use has primarily been limited to relatively small-scale ML problems, such as sample-wise adversarial attack generation. To our best knowledge, no pri… ▽ More

    Submitted 15 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR'24. Codes are available at https://github.com/OPTML-Group/DeepZero

  12. arXiv:2307.08657  [pdf, other

    eess.IV cs.LG

    Neural Image Compression: Generalization, Robustness, and Spectral Biases

    Authors: Kelsey Lieberman, James Diffenderfer, Charles Godfrey, Bhavya Kailkhura

    Abstract: Recent advances in neural image compression (NIC) have produced models that are starting to outperform classic codecs. While this has led to growing excitement about using NIC in real-world applications, the successful adoption of any machine learning system in the wild requires it to generalize (and be robust) to unseen distribution shifts at deployment. Unfortunately, current research lacks comp… ▽ More

    Submitted 27 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  13. arXiv:2209.12839  [pdf, other

    cs.LG cs.AI

    Efficient Multi-Prize Lottery Tickets: Enhanced Accuracy, Training, and Inference Speed

    Authors: Hao Cheng, Pu Zhao, Yize Li, Xue Lin, James Diffenderfer, Ryan Goldhahn, Bhavya Kailkhura

    Abstract: Recently, Diffenderfer and Kailkhura proposed a new paradigm for learning compact yet highly accurate binary neural networks simply by pruning and quantizing randomly weighted full precision neural networks. However, the accuracy of these multi-prize tickets (MPTs) is highly sensitive to the optimal prune ratio, which limits their applicability. Furthermore, the original implementation did not att… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  14. arXiv:2207.04075  [pdf, other

    cs.LG

    Models Out of Line: A Fourier Lens on Distribution Shift Robustness

    Authors: Sara Fridovich-Keil, Brian R. Bartoldson, James Diffenderfer, Bhavya Kailkhura, Peer-Timo Bremer

    Abstract: Improving the accuracy of deep neural networks (DNNs) on out-of-distribution (OOD) data is critical to an acceptance of deep learning (DL) in real world applications. It has been observed that accuracies on in-distribution (ID) versus OOD data follow a linear trend and models that outperform this baseline are exceptionally rare (and referred to as "effectively robust"). Recently, some promising ap… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  15. arXiv:2206.02785  [pdf, other

    cs.LG cs.AI

    Zeroth-Order SciML: Non-intrusive Integration of Scientific Software with Deep Learning

    Authors: Ioannis Tsaknakis, Bhavya Kailkhura, Sijia Liu, Donald Loveland, James Diffenderfer, Anna Maria Hiszpanski, Mingyi Hong

    Abstract: Using deep learning (DL) to accelerate and/or improve scientific workflows can yield discoveries that are otherwise impossible. Unfortunately, DL models have yielded limited success in complex scientific domains due to large data requirements. In this work, we propose to overcome this issue by integrating the abundance of scientific knowledge sources (SKS) with the DL training process. Existing kn… ▽ More

    Submitted 4 June, 2022; originally announced June 2022.

  16. arXiv:2203.11295  [pdf, other

    cs.LG cs.AR

    Benchmarking Test-Time Unsupervised Deep Neural Network Adaptation on Edge Devices

    Authors: Kshitij Bhardwaj, James Diffenderfer, Bhavya Kailkhura, Maya Gokhale

    Abstract: The prediction accuracy of the deep neural networks (DNNs) after deployment at the edge can suffer with time due to shifts in the distribution of the new data. To improve robustness of DNNs, they must be able to update themselves to enhance their prediction accuracy. This adaptation at the resource-constrained edge is challenging as: (i) new labeled data may not be present; (ii) adaptation needs t… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: This paper was selected for poster presentation in International Symposium on Performance Analysis of Systems and Software (ISPASS), 2022

  17. arXiv:2107.07479  [pdf, ps, other

    math.OC

    NPASA: An algorithm for nonlinear programming -- Motivation and Global Convergence

    Authors: James Diffenderfer, William W. Hager

    Abstract: In this paper, we present a two phase method for solving nonlinear programming problems called Nonlinear Polyhedral Active Set Algorithm (NPASA) that has global and local convergence guarantees under reasonable assumptions. The first phase consists of an augmented Lagrangian method to ensure global convergence while the second phase is designed to promote fast local convergence by performing a bal… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

  18. arXiv:2107.07478  [pdf, ps, other

    math.OC

    NPASA: An algorithm for nonlinear programming -- Local Convergence

    Authors: James Diffenderfer, William W. Hager

    Abstract: In this paper, we provide local convergence analysis for the two phase Nonlinear Polyhedral Active Set Algorithm (NPASA) designed to solve nonlinear programs. In particular, we establish local quadratic convergence of the primal iterates and global error estimator for NPASA under reasonable assumptions. Additionally, under the same set of assumptions we prove that only phase two of NPASA is execut… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

  19. arXiv:2106.09129  [pdf, other

    cs.LG

    A Winning Hand: Compressing Deep Networks Can Improve Out-Of-Distribution Robustness

    Authors: James Diffenderfer, Brian R. Bartoldson, Shreya Chaganti, Jize Zhang, Bhavya Kailkhura

    Abstract: Successful adoption of deep learning (DL) in the wild requires models to be: (1) compact, (2) accurate, and (3) robust to distributional shifts. Unfortunately, efforts towards simultaneously meeting these requirements have mostly been unsuccessful. This raises an important question: Is the inability to create Compact, Accurate, and Robust Deep neural networks (CARDs) fundamental? To answer this qu… ▽ More

    Submitted 5 November, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

  20. arXiv:2105.00115  [pdf, other

    math.NA cs.MS

    QDOT: Quantized Dot Product Kernel for Approximate High-Performance Computing

    Authors: James Diffenderfer, Daniel Osei-Kuffuor, Harshitha Menon

    Abstract: Approximate computing techniques have been successful in reducing computation and power costs in several domains. However, error sensitive applications in high-performance computing are unable to benefit from existing approximate computing strategies that are not developed with guaranteed error bounds. While approximate computing techniques can be developed for individual high-performance computin… ▽ More

    Submitted 30 April, 2021; originally announced May 2021.

  21. arXiv:2103.09377  [pdf, other

    cs.LG cs.CV

    Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network

    Authors: James Diffenderfer, Bhavya Kailkhura

    Abstract: Recently, Frankle & Carbin (2019) demonstrated that randomly-initialized dense networks contain subnetworks that once found can be trained to reach test accuracy comparable to the trained dense network. However, finding these high performing trainable subnetworks is expensive, requiring iterative process of training and pruning weights. In this paper, we propose (and prove) a stronger Multi-Prize… ▽ More

    Submitted 16 March, 2021; originally announced March 2021.

  22. Stability Analysis of Inline ZFP Compression for Floating-Point Data in Iterative Methods

    Authors: Alyson Fox, James Diffenderfer, Jeffrey Hittinger, Geoffrey Sanders, Peter Lindstrom

    Abstract: Currently, the dominating constraint in many high performance computing applications is data capacity and bandwidth, in both inter-node communications and even more-so in on-node data motion. A new approach to address this limitation is to make use of data compression in the form of a compressed data array. Storing data in a compressed data array and converting to standard IEEE-754 types as needed… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

  23. Error Analysis of ZFP Compression for Floating-Point Data

    Authors: James Diffenderfer, Alyson Fox, Jeffrey Hittinger, Geoffrey Sanders, Peter Lindstrom

    Abstract: Compression of floating-point data will play an important role in high-performance computing as data bandwidth and storage become dominant costs. Lossy compression of floating-point data is powerful, but theoretical results are needed to bound its errors when used to store look-up tables, simulation results, or even the solution state during the computation. \black{In this paper, we analyze the ro… ▽ More

    Submitted 11 February, 2019; v1 submitted 1 May, 2018; originally announced May 2018.