Skip to main content

Showing 1–24 of 24 results for author: Siegel, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06079  [pdf, ps, other

    cs.LG cs.AI

    QS4D: Quantization-aware training for efficient hardware deployment of structured state-space sequential models

    Authors: Sebastian Siegel, Ming-Jay Yang, Younes Bouhadjar, Maxime Fabre, Emre Neftci, John Paul Strachan

    Abstract: Structured State Space models (SSM) have recently emerged as a new class of deep learning models, particularly well-suited for processing long sequences. Their constant memory footprint, in contrast to the linearly scaling memory demands of Transformers, makes them attractive candidates for deployment on resource-constrained edge-computing devices. While recent works have explored the effect of qu… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  2. arXiv:2504.15934  [pdf, other

    cs.ET q-bio.QM

    Real-time raw signal genomic analysis using fully integrated memristor hardware

    Authors: Peiyi He, Shengbo Wang, Ruibin Mao, Sebastian Siegel, Giacomo Pedretti, Jim Ignowski, John Paul Strachan, Ruibang Luo, Can Li

    Abstract: Advances in third-generation sequencing have enabled portable and real-time genomic sequencing, but real-time data processing remains a bottleneck, hampering on-site genomic analysis due to prohibitive time and energy costs. These technologies generate a massive amount of noisy analog signals that traditionally require basecalling and digital mapping, both demanding frequent and costly data moveme… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 16 pages, 6 figures

  3. IMSSA: Deploying modern state-space models on memristive in-memory compute hardware

    Authors: Sebastian Siegel, Ming-Jay Yang, John-Paul Strachan

    Abstract: Processing long temporal sequences is a key challenge in deep learning. In recent years, Transformers have become state-of-the-art for this task, but suffer from excessive memory requirements due to the need to explicitly store the sequences. To address this issue, structured state-space sequential (S4) models recently emerged, offering a fixed memory state while still enabling the processing of v… ▽ More

    Submitted 28 December, 2024; originally announced December 2024.

    Comments: 5 pages, 4 figures, submitted to IEEE ISCAS 2025

  4. arXiv:2410.00946  [pdf, other

    eess.IV cs.LG

    Spectral Graph Sample Weighting for Interpretable Sub-cohort Analysis in Predictive Models for Neuroimaging

    Authors: Magdalini Paschali, Yu Hang Jiang, Spencer Siegel, Camila Gonzalez, Kilian M. Pohl, Akshay Chaudhari, Qingyu Zhao

    Abstract: Recent advancements in medicine have confirmed that brain disorders often comprise multiple subtypes of mechanisms, developmental trajectories, or severity levels. Such heterogeneity is often associated with demographic aspects (e.g., sex) or disease-related contributors (e.g., genetics). Thus, the predictive power of machine learning models used for symptom prediction varies across subjects based… ▽ More

    Submitted 5 October, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

  5. arXiv:2409.19315  [pdf, other

    cs.NE cs.AI cs.AR cs.ET

    Analog In-Memory Computing Attention Mechanism for Fast and Energy-Efficient Large Language Models

    Authors: Nathan Leroux, Paul-Philipp Manea, Chirag Sudarshan, Jan Finkbeiner, Sebastian Siegel, John Paul Strachan, Emre Neftci

    Abstract: Transformer networks, driven by self-attention, are central to Large Language Models. In generative Transformers, self-attention uses cache memory to store token projections, avoiding recomputation at each time step. However, GPU-stored projections must be loaded into SRAM for each new generation step, causing latency and energy bottlenecks. We present a custom self-attention in-memory computing… ▽ More

    Submitted 25 November, 2024; v1 submitted 28 September, 2024; originally announced September 2024.

    Comments: 25 pages, 6 figures, 1 table

  6. arXiv:2409.11363  [pdf, other

    cs.CL cs.AI cs.MA

    CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark

    Authors: Zachary S. Siegel, Sayash Kapoor, Nitya Nagdir, Benedikt Stroebl, Arvind Narayanan

    Abstract: AI agents have the potential to aid users on a variety of consequential tasks, including conducting scientific research. To spur the development of useful agents, we need benchmarks that are challenging, but more crucially, directly correspond to real-world tasks of interest. This paper introduces such a benchmark, designed to measure the accuracy of AI agents in tackling a crucial yet surprisingl… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: Benchmark harness and code available at http://github.com/siegelz/core-bench

  7. arXiv:2407.12883  [pdf, other

    cs.CL cs.AI cs.IR

    BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

    Authors: Hongjin Su, Howard Yen, Mengzhou Xia, Weijia Shi, Niklas Muennighoff, Han-yu Wang, Haisu Liu, Quan Shi, Zachary S. Siegel, Michael Tang, Ruoxi Sun, Jinsung Yoon, Sercan O. Arik, Danqi Chen, Tao Yu

    Abstract: Existing retrieval benchmarks primarily consist of information-seeking queries (e.g., aggregated questions from search engines) where keyword or semantic-based retrieval is usually sufficient. However, many complex real-world queries require in-depth reasoning to identify relevant documents that go beyond surface form matching. For example, finding documentation for a coding question requires unde… ▽ More

    Submitted 26 March, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 51 pages

  8. arXiv:2407.01502  [pdf, other

    cs.LG cs.AI

    AI Agents That Matter

    Authors: Sayash Kapoor, Benedikt Stroebl, Zachary S. Siegel, Nitya Nadgir, Arvind Narayanan

    Abstract: AI agents are an exciting new research direction, and agent development is driven by benchmarks. Our analysis of current agent benchmarks and evaluation practices reveals several shortcomings that hinder their usefulness in real-world applications. First, there is a narrow focus on accuracy without attention to other metrics. As a result, SOTA agents are needlessly complex and costly, and the comm… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  9. arXiv:2404.06344  [pdf, other

    cs.NE cond-mat.mtrl-sci eess.SP

    Synaptogen: A cross-domain generative device model for large-scale neuromorphic circuit design

    Authors: Tyler Hennen, Leon Brackmann, Tobias Ziegler, Sebastian Siegel, Stephan Menzel, Rainer Waser, Dirk J. Wouters, Daniel Bedau

    Abstract: We present a fast generative modeling approach for resistive memories that reproduces the complex statistical properties of real-world devices. To enable efficient modeling of analog circuits, the model is implemented in Verilog-A. By training on extensive measurement data of integrated 1T1R arrays (6,000 cycles of 512 devices), an autoregressive stochastic process accurately accounts for the cros… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Code is available at https://zenodo.org/doi/10.5281/zenodo.10942560

  10. The Ouroboros of Memristors: Neural Networks Facilitating Memristor Programming

    Authors: Zhenming Yu, Ming-Jay Yang, Jan Finkbeiner, Sebastian Siegel, John Paul Strachan, Emre Neftci

    Abstract: Memristive devices hold promise to improve the scale and efficiency of machine learning and neuromorphic hardware, thanks to their compact size, low power consumption, and the ability to perform matrix multiplications in constant time. However, on-chip training with memristor arrays still faces challenges, including device-to-device and cycle-to-cycle variations, switching non-linearity, and espec… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: This work is accepted at the 2024 IEEE AICAS

    Journal ref: 2024 AICAS, Abu Dhabi, United Arab Emirates, 2024, pp. 398-402

  11. arXiv:2403.06322  [pdf, other

    cs.CV cs.AI

    Leveraging Computer Vision in the Intensive Care Unit (ICU) for Examining Visitation and Mobility

    Authors: Scott Siegel, Jiaqing Zhang, Sabyasachi Bandyopadhyay, Subhash Nerella, Brandon Silva, Tezcan Baslanti, Azra Bihorac, Parisa Rashidi

    Abstract: Despite the importance of closely monitoring patients in the Intensive Care Unit (ICU), many aspects are still assessed in a limited manner due to the time constraints imposed on healthcare providers. For example, although excessive visitations during rest hours can potentially exacerbate the risk of circadian rhythm disruption and delirium, it is not captured in the ICU. Likewise, while mobility… ▽ More

    Submitted 12 July, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  12. arXiv:2312.15640  [pdf, other

    cs.DC cs.SE

    Report of the DOE/NSF Workshop on Correctness in Scientific Computing, June 2023, Orlando, FL

    Authors: Maya Gokhale, Ganesh Gopalakrishnan, Jackson Mayo, Santosh Nagarakatte, Cindy Rubio-González, Stephen F. Siegel

    Abstract: This report is a digest of the DOE/NSF Workshop on Correctness in Scientific Computing (CSC'23) held on June 17, 2023, as part of the Federated Computing Research Conference (FCRC) 2023. CSC was conceived by DOE and NSF to address the growing concerns about correctness among those who employ computational methods to perform large-scale scientific simulations. These concerns have escalated, given t… ▽ More

    Submitted 27 December, 2023; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: 36 pages. DOE/NSF Workshop on Correctness in Scientific Computing (CSC 2023) was a PLDI 2023 workshop

    ACM Class: B.8.1; C.1.4; D.0.3; D.0.4; D.1.3; D.2.1; D.2.5; D.3.1; G.1.2; J.2

  13. arXiv:2312.08566  [pdf, other

    cs.AI cs.CL cs.RO

    Learning adaptive planning representations with natural language guidance

    Authors: Lionel Wong, Jiayuan Mao, Pratyusha Sharma, Zachary S. Siegel, Jiahai Feng, Noa Korneev, Joshua B. Tenenbaum, Jacob Andreas

    Abstract: Effective planning in the real world requires not only world knowledge, but the ability to leverage that knowledge to build the right representation of the task at hand. Decades of hierarchical planning techniques have used domain-specific temporal action abstractions to support efficient and accurate planning, almost always relying on human priors and domain knowledge to decompose hard tasks into… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  14. arXiv:2308.08473  [pdf, other

    cs.SE

    DataRaceBench V1.4.1 and DataRaceBench-ML V0.1: Benchmark Suites for Data Race Detection

    Authors: Le Chen, Wenhao Wu, Stephen F. Siegel, Pei-Hung Lin, Chunhua Liao

    Abstract: Data races pose a significant threat in multi-threaded parallel applications due to their negative impact on program correctness. DataRaceBench, an open-source benchmark suite, is specifically crafted to assess these data race detection tools in a systematic and measurable manner. Machine learning techniques have recently demonstrated considerable potential in high-performance computing (HPC) prog… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  15. arXiv:2308.02400  [pdf, other

    cs.AR cs.CY

    Work-in-Progress: A Universal Instrumentation Platform for Non-Volatile Memories

    Authors: Felix Staudigl, Mohammed Hossein, Tobias Ziegler, Hazem Al Indari, Rebecca Pelke, Sebastian Siegel, Dirk J. Wouters, Dominik Sisejkovic, Jan Moritz Joseph, Rainer Leupers

    Abstract: Emerging non-volatile memories (NVMs) represent a disruptive technology that allows a paradigm shift from the conventional von Neumann architecture towards more efficient computing-in-memory (CIM) architectures. Several instrumentation platforms have been proposed to interface NVMs allowing the characterization of single cells and crossbar structures. However, these platforms suffer from low flexi… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  16. Transformers in Healthcare: A Survey

    Authors: Subhash Nerella, Sabyasachi Bandyopadhyay, Jiaqing Zhang, Miguel Contreras, Scott Siegel, Aysegul Bumin, Brandon Silva, Jessica Sena, Benjamin Shickel, Azra Bihorac, Kia Khezeli, Parisa Rashidi

    Abstract: With Artificial Intelligence (AI) increasingly permeating various aspects of society, including healthcare, the adoption of the Transformers neural network architecture is rapidly changing many applications. Transformer is a type of deep learning architecture initially developed to solve general-purpose Natural Language Processing (NLP) tasks and has subsequently been adapted in many fields, inclu… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

    Report number: 102900

    Journal ref: Transformers and large language models in healthcare: A review, Artificial Intelligence in Medicine, Volume 154, 2024, 102900,

  17. arXiv:2305.18198  [pdf, ps, other

    cs.PL cs.DC

    Model Checking Race-freedom When "Sequential Consistency for Data-race-free Programs" is Guaranteed

    Authors: Wenhao Wu, Jan Hückelheim, Paul D. Hovland, Ziqing Luo, Stephen F. Siegel

    Abstract: Many parallel programming models guarantee that if all sequentially consistent (SC) executions of a program are free of data races, then all executions of the program will appear to be sequentially consistent. This greatly simplifies reasoning about the program, but leaves open the question of how to verify that all SC executions are race-free. In this paper, we show that with a few simple modific… ▽ More

    Submitted 20 July, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  18. arXiv:2303.06252  [pdf

    cs.AI

    AI-Enhanced Intensive Care Unit: Revolutionizing Patient Care with Pervasive Sensing

    Authors: Subhash Nerella, Ziyuan Guan, Scott Siegel, Jiaqing Zhang, Ruilin Zhu, Kia Khezeli, Azra Bihorac, Parisa Rashidi

    Abstract: The intensive care unit (ICU) is a specialized hospital space where critically ill patients receive intensive care and monitoring. Comprehensive monitoring is imperative in assessing patients conditions, in particular acuity, and ultimately the quality of care. However, the extent of patient monitoring in the ICU is limited due to time constraints and the workload on healthcare providers. Currentl… ▽ More

    Submitted 21 November, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

  19. arXiv:2211.16592  [pdf, other

    cs.NE cs.ET

    Sequence learning in a spiking neuronal network with memristive synapses

    Authors: Younes Bouhadjar, Sebastian Siegel, Tom Tetzlaff, Markus Diesmann, Rainer Waser, Dirk J. Wouters

    Abstract: Brain-inspired computing proposes a set of algorithmic principles that hold promise for advancing artificial intelligence. They endow systems with self learning capabilities, efficient energy usage, and high storage capacity. A core concept that lies at the heart of brain computation is sequence learning and prediction. This form of computation is essential for almost all our daily tasks such as m… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 23 pages, 13 Figures

  20. Hierarchical Proxy Modeling for Improved HPO in Time Series Forecasting

    Authors: Arindam Jati, Vijay Ekambaram, Shaonli Pal, Brian Quanz, Wesley M. Gifford, Pavithra Harsha, Stuart Siegel, Sumanta Mukherjee, Chandra Narayanaswami

    Abstract: Selecting the right set of hyperparameters is crucial in time series forecasting. The classical temporal cross-validation framework for hyperparameter optimization (HPO) often leads to poor test performance because of a possible mismatch between validation and test periods. To address this test-validation mismatch, we propose a novel technique, H-Pro to drive HPO via test proxies by exploiting dat… ▽ More

    Submitted 2 November, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

  21. arXiv:2206.05794  [pdf, other

    cs.LG stat.ML

    SGD and Weight Decay Secretly Minimize the Rank of Your Neural Network

    Authors: Tomer Galanti, Zachary S. Siegel, Aparna Gupte, Tomaso Poggio

    Abstract: We investigate the inherent bias of Stochastic Gradient Descent (SGD) toward learning low-rank weight matrices during the training of deep neural networks. Our results demonstrate that training with mini-batch SGD and weight decay induces a bias toward rank minimization in the weight matrices. Specifically, we show both theoretically and empirically that this bias becomes more pronounced with smal… ▽ More

    Submitted 18 October, 2024; v1 submitted 12 June, 2022; originally announced June 2022.

  22. arXiv:1909.07502  [pdf

    cs.HC cs.CL cs.LG stat.ML

    Automatic Detection and Classification of Cognitive Distortions in Mental Health Text

    Authors: Benjamin Shickel, Scott Siegel, Martin Heesacker, Sherry Benton, Parisa Rashidi

    Abstract: In cognitive psychology, automatic and self-reinforcing irrational thought patterns are known as cognitive distortions. Left unchecked, patients exhibiting these types of thoughts can become stuck in negative feedback loops of unhealthy thinking, leading to inaccurate perceptions of reality commonly associated with anxiety and depression. In this paper, we present a machine learning framework for… ▽ More

    Submitted 22 September, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

  23. arXiv:1804.10201  [pdf, other

    cs.HC cs.AI cs.CV eess.SP

    The Intelligent ICU Pilot Study: Using Artificial Intelligence Technology for Autonomous Patient Monitoring

    Authors: Anis Davoudi, Kumar Rohit Malhotra, Benjamin Shickel, Scott Siegel, Seth Williams, Matthew Ruppert, Emel Bihorac, Tezcan Ozrazgat-Baslanti, Patrick J. Tighe, Azra Bihorac, Parisa Rashidi

    Abstract: Currently, many critical care indices are repetitively assessed and recorded by overburdened nurses, e.g. physical function or facial pain expressions of nonverbal patients. In addition, many essential information on patients and their environment are not captured at all, or are captured in a non-granular manner, e.g. sleep disturbance factors such as bright light, loud background noise, or excess… ▽ More

    Submitted 26 September, 2018; v1 submitted 25 April, 2018; originally announced April 2018.

  24. arXiv:1705.07478  [pdf, other

    cs.DC

    Report of the HPC Correctness Summit, Jan 25--26, 2017, Washington, DC

    Authors: Ganesh Gopalakrishnan, Paul D. Hovland, Costin Iancu, Sriram Krishnamoorthy, Ignacio Laguna, Richard A. Lethin, Koushik Sen, Stephen F. Siegel, Armando Solar-Lezama

    Abstract: Maintaining leadership in HPC requires the ability to support simulations at large scales and fidelity. In this study, we detail one of the most significant productivity challenges in achieving this goal, namely the increasing proclivity to bugs, especially in the face of growing hardware and software heterogeneity and sheer system scale. We identify key areas where timely new research must be pro… ▽ More

    Submitted 21 May, 2017; originally announced May 2017.

    Comments: 57 pages