Skip to main content

Showing 1–50 of 52 results for author: Rosing, T

Searching in archive cs. Search in all archives.
.
  1. Hybrid SLC-MLC RRAM Mixed-Signal Processing-in-Memory Architecture for Transformer Acceleration via Gradient Redistribution

    Authors: Chang Eun Song, Priyansh Bhatnagar, Zihan Xia, Nam Sung Kim, Tajana Rosing, Mingu Kang

    Abstract: Transformers, while revolutionary, face challenges due to their demanding computational cost and large data movement. To address this, we propose HyFlexPIM, a novel mixed-signal processing-in-memory (PIM) accelerator for inference that flexibly utilizes both single-level cell (SLC) and multi-level cell (MLC) RRAM technologies to trade-off accuracy and efficiency. HyFlexPIM achieves efficient dual-… ▽ More

    Submitted 20 May, 2025; originally announced June 2025.

    Comments: Accepted by ISCA'25

  2. arXiv:2505.15146  [pdf, ps, other

    cs.AI

    lmgame-Bench: How Good are LLMs at Playing Games?

    Authors: Lanxiang Hu, Mingjia Huo, Yuxuan Zhang, Haoyang Yu, Eric P. Xing, Ion Stoica, Tajana Rosing, Haojian Jin, Hao Zhang

    Abstract: Playing video games requires perception, memory, and planning, exactly the faculties modern large language model (LLM) agents are expected to master. We study the major challenges in using popular video games to evaluate modern LLMs and find that directly dropping LLMs into games cannot make an effective evaluation, for three reasons -- brittle vision perception, prompt sensitivity, and potential… ▽ More

    Submitted 3 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

  3. arXiv:2505.05413  [pdf, other

    cs.LG

    DPQ-HD: Post-Training Compression for Ultra-Low Power Hyperdimensional Computing

    Authors: Nilesh Prasad Pandey, Shriniwas Kulkarni, David Wang, Onat Gungor, Flavio Ponzina, Tajana Rosing

    Abstract: Hyperdimensional Computing (HDC) is emerging as a promising approach for edge AI, offering a balance between accuracy and efficiency. However, current HDC-based applications often rely on high-precision models and/or encoding matrices to achieve competitive performance, which imposes significant computational and memory demands, especially for ultra-low power devices. While recent efforts use tech… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  4. arXiv:2504.13301  [pdf, other

    cs.CR

    DYNAMITE: Dynamic Defense Selection for Enhancing Machine Learning-based Intrusion Detection Against Adversarial Attacks

    Authors: Jing Chen, Onat Gungor, Zhengli Shang, Elvin Li, Tajana Rosing

    Abstract: The rapid proliferation of the Internet of Things (IoT) has introduced substantial security vulnerabilities, highlighting the need for robust Intrusion Detection Systems (IDS). Machine learning-based intrusion detection systems (ML-IDS) have significantly improved threat detection capabilities; however, they remain highly susceptible to adversarial attacks. While numerous defense mechanisms have b… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: Accepted by the IEEE/ACM Workshop on the Internet of Safe Things (SafeThings 2025)

  5. arXiv:2504.01921  [pdf, other

    cs.LG stat.ML

    Client Selection in Federated Learning with Data Heterogeneity and Network Latencies

    Authors: Harsh Vardhan, Xiaofan Yu, Tajana Rosing, Arya Mazumdar

    Abstract: Federated learning (FL) is a distributed machine learning paradigm where multiple clients conduct local training based on their private data, then the updated models are sent to a central server for global aggregation. The practical convergence of FL is challenged by multiple factors, with the primary hurdle being the heterogeneity among clients. This heterogeneity manifests as data heterogeneity… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  6. arXiv:2503.07882  [pdf, other

    cs.LG cs.CR

    ReLATE: Resilient Learner Selection for Multivariate Time-Series Classification Against Adversarial Attacks

    Authors: Cagla Ipek Kocal, Onat Gungor, Aaron Tartz, Tajana Rosing, Baris Aksanli

    Abstract: Minimizing computational overhead in time-series classification, particularly in deep learning models, presents a significant challenge. This challenge is further compounded by adversarial attacks, emphasizing the need for resilient methods that ensure robust performance and efficient model selection. We introduce ReLATE, a framework that identifies robust learners based on dataset similarity, red… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Accepted by the AAAI-25 Workshop on Artificial Intelligence for Time Series Analysis (AI4TS)

  7. arXiv:2502.15901  [pdf, other

    cs.LG

    TS-OOD: Evaluating Time-Series Out-of-Distribution Detection and Prospective Directions for Progress

    Authors: Onat Gungor, Amanda Sofie Rios, Nilesh Ahuja, Tajana Rosing

    Abstract: Detecting out-of-distribution (OOD) data is a fundamental challenge in the deployment of machine learning models. From a security standpoint, this is particularly important because OOD test data can result in misleadingly confident yet erroneous predictions, which undermine the reliability of the deployed model. Although numerous models for OOD detection have been developed in computer vision and… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: Accepted for an oral presentation at AAAI-25 AI4TS

  8. arXiv:2502.15285  [pdf, other

    cs.SD cs.AI cs.DC cs.NI eess.AS

    Offload Rethinking by Cloud Assistance for Efficient Environmental Sound Recognition on LPWANs

    Authors: Le Zhang, Quanling Zhao, Run Wang, Shirley Bian, Onat Gungor, Flavio Ponzina, Tajana Rosing

    Abstract: Learning-based environmental sound recognition has emerged as a crucial method for ultra-low-power environmental monitoring in biological research and city-scale sensing systems. These systems usually operate under limited resources and are often powered by harvested energy in remote areas. Recent efforts in on-device sound recognition suffer from low accuracy due to resource constraints, whereas… ▽ More

    Submitted 21 March, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: Accepted by The 23rd ACM Conference on Embedded Networked Sensor Systems (SenSys '25)

  9. arXiv:2502.14094  [pdf, other

    cs.CR cs.LG

    CND-IDS: Continual Novelty Detection for Intrusion Detection Systems

    Authors: Sean Fuhrman, Onat Gungor, Tajana Rosing

    Abstract: Intrusion detection systems (IDS) play a crucial role in IoT and network security by monitoring system data and alerting to suspicious activities. Machine learning (ML) has emerged as a promising solution for IDS, offering highly accurate intrusion detection. However, ML-IDS solutions often overlook two critical aspects needed to build reliable systems: continually changing data streams and a lack… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: Accepted by the 62nd Design Automation Conference (DAC 2025)

  10. arXiv:2502.07119  [pdf, other

    cs.CR cs.LG

    SAFE: Self-Supervised Anomaly Detection Framework for Intrusion Detection

    Authors: Elvin Li, Zhengli Shang, Onat Gungor, Tajana Rosing

    Abstract: The proliferation of IoT devices has significantly increased network vulnerabilities, creating an urgent need for effective Intrusion Detection Systems (IDS). Machine Learning-based IDS (ML-IDS) offer advanced detection capabilities but rely on labeled attack data, which limits their ability to identify unknown threats. Self-Supervised Learning (SSL) presents a promising solution by using only nor… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: Accepted by the AAAI-25 Workshop on Artificial Intelligence for Cyber Security (AICS)

  11. arXiv:2502.02883  [pdf, other

    cs.AI cs.HC

    SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions

    Authors: Xiaofan Yu, Lanxiang Hu, Benjamin Reichman, Dylan Chu, Rushil Chandrupatla, Xiyuan Zhang, Larry Heck, Tajana Rosing

    Abstract: Natural language interaction with sensing systems is crucial for addressing users' personal concerns and providing health-related insights into their daily lives. When a user asks a question, the system automatically analyzes the full history of sensor data, extracts relevant information, and generates an appropriate response. However, existing systems are limited to short-duration (e.g., one minu… ▽ More

    Submitted 15 May, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: Under review

  12. arXiv:2501.04974  [pdf, other

    cs.CL cs.AI

    SensorQA: A Question Answering Benchmark for Daily-Life Monitoring

    Authors: Benjamin Reichman, Xiaofan Yu, Lanxiang Hu, Jack Truxal, Atishay Jain, Rushil Chandrupatla, Tajana Šimunić Rosing, Larry Heck

    Abstract: With the rapid growth in sensor data, effectively interpreting and interfacing with these data in a human-understandable way has become crucial. While existing research primarily focuses on learning classification models, fewer studies have explored how end users can actively extract useful insights from sensor data, often hindered by the lack of a proper dataset. To address this gap, we introduce… ▽ More

    Submitted 3 March, 2025; v1 submitted 9 January, 2025; originally announced January 2025.

  13. arXiv:2412.20993  [pdf, other

    cs.LG cs.CL

    Efficiently Scaling LLM Reasoning with Certaindex

    Authors: Yichao Fu, Junda Chen, Siqi Zhu, Zheyu Fu, Zhongdongming Dai, Yonghao Zhuang, Yian Ma, Aurick Qiao, Tajana Rosing, Ion Stoica, Hao Zhang

    Abstract: Test-time reasoning algorithms such as chain-of-thought, self-consistency, and MCTS enhance LLM problem-solving but can wastefully generate many tokens without improving accuracy. At the same time, we observe that these algorithms exhibit answer stabilization: their intermediate solutions often cease to change after a certain point, and further investment of compute does not change their final ans… ▽ More

    Submitted 27 May, 2025; v1 submitted 30 December, 2024; originally announced December 2024.

  14. arXiv:2412.11242  [pdf, other

    cs.LG cs.AI cs.CL

    TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs

    Authors: Lanxiang Hu, Tajana Rosing, Hao Zhang

    Abstract: Specializing large language models (LLMs) for local deployment in domain-specific use cases is necessary for strong performance while meeting latency and privacy constraints. However, conventional task-specific adaptation approaches do not show simultaneous memory saving and inference speedup at deployment time. Practical compression techniques like quantization and pruning require dedicated hardw… ▽ More

    Submitted 19 December, 2024; v1 submitted 15 December, 2024; originally announced December 2024.

  15. arXiv:2411.09760  [pdf, other

    cs.AR cs.ET eess.SP

    SpecPCM: A Low-power PCM-based In-Memory Computing Accelerator for Full-stack Mass Spectrometry Analysis

    Authors: Keming Fan, Ashkan Moradifirouzabadi, Xiangjin Wu, Zheyu Li, Flavio Ponzina, Anton Persson, Eric Pop, Tajana Rosing, Mingu Kang

    Abstract: Mass spectrometry (MS) is essential for proteomics and metabolomics but faces impending challenges in efficiently processing the vast volumes of data. This paper introduces SpecPCM, an in-memory computing (IMC) accelerator designed to achieve substantial improvements in energy and delay efficiency for both MS spectral clustering and database (DB) search. SpecPCM employs analog processing with low-… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

  16. arXiv:2411.02814  [pdf, other

    cs.PF cs.AR cs.DC cs.OS

    The Hitchhiker's Guide to Programming and Optimizing CXL-Based Heterogeneous Systems

    Authors: Zixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Tajana Rosing, Dean Tullsen, Steven Swanson, Kyung Chang Ryoo, Sungjoo Park, Jishen Zhao

    Abstract: We present a thorough analysis of the use of CXL-based heterogeneous systems. We built a cluster of server systems that combines different vendor's CPUs and various types of CXL devices. We further developed a heterogeneous memory benchmark suite, Heimdall, to profile the performance of such heterogeneous systems. By leveraging Heimdall, we unveiled the detailed architecture design in these system… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

  17. arXiv:2410.15179  [pdf, other

    cs.PL

    HPVM-HDC: A Heterogeneous Programming System for Accelerating Hyperdimensional Computing

    Authors: Russel Arbore, Xavier Routh, Abdul Rafae Noor, Akash Kothari, Haichao Yang, Weihong Xu, Sumukh Pinge, Vikram Adve, Tajana Rosing, Minxuan Zhou

    Abstract: Hyperdimensional Computing (HDC), a technique inspired by cognitive models of computation, has been proposed as an efficient and robust alternative basis for machine learning. HDC programs are often manually written in low-level and target specific languages targeting CPUs, GPUs, and FPGAs - these codes cannot be easily retargeted onto HDC-specific accelerators. No previous programming system enab… ▽ More

    Submitted 1 December, 2024; v1 submitted 19 October, 2024; originally announced October 2024.

  18. arXiv:2409.13361  [pdf, other

    cs.DC cs.AR

    RapidOMS: FPGA-based Open Modification Spectral Library Searching with HD Computing

    Authors: Sumukh Pinge, Weihong Xu, Wout Bittremieux, Niema Moshiri, Sang-Woo Jun, Tajana Rosing

    Abstract: Mass spectrometry (MS) is essential for protein analysis but faces significant challenges with large datasets and complex post-translational modifications, resulting in difficulties in spectral identification. Open Modification Search (OMS) improves the analysis of these modifications. We present RapidOMS, a solution leveraging the Samsung SmartSSD, which integrates SSD and FPGA in a near-storage… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  19. arXiv:2409.10918  [pdf, other

    cs.AR cs.LG

    FSL-HDnn: A 5.7 TOPS/W End-to-end Few-shot Learning Classifier Accelerator with Feature Extraction and Hyperdimensional Computing

    Authors: Haichao Yang, Chang Eun Song, Weihong Xu, Behnam Khaleghi, Uday Mallappa, Monil Shah, Keming Fan, Mingu Kang, Tajana Rosing

    Abstract: This paper introduces FSL-HDnn, an energy-efficient accelerator that implements the end-to-end pipeline of feature extraction, classification, and on-chip few-shot learning (FSL) through gradient-free learning techniques in a 40 nm CMOS process. At its core, FSL-HDnn integrates two low-power modules: Weight clustering feature extractor and Hyperdimensional Computing (HDC). Feature extractor utiliz… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: 4 pages, 12 figures, ESSERC 2024

  20. arXiv:2409.08369  [pdf, other

    cs.DC cs.AI cs.ET cs.PF

    E-QUARTIC: Energy Efficient Edge Ensemble of Convolutional Neural Networks for Resource-Optimized Learning

    Authors: Le Zhang, Onat Gungor, Flavio Ponzina, Tajana Rosing

    Abstract: Ensemble learning is a meta-learning approach that combines the predictions of multiple learners, demonstrating improved accuracy and robustness. Nevertheless, ensembling models like Convolutional Neural Networks (CNNs) result in high memory and computing overhead, preventing their deployment in embedded systems. These devices are usually equipped with small batteries that provide power supply and… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: Accepted by the 30th Asia and South Pacific Design Automation Conference (ASP-DAC 2025)

  21. arXiv:2407.00604  [pdf, other

    cs.AR

    Fast-OverlaPIM: A Fast Overlap-driven Mapping Framework for Processing In-Memory Neural Network Acceleration

    Authors: Xuan Wang, Minxuan Zhou, Tajana Rosing

    Abstract: Processing in-memory (PIM) is promising to accelerate neural networks (NNs) because it minimizes data movement and provides large computational parallelism. Similar to machine learning accelerators, application mapping, which determines the operation scheduling and data layout, plays a critical role in the NN acceleration on PIM. The mapping optimization of previous NN accelerators focused on opti… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: This work is accepted by IEEE TCAD

  22. arXiv:2405.02756  [pdf, other

    cs.AR

    Efficient Open Modification Spectral Library Searching in High-Dimensional Space with Multi-Level-Cell Memory

    Authors: Keming Fan, Wei-Chen Chen, Sumukh Pinge, H. -S. Philip Wong, Tajana Rosing

    Abstract: Open Modification Search (OMS) is a promising algorithm for mass spectrometry analysis that enables the discovery of modified peptides. However, OMS encounters challenges as it exponentially extends the search scope. Existing OMS accelerators either have limited parallelism or struggle to scale effectively with growing data volumes. In this work, we introduce an OMS accelerator utilizing multi-lev… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted by DAC'24

  23. arXiv:2404.00039  [pdf, other

    cs.PF cs.AI cs.LG cs.NE math.OC

    MicroHD: An Accuracy-Driven Optimization of Hyperdimensional Computing Algorithms for TinyML systems

    Authors: Flavio Ponzina, Tajana Rosing

    Abstract: Hyperdimensional computing (HDC) is emerging as a promising AI approach that can effectively target TinyML applications thanks to its lightweight computing and memory requirements. Previous works on HDC showed that limiting the standard 10k dimensions of the hyperdimensional space to much lower values is possible, reducing even more HDC resource requirements. Similarly, other studies demonstrated… ▽ More

    Submitted 23 March, 2024; originally announced April 2024.

    Comments: Accepted as a full paper by the tinyML Research Symposium 2024

  24. arXiv:2403.04759  [pdf, other

    cs.LG cs.NE

    Lifelong Intelligence Beyond the Edge using Hyperdimensional Computing

    Authors: Xiaofan Yu, Anthony Thomas, Ivannia Gomez Moreno, Louis Gutierrez, Tajana Rosing

    Abstract: On-device learning has emerged as a prevailing trend that avoids the slow response time and costly communication of cloud-based learning. The ability to learn continuously and indefinitely in a changing environment, and with resource constraints, is critical for real sensor deployments. However, existing designs are inadequate for practical scenarios with (i) streaming data input, (ii) lack of sup… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted by IPSN'24

  25. arXiv:2312.15966  [pdf, other

    cs.LG cs.DC

    Federated Hyperdimensional Computing

    Authors: Kazim Ergun, Rishikanth Chandrasekaran, Tajana Rosing

    Abstract: Federated learning (FL) enables a loose set of participating clients to collaboratively learn a global model via coordination by a central server and with no need for data sharing. Existing FL approaches that rely on complex algorithms with massive models, such as deep neural networks (DNNs), suffer from computation and communication bottlenecks. In this paper, we first propose FedHDC, a federated… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Submitted for publication, 20 pages

  26. arXiv:2312.04257  [pdf, other

    cs.AR

    Proxima: Near-storage Acceleration for Graph-based Approximate Nearest Neighbor Search in 3D NAND

    Authors: Weihong Xu, Junwei Chen, Po-Kai Hsu, Jaeyoung Kang, Minxuan Zhou, Sumukh Pinge, Shimeng Yu, Tajana Rosing

    Abstract: Approximate nearest neighbor search (ANNS) plays an indispensable role in a wide variety of applications, including recommendation systems, information retrieval, and semantic search. Among the cutting-edge ANNS algorithms, graph-based approaches provide superior accuracy and scalability on massive datasets. However, the best-performing graph-based ANN search solutions incur tens of hundreds of me… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  27. arXiv:2311.16293  [pdf, other

    cs.AR cs.CR

    FHEmem: A Processing In-Memory Accelerator for Fully Homomorphic Encryption

    Authors: Minxuan Zhou, Yujin Nam, Pranav Gangwar, Weihong Xu, Arpan Dutta, Kartikeyan Subramanyam, Chris Wilkerson, Rosario Cammarota, Saransh Gupta, Tajana Rosing

    Abstract: Fully Homomorphic Encryption (FHE) is a technique that allows arbitrary computations to be performed on encrypted data without the need for decryption, making it ideal for securing many emerging applications. However, FHE computation is significantly slower than computation on plain data due to the increase in data size after encryption. Processing In-Memory (PIM) is a promising technology that ca… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  28. arXiv:2311.12874  [pdf, other

    q-bio.QM cs.AR cs.DC cs.LG

    SpecHD: Hyperdimensional Computing Framework for FPGA-based Mass Spectrometry Clustering

    Authors: Sumukh Pinge, Weihong Xu, Jaeyoung Kang, Tianqi Zhang, Neima Moshiri, Wout Bittremieux, Tajana Rosing

    Abstract: Mass spectrometry-based proteomics is a key enabler for personalized healthcare, providing a deep dive into the complex protein compositions of biological systems. This technology has vast applications in biotechnology and biomedicine but faces significant computational bottlenecks. Current methodologies often require multiple hours or even days to process extensive datasets, particularly in the d… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  29. arXiv:2305.07205  [pdf, other

    cs.IR cs.AI cs.LG

    Mem-Rec: Memory Efficient Recommendation System using Alternative Representation

    Authors: Gopi Krishna Jha, Anthony Thomas, Nilesh Jain, Sameh Gobriel, Tajana Rosing, Ravi Iyer

    Abstract: Deep learning-based recommendation systems (e.g., DLRMs) are widely used AI models to provide high-quality personalized recommendations. Training data used for modern recommendation systems commonly includes categorical features taking on tens-of-millions of possible distinct values. These categorical tokens are typically assigned learned vector representations, that are stored in large embedding… ▽ More

    Submitted 14 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  30. arXiv:2303.15604  [pdf, other

    q-bio.BM cs.LG

    HD-Bind: Encoding of Molecular Structure with Low Precision, Hyperdimensional Binary Representations

    Authors: Derek Jones, Jonathan E. Allen, Xiaohua Zhang, Behnam Khaleghi, Jaeyoung Kang, Weihong Xu, Niema Moshiri, Tajana S. Rosing

    Abstract: Publicly available collections of drug-like molecules have grown to comprise 10s of billions of possibilities in recent history due to advances in chemical synthesis. Traditional methods for identifying ``hit'' molecules from a large collection of potential drug-like candidates have relied on biophysical theory to compute approximations to the Gibbs free energy of the binding interaction between t… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  31. arXiv:2301.09740  [pdf, other

    cs.CR cs.LG

    DODEM: DOuble DEfense Mechanism Against Adversarial Attacks Towards Secure Industrial Internet of Things Analytics

    Authors: Onat Gungor, Tajana Rosing, Baris Aksanli

    Abstract: Industrial Internet of Things (I-IoT) is a collaboration of devices, sensors, and networking equipment to monitor and collect data from industrial operations. Machine learning (ML) methods use this data to make high-level decisions with minimal human intervention. Data-driven predictive maintenance (PDM) is a crucial ML-based I-IoT application to find an optimal maintenance schedule for industrial… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  32. arXiv:2301.06646  [pdf, other

    cs.LG cs.DC cs.NI

    Async-HFL: Efficient and Robust Asynchronous Federated Learning in Hierarchical IoT Networks

    Authors: Xiaofan Yu, Ludmila Cherkasova, Harsh Vardhan, Quanling Zhao, Emily Ekaireb, Xiyuan Zhang, Arya Mazumdar, Tajana Rosing

    Abstract: Federated Learning (FL) has gained increasing interest in recent years as a distributed on-device learning paradigm. However, multiple challenges remain to be addressed for deploying FL in real-world Internet-of-Things (IoT) networks with hierarchies. Although existing works have proposed various approaches to account data heterogeneity, system heterogeneity, unexpected stragglers and scalibility,… ▽ More

    Submitted 10 April, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: Accepted by IoTDI'23

  33. arXiv:2211.16422  [pdf, other

    cs.DC

    Massively Parallel Open Modification Spectral Library Searching with Hyperdimensional Computing

    Authors: Jaeyoung Kang, Weihong Xu, Wout Bittremieux, Tajana Rosing

    Abstract: Mass spectrometry, commonly used for protein identification, generates a massive number of spectra that need to be matched against a large database. In reality, most of them remain unidentified or mismatched due to unexpected post-translational modifications. Open modification search (OMS) has been proposed as a strategy to improve the identification rate by considering every possible change in sp… ▽ More

    Submitted 31 December, 2022; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 6 pages, 7 figures, extension of PACT 2022 paper

  34. RAPIDx: High-performance ReRAM Processing in-Memory Accelerator for Sequence Alignment

    Authors: Weihong Xu, Saransh Gupta, Niema Moshiri, Tajana Rosing

    Abstract: Genome sequence alignment is the core of many biological applications. The advancement of sequencing technologies produces a tremendous amount of data, making sequence alignment a critical bottleneck in bioinformatics analysis. The existing hardware accelerators for alignment suffer from limited on-chip memory, costly data movement, and poorly optimized alignment algorithms. They cannot afford to… ▽ More

    Submitted 24 January, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

  35. arXiv:2209.09868  [pdf, other

    cs.LG cs.NE

    Streaming Encoding Algorithms for Scalable Hyperdimensional Computing

    Authors: Anthony Thomas, Behnam Khaleghi, Gopi Krishna Jha, Sanjoy Dasgupta, Nageen Himayat, Ravi Iyer, Nilesh Jain, Tajana Rosing

    Abstract: Hyperdimensional computing (HDC) is a paradigm for data representation and learning originating in computational neuroscience. HDC represents data as high-dimensional, low-precision vectors which can be used for a variety of information processing tasks like learning or recall. The mapping to high-dimensional space is a fundamental problem in HDC, and existing methods encounter scalability issues… ▽ More

    Submitted 8 February, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

  36. arXiv:2208.11266  [pdf, other

    cs.LG cs.AI

    SCALE: Online Self-Supervised Lifelong Learning without Prior Knowledge

    Authors: Xiaofan Yu, Yunhui Guo, Sicun Gao, Tajana Rosing

    Abstract: Unsupervised lifelong learning refers to the ability to learn over time while memorizing previous patterns without supervision. Although great progress has been made in this direction, existing work often assumes strong prior knowledge about the incoming data (e.g., knowing the class boundaries), which can be impossible to obtain in complex and unpredictable environments. In this paper, motivated… ▽ More

    Submitted 10 April, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted by CLVision'23

  37. arXiv:2204.12557  [pdf

    cs.CR cs.AR

    MemFHE: End-to-End Computing with Fully Homomorphic Encryption in Memory

    Authors: Saransh Gupta, Rosario Cammarota, Tajana Rosing

    Abstract: The increasing amount of data and the growing complexity of problems has resulted in an ever-growing reliance on cloud computing. However, many applications, most notably in healthcare, finance or defense, demand security and privacy which today's solutions cannot fully address. Fully homomorphic encryption (FHE) elevates the bar of today's solutions by adding confidentiality of data during proces… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

  38. arXiv:2203.08148  [pdf, other

    cs.CR cs.LG

    RES-HD: Resilient Intelligent Fault Diagnosis Against Adversarial Attacks Using Hyper-Dimensional Computing

    Authors: Onat Gungor, Tajana Rosing, Baris Aksanli

    Abstract: Industrial Internet of Things (I-IoT) enables fully automated production systems by continuously monitoring devices and analyzing collected data. Machine learning methods are commonly utilized for data analytics in such systems. Cyber-attacks are a grave threat to I-IoT as they can manipulate legitimate inputs, corrupting ML predictions and causing disruptions in the production systems. Hyper-dime… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  39. A Theoretical Perspective on Hyperdimensional Computing

    Authors: Anthony Thomas, Sanjoy Dasgupta, Tajana Rosing

    Abstract: Hyperdimensional (HD) computing is a set of neurally inspired methods for obtaining high-dimensional, low-precision, distributed representations of data. These representations can be combined with simple, neurally plausible algorithms to effect a variety of information processing tasks. HD computing has recently garnered significant interest from the computer hardware community as an energy-effici… ▽ More

    Submitted 17 February, 2022; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: Updates with published version

    Journal ref: Journal of Artificial Intelligence Research 72 (2021): 215-249

  40. arXiv:2008.04449  [pdf, ps, other

    cs.CR cs.AI cs.AR cs.CY cs.LG

    Trustworthy AI Inference Systems: An Industry Research View

    Authors: Rosario Cammarota, Matthias Schunter, Anand Rajan, Fabian Boemer, Ágnes Kiss, Amos Treiber, Christian Weinert, Thomas Schneider, Emmanuel Stapf, Ahmad-Reza Sadeghi, Daniel Demmler, Joshua Stock, Huili Chen, Siam Umar Hussain, Sadegh Riazi, Farinaz Koushanfar, Saransh Gupta, Tajan Simunic Rosing, Kamalika Chaudhuri, Hamid Nejatollahi, Nikil Dutt, Mohsen Imani, Kim Laine, Anuj Dubey, Aydin Aysu , et al. (4 additional authors not shown)

    Abstract: In this work, we provide an industry research view for approaching the design, deployment, and operation of trustworthy Artificial Intelligence (AI) inference systems. Such systems provide customers with timely, informed, and customized inferences to aid their decision, while at the same time utilizing appropriate security protection mechanisms for AI models. Additionally, such systems should also… ▽ More

    Submitted 10 February, 2023; v1 submitted 10 August, 2020; originally announced August 2020.

  41. arXiv:2007.10330  [pdf, other

    cs.LG cs.AR

    SHEARer: Highly-Efficient Hyperdimensional Computing by Software-Hardware Enabled Multifold Approximation

    Authors: Behnam Khaleghi, Sahand Salamat, Anthony Thomas, Fatemeh Asgarinejad, Yeseong Kim, Tajana Rosing

    Abstract: Hyperdimensional computing (HD) is an emerging paradigm for machine learning based on the evidence that the brain computes on high-dimensional, distributed, representations of data. The main operation of HD is encoding, which transfers the input data to hyperspace by mapping each input feature to a hypervector, accompanied by so-called bundling procedure that simply adds up the hypervectors to rea… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: A shorter version is accepted in ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED 2020)

  42. arXiv:2005.06716  [pdf, other

    cs.LG cs.CR stat.ML

    Prive-HD: Privacy-Preserved Hyperdimensional Computing

    Authors: Behnam Khaleghi, Mohsen Imani, Tajana Rosing

    Abstract: The privacy of data is a major challenge in machine learning as a trained model may expose sensitive information of the enclosed dataset. Besides, the limited computation capability and capacity of edge devices have made cloud-hosted inference inevitable. Sending private information to remote servers makes the privacy of inference also vulnerable because of susceptible communication channels or ev… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: Accepted in Design Automation Conference (DAC) 2020

  43. arXiv:2002.02394  [pdf, other

    q-bio.QM cs.AR q-bio.GN

    FPGA Acceleration of Sequence Alignment: A Survey

    Authors: Sahand Salamat, Tajana Rosing

    Abstract: Genomics is changing our understanding of humans, evolution, diseases, and medicines to name but a few. As sequencing technology is developed collecting DNA sequences takes less time thereby generating more genetic data every day. Today the rate of generating genetic data is outpacing the rate of computation power growth. Current sequencing machines can sequence 50 humans genome per day; however,… ▽ More

    Submitted 27 July, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

  44. arXiv:1912.07200  [pdf, other

    cs.CV cs.LG

    A Broader Study of Cross-Domain Few-Shot Learning

    Authors: Yunhui Guo, Noel C. Codella, Leonid Karlinsky, James V. Codella, John R. Smith, Kate Saenko, Tajana Rosing, Rogerio Feris

    Abstract: Recent progress on few-shot learning largely relies on annotated data for meta-learning: base classes sampled from the same domain as the novel classes. However, in many applications, collecting data for meta-learning is infeasible or impossible. This leads to the cross-domain few-shot learning problem, where there is a large shift between base and novel class domains. While investigations of the… ▽ More

    Submitted 17 July, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: ECCV 2020. Website: https://www.learning-with-limited-labels.com/

  45. arXiv:1911.12446  [pdf, other

    cs.LG cs.NE stat.ML

    QubitHD: A Stochastic Acceleration Method for HD Computing-Based Machine Learning

    Authors: Samuel Bosch, Alexander Sanchez de la Cerda, Mohsen Imani, Tajana Simunic Rosing, Giovanni De Micheli

    Abstract: Machine Learning algorithms based on Brain-inspired Hyperdimensional(HD) computing imitate cognition by exploiting statistical properties of high-dimensional vector spaces. It is a promising solution for achieving high energy efficiency in different machine learning tasks, such as classification, semi-supervised learning, and clustering. A weakness of existing HD computing-based ML algorithms is t… ▽ More

    Submitted 10 October, 2022; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: 8 pages, 5 figures, 3 tables

  46. arXiv:1911.09659  [pdf, other

    cs.CV

    AdaFilter: Adaptive Filter Fine-tuning for Deep Transfer Learning

    Authors: Yunhui Guo, Yandong Li, Liqiang Wang, Tajana Rosing

    Abstract: There is an increasing number of pre-trained deep neural network models. However, it is still unclear how to effectively use these models for a new task. Transfer learning, which aims to transfer knowledge from source tasks to a target task, is an effective solution to this problem. Fine-tuning is a popular transfer learning technique for deep neural networks where a few rounds of training are app… ▽ More

    Submitted 8 December, 2019; v1 submitted 21 November, 2019; originally announced November 2019.

  47. arXiv:1911.07187  [pdf, other

    cs.AR

    FPGA Energy Efficiency by Leveraging Thermal Margin

    Authors: Behnam Khaleghi, Sahand Salamat, Mohsen Imani, Tajana Rosing

    Abstract: Cutting edge FPGAs are not energy efficient as conventionally presumed to be, and therefore, aggressive power-saving techniques have become imperative. The clock rate of an FPGA-mapped design is set based on worst-case conditions to ensure reliable operation under all circumstances. This usually leaves a considerable timing margin that can be exploited to reduce power consumption by scaling voltag… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: Accepted in IEEE International Conference on Computer Design (ICCD) 2019

  48. arXiv:1909.11763  [pdf, other

    cs.LG stat.ML

    Improved Schemes for Episodic Memory-based Lifelong Learning

    Authors: Yunhui Guo, Mingrui Liu, Tianbao Yang, Tajana Rosing

    Abstract: Current deep neural networks can achieve remarkable performance on a single task. However, when the deep neural network is continually trained on a sequence of tasks, it seems to gradually forget the previous learned knowledge. This phenomenon is referred to as \textit{catastrophic forgetting} and motivates the field called lifelong learning. Recently, episodic memory based approaches such as GEM… ▽ More

    Submitted 14 December, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: NeurIPS 2020, Spotlight. 17 pages. Code: https://github.com/yunhuiguo/MEGA

  49. arXiv:1908.06519  [pdf, other

    cs.AR cs.PF

    Workload-Aware Opportunistic Energy Efficiency in Multi-FPGA Platforms

    Authors: Sahand Salamat, Behnam Khaleghi, Mohsen Imani, Tajana Rosing

    Abstract: The continuous growth of big data applications with high computational and scalability demands has resulted in increasing popularity of cloud computing. Optimizing the performance and power consumption of cloud resources is therefore crucial to relieve the costs of data centers. In recent years, multi-FPGA platforms have gained traction in data centers as low-cost yet high-performance solutions pa… ▽ More

    Submitted 28 October, 2019; v1 submitted 18 August, 2019; originally announced August 2019.

    Comments: The paper will be published in ICCAD 2019

  50. arXiv:1902.00927  [pdf, other

    cs.CV

    Depthwise Convolution is All You Need for Learning Multiple Visual Domains

    Authors: Yunhui Guo, Yandong Li, Rogerio Feris, Liqiang Wang, Tajana Rosing

    Abstract: There is a growing interest in designing models that can deal with images from different visual domains. If there exists a universal structure in different visual domains that can be captured via a common parameterization, then we can use a single model for all domains rather than one model per domain. A model aware of the relationships between different domains can also be trained to work on new… ▽ More

    Submitted 19 February, 2019; v1 submitted 3 February, 2019; originally announced February 2019.