Skip to main content

Showing 1–18 of 18 results for author: Chaterji, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.20988  [pdf, other

    cs.LG cs.AI cs.DC

    Hubs and Spokes Learning: Efficient and Scalable Collaborative Machine Learning

    Authors: Atul Sharma, Kavindu Herath, Saurabh Bagchi, Chaoyue Liu, Somali Chaterji

    Abstract: We introduce the Hubs and Spokes Learning (HSL) framework, a novel paradigm for collaborative machine learning that combines the strengths of Federated Learning (FL) and Decentralized Learning (P2PL). HSL employs a two-tier communication structure that avoids the single point of failure inherent in FL and outperforms the state-of-the-art P2PL framework, Epidemic Learning Local (ELL). At equal comm… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  2. arXiv:2503.10905  [pdf, other

    cs.AI cs.CV cs.LG

    Learning to Inference Adaptively for Multimodal Large Language Models

    Authors: Zhuoyan Xu, Khoi Duc Nguyen, Preeti Mukherjee, Saurabh Bagchi, Somali Chaterji, Yingyu Liang, Yin Li

    Abstract: Multimodal Large Language Models (MLLMs) have shown impressive capabilities in reasoning, yet come with substantial computational cost, limiting their deployment in resource-constrained settings. Despite recent efforts on improving the efficiency of MLLMs, prior solutions fall short in responding to varying runtime conditions, in particular changing resource availability (e.g., contention due to t… ▽ More

    Submitted 17 March, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  3. arXiv:2503.08010  [pdf, other

    cs.CV cs.AI

    SKALD: Learning-Based Shot Assembly for Coherent Multi-Shot Video Creation

    Authors: Chen Yi Lu, Md Mehrab Tanjim, Ishita Dasgupta, Somdeb Sarkhel, Gang Wu, Saayan Mitra, Somali Chaterji

    Abstract: We present SKALD, a multi-shot video assembly method that constructs coherent video sequences from candidate shots with minimal reliance on text. Central to our approach is the Learned Clip Assembly (LCA) score, a learning-based metric that measures temporal and semantic relationships between shots to quantify narrative coherence. We tackle the exponential complexity of combining multiple shots wi… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  4. arXiv:2405.03636  [pdf, other

    cs.CR cs.LG

    The Federation Strikes Back: A Survey of Federated Learning Privacy Attacks, Defenses, Applications, and Policy Landscape

    Authors: Joshua C. Zhao, Saurabh Bagchi, Salman Avestimehr, Kevin S. Chan, Somali Chaterji, Dimitris Dimitriadis, Jiacheng Li, Ninghui Li, Arash Nourian, Holger R. Roth

    Abstract: Deep learning has shown incredible potential across a wide array of tasks, and accompanied by this growth has been an insatiable appetite for data. However, a large amount of data needed for enabling deep learning is stored on personal devices, and recent concerns on privacy have further highlighted challenges for accessing such data. As a result, federated learning (FL) has emerged as an importan… ▽ More

    Submitted 22 March, 2025; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted to ACM Computing Surveys; 35 pages

    ACM Class: I.2; H.4; I.5

  5. arXiv:2112.13076  [pdf, other

    cs.CV cs.AI

    Virtuoso: Video-based Intelligence for real-time tuning on SOCs

    Authors: Jayoung Lee, PengCheng Wang, Ran Xu, Venkat Dasari, Noah Weston, Yin Li, Saurabh Bagchi, Somali Chaterji

    Abstract: Efficient and adaptive computer vision systems have been proposed to make computer vision tasks, such as image classification and object detection, optimized for embedded or mobile devices. These solutions, quite recent in their origin, focus on optimizing the model (a deep neural network, DNN) or the system by designing an adaptive system with approximation knobs. In spite of several recent effor… ▽ More

    Submitted 24 December, 2021; originally announced December 2021.

    Comments: 28 pages, 15 figures, 4 tables, ACM-TODAES

  6. arXiv:2112.10068  [pdf, other

    q-bio.GN cs.LG

    Lerna: Transformer Architectures for Configuring Error Correction Tools for Short- and Long-Read Genome Sequencing

    Authors: Atul Sharma, Pranjal Jain, Ashraf Mahgoub, Zihan Zhou, Kanak Mahadik, Somali Chaterji

    Abstract: Sequencing technologies are prone to errors, making error correction (EC) necessary for downstream applications. EC tools need to be manually configured for optimal performance. We find that the optimal parameters (e.g., k-mer size) are both tool- and dataset-dependent. Moreover, evaluating the performance (i.e., Alignment-rate or Gain) of a given tool usually relies on a reference genome, but qua… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

    Comments: 26 pages, 5 figures, 10 tables. Accepted to BMC Bioinformatics

  7. arXiv:2111.09422  [pdf, other

    cs.NI

    ORPHEUS: Living Labs for End-to-End Data Infrastructures for Digital Agriculture

    Authors: Pengcheng Wang, Edgardo Barsallo Yi, Tomas Ratkus, Somali Chaterji

    Abstract: IoT networks are being used to collect, analyze, and utilize sensor data. There are still some key requirements to leverage IoT networks in digital agriculture, e.g., design and deployment of energy saving and ruggedized sensor nodes (SN), reliable and long-range wireless network connectivity, end-to-end data collection pipelines for batch and streaming data. Thus, we introduce our living lab ORPH… ▽ More

    Submitted 4 October, 2021; originally announced November 2021.

  8. arXiv:2110.10108  [pdf, other

    cs.LG

    TESSERACT: Gradient Flip Score to Secure Federated Learning Against Model Poisoning Attacks

    Authors: Atul Sharma, Wei Chen, Joshua Zhao, Qiang Qiu, Somali Chaterji, Saurabh Bagchi

    Abstract: Federated learning---multi-party, distributed learning in a decentralized environment---is vulnerable to model poisoning attacks, even more so than centralized learning approaches. This is because malicious clients can collude and send in carefully tailored model updates to make the global model inaccurate. This motivated the development of Byzantine-resilient federated learning algorithms, such a… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 12 pages

  9. arXiv:2107.12147  [pdf, other

    cs.DC cs.CV

    Federated Action Recognition on Heterogeneous Embedded Devices

    Authors: Pranjal Jain, Shreyas Goenka, Saurabh Bagchi, Biplab Banerjee, Somali Chaterji

    Abstract: Federated learning allows a large number of devices to jointly learn a model without sharing data. In this work, we enable clients with limited computing power to perform action recognition, a computationally heavy task. We first perform model compression at the central server through knowledge distillation on a large dataset. This allows the model to learn complex features and serves as an initia… ▽ More

    Submitted 17 July, 2021; originally announced July 2021.

    Comments: 13 pages, 12 figures

    Journal ref: IEEE Transactions on Artificial Intelligence 2021

  10. arXiv:2107.05090  [pdf, other

    cs.NI eess.SP

    Ambrosia: Reduction in Data Transfer from Sensor to Server for Increased Lifetime of IoT Sensor Nodes

    Authors: Shikhar Suryavansh, Abu Benna, Chris Guest, Somali Chaterji

    Abstract: Data transmission accounts for significant energy consumption in wireless sensor networks where streaming data is generatedby the sensors. This impedes their use in many settings, including livestock monitoring over large pastures (which formsour target application). We present Ambrosia, a lightweight protocol that utilizes a window-based timeseries forecastingmechanism for data reduction. Ambrosi… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: 13 pages, 7 figures, Nature Scientific Reports

  11. arXiv:2012.04880  [pdf, other

    cs.CV cs.DC

    JANUS: Benchmarking Commercial and Open-Source Cloud and Edge Platforms for Object and Anomaly Detection Workloads

    Authors: Karthick Shankar, Pengcheng Wang, Ran Xu, Ashraf Mahgoub, Somali Chaterji

    Abstract: With diverse IoT workloads, placing compute and analytics close to where data is collected is becoming increasingly important. We seek to understand what is the performance and the cost implication of running analytics on IoT data at the various available platforms. These workloads can be compute-light, such as outlier detection on sensor data, or compute-intensive, such as object detection from v… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: Appeared at the IEEE Cloud 2020 conference. 10 pages

    Journal ref: "JANUS: Benchmarking Commercial and Open-Source Cloud and Edge Platforms for Object and Anomaly Detection Workloads," IEEE International Conference on Cloud Computing (IEEE Cloud), pp. 1--10, Oct 18-24, 2020

  12. arXiv:2010.10754  [pdf, other

    cs.CV

    ApproxDet: Content and Contention-Aware Approximate Object Detection for Mobiles

    Authors: Ran Xu, Chen-lin Zhang, Pengcheng Wang, Jayoung Lee, Subrata Mitra, Somali Chaterji, Yin Li, Saurabh Bagchi

    Abstract: Advanced video analytic systems, including scene classification and object detection, have seen widespread success in various domains such as smart cities and autonomous transportation. With an ever-growing number of powerful client devices, there is incentive to move these heavy video analytics workloads from the cloud to mobile devices to achieve low latency and real-time processing and to prese… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted to appear at the 18th ACM Conference on Embedded Networked Sensor Systems (SenSys), 2020

  13. Hybrid Low-Power Wide-Area Mesh Network for IoT Applications

    Authors: Xiaofan Jiang, Heng zhang, Edgardo Alberto Barsallo Yi, Nithin Raghunathan, Charilaos Mousoulis, Somali Chaterji, Dimitrios Peroulis, Ali Shakouri, Saurabh Bagchi

    Abstract: The recent advancement of the Internet of Things (IoT) enables the possibility of data collection from diverse environments using IoT devices. However, despite the rapid advancement of low-power communication technologies, the deployment of IoT networks still faces many challenges. In this paper, we propose a hybrid, low-power, wide-area network (LPWAN) structure that can achieve wide-area communi… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

  14. arXiv:2001.09786  [pdf, other

    cs.CY cs.AI cs.DB cs.LG

    Artificial Intelligence for Digital Agriculture at Scale: Techniques, Policies, and Challenges

    Authors: Somali Chaterji, Nathan DeLay, John Evans, Nathan Mosier, Bernard Engel, Dennis Buckmaster, Ranveer Chandra

    Abstract: Digital agriculture has the promise to transform agricultural throughput. It can do this by applying data science and engineering for mapping input factors to crop throughput, while bounding the available resources. In addition, as the data volumes and varieties increase with the increase in sensor deployment in agricultural fields, data engineering techniques will also be instrumental in collecti… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: 15 pages, 1 figure

  15. arXiv:2001.00090  [pdf, other

    cs.CY cs.DC eess.SY

    Resilient Cyberphysical Systems and their Application Drivers: A Technology Roadmap

    Authors: Somali Chaterji, Parinaz Naghizadeh, Muhammad Ashraful Alam, Saurabh Bagchi, Mung Chiang, David Corman, Brian Henz, Suman Jana, Na Li, Shaoshuai Mou, Meeko Oishi, Chunyi Peng, Tiark Rompf, Ashutosh Sabharwal, Shreyas Sundaram, James Weimer, Jennifer Weller

    Abstract: Cyberphysical systems (CPS) are ubiquitous in our personal and professional lives, and they promise to dramatically improve micro-communities (e.g., urban farms, hospitals), macro-communities (e.g., cities and metropolises), urban structures (e.g., smart homes and cars), and living structures (e.g., human bodies, synthetic genomes). The question that we address in this article pertains to designin… ▽ More

    Submitted 19 December, 2019; originally announced January 2020.

    Comments: 36 pages, 2 figures, NSF-supported workshop on Grand Challenges in Resilience, held at Purdue, March 20-21, 2019

    MSC Class: C.5.3; D.4.5; H.4.0 ACM Class: C.5.3; D.4.5; H.4.0

  16. arXiv:1912.11598  [pdf, other

    cs.CR cs.CY cs.NI

    Grand Challenges in Resilience: Autonomous System Resilience through Design and Runtime Measures

    Authors: Saurabh Bagchi, Vaneet Aggarwal, Somali Chaterji, Fred Douglis, Aly El Gamal, Jiawei Han, Brian J. Henz, Hank Hoffmann, Suman Jana, Milind Kulkarni, Felix Xiaozhu Lin, Karen Marais, Prateek Mittal, Shaoshuai Mou, Xiaokang Qiu, Gesualdo Scutari

    Abstract: A set of about 80 researchers, practitioners, and federal agency program managers participated in the NSF-sponsored Grand Challenges in Resilience Workshop held on Purdue campus on March 19-21, 2019. The workshop was divided into three themes: resilience in cyber, cyber-physical, and socio-technical systems. About 30 attendees in all participated in the discussions of cyber resilience. This articl… ▽ More

    Submitted 9 May, 2020; v1 submitted 25 December, 2019; originally announced December 2019.

    ACM Class: C.4; D.4.5

    Journal ref: IEEE Open Journal of the Computer Society, 2020

  17. arXiv:1909.02068  [pdf, other

    cs.CV eess.IV

    ApproxNet: Content and Contention-Aware Video Analytics System for Embedded Clients

    Authors: Ran Xu, Rakesh Kumar, Pengcheng Wang, Peter Bai, Ganga Meghanath, Somali Chaterji, Subrata Mitra, Saurabh Bagchi

    Abstract: Videos take a lot of time to transport over the network, hence running analytics on the live video on embedded or mobile devices has become an important system driver. Considering that such devices, e.g., surveillance cameras or AR/VR gadgets, are resource constrained, creating lightweight deep neural networks (DNNs) for embedded devices is crucial. None of the current approximation techniques for… ▽ More

    Submitted 14 July, 2021; v1 submitted 28 August, 2019; originally announced September 2019.

    Comments: This paper has been accepted to appear in ACM Transactions on Sensor Networks in 2021

  18. arXiv:1812.11467  [pdf, other

    cs.NE q-bio.GN

    ATHENA: Automated Tuning of Genomic Error Correction Algorithms using Language Models

    Authors: Mustafa Abdallah, Ashraf Mahgoub, Saurabh Bagchi, Somali Chaterji

    Abstract: The performance of most error-correction algorithms that operate on genomic sequencer reads is dependent on the proper choice of its configuration parameters, such as the value of k in k-mer based techniques. In this work, we target the problem of finding the best values of these configuration parameters to optimize error correction. We perform this in a data-driven manner, due to the observation… ▽ More

    Submitted 29 December, 2018; originally announced December 2018.

    Comments: 10 main pages, 7 references and appendices