Skip to main content

Showing 1–11 of 11 results for author: Ravikumar, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.09721  [pdf, other

    cs.LG cs.AI

    Finding the Muses: Identifying Coresets through Loss Trajectories

    Authors: Manish Nagaraj, Deepak Ravikumar, Efstathia Soufleri, Kaushik Roy

    Abstract: Deep learning models achieve state-of-the-art performance across domains but face scalability challenges in real-time or resource-constrained scenarios. To address this, we propose Loss Trajectory Correlation (LTC), a novel metric for coreset selection that identifies critical training samples driving generalization. $LTC$ quantifies the alignment between training sample loss trajectories and vali… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  2. Intelligent Pooling: Proactive Resource Provisioning in Large-scale Cloud Service

    Authors: Deepak Ravikumar, Alex Yeo, Yiwen Zhu, Aditya Lakra, Harsha Nagulapalli, Santhosh Kumar Ravindran, Steve Suh, Niharika Dutta, Andrew Fogarty, Yoonjae Park, Sumeet Khushalani, Arijit Tarafdar, Kunal Parekh, Subru Krishnan

    Abstract: The proliferation of big data and analytic workloads has driven the need for cloud compute and cluster-based job processing. With Apache Spark, users can process terabytes of data at ease with hundreds of parallel executors. At Microsoft, we aim at providing a fast and succinct interface for users to run Spark applications, such as through creating simple notebook "sessions" by abstracting the und… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

    Journal ref: Proceedings of the VLDB Endowment, Vol. 17, No. 7 ISSN 2150-8097, 2024

  3. arXiv:2407.02747  [pdf, other

    cs.LG cs.CR

    Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature

    Authors: Deepak Ravikumar, Efstathia Soufleri, Kaushik Roy

    Abstract: In this paper, we explore the properties of loss curvature with respect to input data in deep neural networks. Curvature of loss with respect to input (termed input loss curvature) is the trace of the Hessian of the loss with respect to the input. We investigate how input loss curvature varies between train and test sets, and its implications for train-test distinguishability. We develop a theoret… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  4. arXiv:2407.02713  [pdf, other

    cs.CV cs.LG

    Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation

    Authors: Efstathia Soufleri, Deepak Ravikumar, Kaushik Roy

    Abstract: Compressed video action recognition classifies video samples by leveraging the different modalities in compressed videos, namely motion vectors, residuals, and intra-frames. For this purpose, three neural networks are deployed, each dedicated to processing one modality. Our observations indicate that the network processing intra-frames tend to converge to a flatter minimum than the network process… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2403.08618  [pdf, other

    cs.LG cs.AI stat.ML

    SAP: Corrective Machine Unlearning with Scaled Activation Projection for Label Noise Robustness

    Authors: Sangamesh Kodge, Deepak Ravikumar, Gobinda Saha, Kaushik Roy

    Abstract: Label corruption, where training samples are mislabeled due to non-expert annotation or adversarial attacks, significantly degrades model performance. Acquiring large, perfectly labeled datasets is costly, and retraining models from scratch is computationally expensive. To address this, we introduce Scaled Activation Projection (SAP), a novel SVD (Singular Value Decomposition)-based corrective mac… ▽ More

    Submitted 2 January, 2025; v1 submitted 13 March, 2024; originally announced March 2024.

  6. arXiv:2402.18726  [pdf, other

    cs.LG cs.AI cs.CR

    Unveiling Privacy, Memorization, and Input Curvature Links

    Authors: Deepak Ravikumar, Efstathia Soufleri, Abolfazl Hashemi, Kaushik Roy

    Abstract: Deep Neural Nets (DNNs) have become a pervasive tool for solving many emerging problems. However, they tend to overfit to and memorize the training set. Memorization is of keen interest since it is closely related to several concepts such as generalization, noisy learning, and privacy. To study memorization, Feldman (2019) proposed a formal score, however its computational requirements limit its p… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  7. arXiv:2307.05831  [pdf, other

    cs.LG cs.AI

    Memorization Through the Lens of Curvature of Loss Function Around Samples

    Authors: Isha Garg, Deepak Ravikumar, Kaushik Roy

    Abstract: Deep neural networks are over-parameterized and easily overfit the datasets they train on. In the extreme case, it has been shown that these networks can memorize a training set with fully randomized labels. We propose using the curvature of loss function around each training sample, averaged over training epochs, as a measure of memorization of the sample. We use this metric to study the generali… ▽ More

    Submitted 1 October, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Preprint

  8. arXiv:2304.04326  [pdf, other

    cs.LG cs.DC

    Homogenizing Non-IID datasets via In-Distribution Knowledge Distillation for Decentralized Learning

    Authors: Deepak Ravikumar, Gobinda Saha, Sai Aparna Aketi, Kaushik Roy

    Abstract: Decentralized learning enables serverless training of deep neural networks (DNNs) in a distributed manner on multiple nodes. This allows for the use of large datasets, as well as the ability to train with a wide variety of data sources. However, one of the key challenges with decentralized learning is heterogeneity in the data distribution across the nodes. In this paper, we propose In-Distributio… ▽ More

    Submitted 24 February, 2024; v1 submitted 9 April, 2023; originally announced April 2023.

  9. arXiv:2205.03493  [pdf, other

    cs.LG cs.CV

    Norm-Scaling for Out-of-Distribution Detection

    Authors: Deepak Ravikumar, Kaushik Roy

    Abstract: Out-of-Distribution (OoD) inputs are examples that do not belong to the true underlying distribution of the dataset. Research has shown that deep neural nets make confident mispredictions on OoD inputs. Therefore, it is critical to identify OoD inputs for safe and reliable deployment of deep neural nets. Often a threshold is applied on a similarity score to detect OoD inputs. One such similarity i… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

  10. arXiv:2012.08398  [pdf, other

    cs.LG cs.CV

    Exploring Vicinal Risk Minimization for Lightweight Out-of-Distribution Detection

    Authors: Deepak Ravikumar, Sangamesh Kodge, Isha Garg, Kaushik Roy

    Abstract: Deep neural networks have found widespread adoption in solving complex tasks ranging from image recognition to natural language processing. However, these networks make confident mispredictions when presented with data that does not belong to the training distribution, i.e. out-of-distribution (OoD) samples. In this paper we explore whether the property of Vicinal Risk Minimization (VRM) to smooth… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

  11. arXiv:2008.01524  [pdf, other

    cs.LG stat.ML

    TREND: Transferability based Robust ENsemble Design

    Authors: Deepak Ravikumar, Sangamesh Kodge, Isha Garg, Kaushik Roy

    Abstract: Deep Learning models hold state-of-the-art performance in many fields, but their vulnerability to adversarial examples poses threat to their ubiquitous deployment in practical settings. Additionally, adversarial inputs generated on one classifier have been shown to transfer to other classifiers trained on similar data, which makes the attacks possible even if model parameters are not revealed to t… ▽ More

    Submitted 30 March, 2021; v1 submitted 4 August, 2020; originally announced August 2020.