Showing 1–43 of 43 results for author: Prakash, N

Searching in archive cs.
  1. arXiv:2505.23556  [pdf, other]

    cs.CL

    Understanding Refusal in Language Models with Sparse Autoencoders

    Authors: Wei Jie Yeo, Nirmalendu Prakash, Clement Neo, Roy Ka-Wei Lee, Erik Cambria, Ranjan Satapathy

    Abstract: Refusal is a key safety behavior in aligned language models, yet the internal mechanisms driving refusals remain opaque. In this work, we conduct a mechanistic study of refusal in instruction-tuned LLMs using sparse autoencoders to identify latent features that causally mediate refusal behaviors. We apply our method to two open-source chat models and intervene on refusal-related features to assess…

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2505.14685  [pdf, ps, other]

    cs.CL

    Language Models use Lookbacks to Track Beliefs

    Authors: Nikhil Prakash, Natalie Shapira, Arnab Sen Sharma, Christoph Riedl, Yonatan Belinkov, Tamar Rott Shaham, David Bau, Atticus Geiger

    Abstract: How do language models (LMs) represent characters' beliefs, especially when those beliefs may differ from reality? This question lies at the heart of understanding the Theory of Mind (ToM) capabilities of LMs. We analyze Llama-3-70B-Instruct's ability to reason about characters' beliefs using causal mediation and abstraction. We construct a dataset that consists of simple stories where two charact…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 32 pages, 32 figures. Code and data at https://belief.baulab.info/

  3. arXiv:2504.17080  [pdf, other]

    cs.RO eess.SY

    Geometric Formulation of Unified Force-Impedance Control on SE(3) for Robotic Manipulators

    Authors: Joohwan Seo, Nikhil Potu Surya Prakash, Soomi Lee, Arvind Kruthiventy, Megan Teng, Jongeun Choi, Roberto Horowitz

    Abstract: In this paper, we present an impedance control framework on the SE(3) manifold, which enables force tracking while guaranteeing passivity. Building upon the unified force-impedance control (UFIC) and our previous work on geometric impedance control (GIC), we develop the geometric unified force impedance control (GUFIC) to account for the SE(3) manifold structure in the controller formulation using…

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: Submitted to Control Decision Conference (CDC) 2025

  4. arXiv:2504.13151  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    MIB: A Mechanistic Interpretability Benchmark

    Authors: Aaron Mueller, Atticus Geiger, Sarah Wiegreffe, Dana Arad, Iván Arcuschin, Adam Belfki, Yik Siu Chan, Jaden Fiotto-Kaufman, Tal Haklay, Michael Hanna, Jing Huang, Rohan Gupta, Yaniv Nikankin, Hadas Orgad, Nikhil Prakash, Anja Reusch, Aruna Sankaranarayanan, Shun Shao, Alessandro Stolfo, Martin Tutek, Amir Zur, David Bau, Yonatan Belinkov

    Abstract: How can we know whether new mechanistic interpretability methods achieve real improvements? In pursuit of lasting evaluation standards, we propose MIB, a Mechanistic Interpretability Benchmark, with two tracks spanning four tasks and five models. MIB favors methods that precisely and concisely recover relevant causal pathways or causal variables in neural language models. The circuit localization…

    Submitted 9 June, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Accepted to ICML 2025. Project website at https://mib-bench.github.io

  5. arXiv:2503.04429  [pdf, ps, other]

    cs.AI

    Activation Space Interventions Can Be Transferred Between Large Language Models

    Authors: Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash, Michael Lan, Abir Harrasse, Amirali Abdullah

    Abstract: The study of representation universality in AI models reveals growing convergence across domains, modalities, and architectures. However, the practical applications of representation universality remain largely unexplored. We bridge this gap by demonstrating that safety interventions can be transferred between models through learned mappings of their shared activation spaces. We demonstrate this a…

    Submitted 16 June, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

    Comments: 75 pages

  6. arXiv:2408.01416  [pdf, other]

    cs.LG cs.AI

    The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability

    Authors: Aaron Mueller, Jannik Brinkmann, Millicent Li, Samuel Marks, Koyena Pal, Nikhil Prakash, Can Rager, Aruna Sankaranarayanan, Arnab Sen Sharma, Jiuding Sun, Eric Todd, David Bau, Yonatan Belinkov

    Abstract: Interpretability provides a toolset for understanding how and why neural networks behave in certain ways. However, there is little unity in the field: most studies employ ad-hoc evaluations and do not share theoretical foundations, making it difficult to measure progress and compare the pros and cons of different techniques. Furthermore, while mechanistic understanding is frequently discussed, the…

    Submitted 2 August, 2024; originally announced August 2024.

  7. arXiv:2407.14561  [pdf, other]

    cs.LG cs.AI

    NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals

    Authors: Jaden Fiotto-Kaufman, Alexander R. Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla Brodley, Arjun Guha, Jonathan Bell, Byron C. Wallace, David Bau

    Abstract: We introduce NNsight and NDIF, technologies that work in tandem to enable scientific study of the representations and computations learned by very large neural networks. NNsight is an open-source system that extends PyTorch to introduce deferred remote execution. The National Deep Inference Fabric (NDIF) is a scalable inference service that executes NNsight requests, allowing users to share GPU re…

    Submitted 1 April, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Code at https://nnsight.net

  8. arXiv:2407.13090  [pdf]

    eess.IV cs.CV

    Enhanced Denoising of Optical Coherence Tomography Images Using Residual U-Net

    Authors: Akkidas Noel Prakash, Jahnvi Sai Ganta, Ramaswami Krishnadas, Tin A. Tunc, Satish K Panda

    Abstract: Optical Coherence Tomography (OCT) imaging is pivotal in diagnosing ophthalmic conditions by providing detailed cross-sectional images of the anterior and posterior segments of the eye. Nonetheless, speckle noise and other imaging artifacts inherent to OCT impede the accuracy of diagnosis significantly. In this study, we proposed an enhanced denoising model using a Residual U-Net architecture that…

    Submitted 24 September, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

  9. arXiv:2406.12347  [pdf, other]

    cs.CL

    Interpreting Bias in Large Language Models: A Feature-Based Approach

    Authors: Nirmalendu Prakash, Lee Ka Wei Roy

    Abstract: Large Language Models (LLMs) such as Mistral and LLaMA have showcased remarkable performance across various natural language processing (NLP) tasks. Despite their success, these models inherit social biases from the diverse datasets on which they are trained. This paper investigates the propagation of biases within LLMs through a novel feature-based analytical approach. Drawing inspiration from ca…

    Submitted 18 June, 2024; originally announced June 2024.

  10. arXiv:2405.01842  [pdf, ps, other]

    cs.CL

    SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore

    Authors: Ri Chi Ng, Nirmalendu Prakash, Ming Shan Hee, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

    Abstract: To address the limitations of current hate speech detection models, we introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic and cultural context of Singapore and Southeast Asia. It extends the functional testing approach of HateCheck and MHC, employing large language models for translation and paraphrasing into Singapore's main languages, and refining these with native ann…

    Submitted 3 May, 2024; originally announced May 2024.

  11. arXiv:2402.14811  [pdf, other]

    cs.CL cs.LG

    Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking

    Authors: Nikhil Prakash, Tamar Rott Shaham, Tal Haklay, Yonatan Belinkov, David Bau

    Abstract: Fine-tuning on generalized tasks such as instruction following, code generation, and mathematics has been shown to enhance language models' performance on a range of tasks. Nevertheless, explanations of how such fine-tuning influences the internal computations in these models remain elusive. We study how fine-tuning affects the internal mechanisms implemented in language models. As a case study, w…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: ICLR 2024. 26 pages, 13 figures. Code and data at https://finetuning.baulab.info/

  12. arXiv:2401.13190  [pdf, other]

    cs.RO eess.SY

    A Comparison Between Lie Group- and Lie Algebra- Based Potential Functions for Geometric Impedance Control

    Authors: Joohwan Seo, Nikhil Potu Surya Prakash, Jongeun Choi, Roberto Horowitz

    Abstract: In this paper, a comparison analysis between geometric impedance controls (GICs) derived from two different potential functions on SE(3) for robotic manipulators is presented. The first potential function is defined on the Lie group, utilizing the Frobenius norm of the configuration error matrix. The second potential function is defined utilizing the Lie algebra, i.e., log-map of the configuration…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: This paper is accepted to American Control Conference (ACC) 2024

  13. arXiv:2312.09693  [pdf, other]

    cs.AI

    Prompting Large Language Models for Topic Modeling

    Authors: Han Wang, Nirmalendu Prakash, Nguyen Khoi Hoang, Ming Shan Hee, Usman Naseem, Roy Ka-Wei Lee

    Abstract: Topic modeling is a widely used technique for revealing underlying thematic structures within textual data. However, existing models have certain limitations, particularly when dealing with short text datasets that lack co-occurring words. Moreover, these models often neglect sentence-level semantics, focusing primarily on token-level semantics. In this paper, we propose PromptTopic, a novel topic…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 6 pages, 3 figures, IEEE International Conference on Big Data

    ACM Class: I.2.7

  14. arXiv:2312.06094  [pdf, other]

    cs.CL cs.CV cs.MM

    MATK: The Meme Analytical Tool Kit

    Authors: Ming Shan Hee, Aditi Kumaresan, Nguyen Khoi Hoang, Nirmalendu Prakash, Rui Cao, Roy Ka-Wei Lee

    Abstract: The rise of social media platforms has brought about a new digital culture called memes. Memes, which combine visuals and text, can strongly influence public opinions on social and cultural issues. As a result, people have become interested in categorizing memes, leading to the development of various datasets and multimodal models that show promising results in this field. However, there is curren…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted at ACM Multimedia'23 Open-Source Software Competition Track

    ACM Class: I.1.4

  15. arXiv:2312.06093  [pdf, other]

    cs.CL cs.CV cs.MM

    PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models

    Authors: Nirmalendu Prakash, Han Wang, Nguyen Khoi Hoang, Ming Shan Hee, Roy Ka-Wei Lee

    Abstract: The proliferation of social media has given rise to a new form of communication: memes. Memes are multimodal and often contain a combination of text and visual elements that convey meaning, humor, and cultural significance. While meme analysis has been an active area of research, little work has been done on unsupervised multimodal topic modeling of memes, which is important for content moderation…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted at ACM Multimedia'23 Research Track

    ACM Class: I.1.4; I.1.7

  16. arXiv:2311.10322  [pdf, other]

    eess.SY cs.AI cs.LG math.DS math.OC

    Clustering Techniques for Stable Linear Dynamical Systems with applications to Hard Disk Drives

    Authors: Nikhil Potu Surya Prakash, Joohwan Seo, Jongeun Choi, Roberto Horowitz

    Abstract: In Robust Control and Data Driven Robust Control design methodologies, multiple plant transfer functions or a family of transfer functions are considered and a common controller is designed such that all the plants that fall into this family are stabilized. Though the plants are stabilized, the controller might be sub-optimal for each of the plants when the variations in the plants are large. This…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 6 pages, 4 figures

  17. arXiv:2310.12609  [pdf, ps, other]

    cs.RO cs.AI cs.LG

    Denoising Heat-inspired Diffusion with Insulators for Collision Free Motion Planning

    Authors: Junwoo Chang, Hyunwoo Ryu, Jiwoo Kim, Soochul Yoo, Jongeun Choi, Joohwan Seo, Nikhil Prakash, Roberto Horowitz

    Abstract: Diffusion models have risen as a powerful tool in robotics due to their flexibility and multi-modality. While some of these methods effectively address complex problems, they often depend heavily on inference-time obstacle detection and require additional equipment. Addressing these challenges, we present a method that, during inference time, simultaneously generates only reachable goals and plans…

    Submitted 12 February, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 9 pages, 6 figures

    Journal ref: NeurIPS 2023 Workshop on Diffusion Models

  18. Contact-rich SE(3)-Equivariant Robot Manipulation Task Learning via Geometric Impedance Control

    Authors: Joohwan Seo, Nikhil Potu Surya Prakash, Xiang Zhang, Changhao Wang, Jongeun Choi, Masayoshi Tomizuka, Roberto Horowitz

    Abstract: This paper presents a differential geometric control approach that leverages SE(3) group invariance and equivariance to increase transferability in learning robot manipulation tasks that involve interaction with the environment. Specifically, we employ a control law and a learning representation framework that remain invariant under arbitrary SE(3) transformations of the manipulation task definiti…

    Submitted 18 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  19. arXiv:2307.03637  [pdf, other]

    cs.AI

    Discovering Variable Binding Circuitry with Desiderata

    Authors: Xander Davies, Max Nadeau, Nikhil Prakash, Tamar Rott Shaham, David Bau

    Abstract: Recent work has shown that computation in language models may be human-understandable, with successful efforts to localize and intervene on both single-unit features and input-output circuits. Here, we introduce an approach which extends causal mediation experiments to automatically identify model components responsible for performing a specific subtask by solely specifying a set of \textit{deside…

    Submitted 7 July, 2023; originally announced July 2023.

  20. arXiv:2305.17911  [pdf, other]

    cs.SI cs.AI cs.CL cs.CV

    TotalDefMeme: A Multi-Attribute Meme dataset on Total Defence in Singapore

    Authors: Nirmalendu Prakash, Ming Shan Hee, Roy Ka-Wei Lee

    Abstract: Total Defence is a defence policy combining and extending the concept of military defence and civil defence. While several countries have adopted total defence as their defence policy, very few studies have investigated its effectiveness. With the rapid proliferation of social media and digitalisation, many social studies have been focused on investigating policy effectiveness through specially cu…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 6 pages. Accepted at ACM MMSys 2023

    ACM Class: I.2.7

  21. arXiv:2304.00720  [pdf, other]

    eess.SY cs.AR

    Data-Driven Track Following Control for Dual Stage-Actuator Hard Disk Drives

    Authors: Nikhil Potu Surya Prakash, Joohwan Seo, Alexander Rose, Roberto Horowitz

    Abstract: In this paper, we present a frequency domain data-driven feedback control design methodology for the design of tracking controllers for hard disk drives with two-stage actuator as a part of the open invited track 'Benchmark Problem on Control System Design of Hard Disk Drive with a Dual-Stage Actuator' in the IFAC World Congress 2023 (Yokohama, Japan). The benchmark models are Compared to the trad…

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 7 pages, 10 figures, IFAC World Congress, Yokohama

  22. Geometric Impedance Control on SE(3) for Robotic Manipulators

    Authors: Joohwan Seo, Nikhil Potu Surya Prakash, Alexander Rose, Jongeun Choi, Roberto Horowitz

    Abstract: After its introduction, impedance control has been utilized as a primary control scheme for robotic manipulation tasks that involve interaction with unknown environments. While impedance control has been extensively studied, the geometric structure of SE(3) for the robotic manipulator itself and its use in formulating a robotic task has not been adequately addressed. In this paper, we propose a di…

    Submitted 5 March, 2025; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: Presented at IFAC World Congress 2023, Yokohama, Japan

  23. arXiv:2201.00863  [pdf, other]

    cs.RO cs.AI eess.SY

    Adaptive Model Predictive Control of Wheeled Mobile Robots

    Authors: Nikhil Potu Surya Prakash, Tamara Perreault, Trevor Voth, Zejun Zhong

    Abstract: In this paper, a control algorithm for guiding a two wheeled mobile robot with unknown inertia to a desired point and orientation using an Adaptive Model Predictive Control (AMPC) framework is presented. The two wheeled mobile robot is modeled as a knife edge or a skate with nonholonomic kinematic constraints and the dynamical equations are derived using the Lagrangian approach. The inputs at ever…

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: 5 pages, 7 figures

  24. arXiv:2012.06161  [pdf, other]

    cs.AI cs.HC

    Conceptualization and Framework of Hybrid Intelligence Systems

    Authors: Nikhil Prakash, Kory W. Mathewson

    Abstract: As artificial intelligence (AI) systems are becoming ubiquitous within our society, issues related to their fairness, accountability, and transparency are increasing rapidly. As a result, researchers are integrating humans with AI systems to build robust and reliable hybrid intelligence systems. However, a proper conceptualization of these systems does not underpin this rapid growth. This article pro…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 8 pages, 1 figure, HAMLETS (Human And Machine in-the-Loop Evaluation and Learning Strategies) workshop at Thirty-fourth Conference on Neural Information Processing Systems

  25. arXiv:2011.08013  [pdf, other]

    cs.CE math.NA math.OC

    A General Numerical Method to Model Anisotropy in Discretized Bond-Based Peridynamics

    Authors: Naveen Prakash

    Abstract: This work proposes a novel, general and robust method of determining bond micromoduli for anisotropic linear elastic bond-based peridynamics. The problem of finding a discrete distribution of bond micromoduli that reproduces an anisotropic peridynamic stiffness tensor is cast as a least-squares problem. The proposed numerical method is able to find a distribution of bond micromoduli that is able t…

    Submitted 28 May, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: 56 pages

  26. arXiv:1911.00344  [pdf, other]

    cs.NE eess.SY physics.soc-ph

    Short and Wide Network Paths

    Authors: Lavanya Marla, Lav R. Varshney, Devavrat Shah, Nirmal A. Prakash, Michael E. Gale

    Abstract: Network flow is a powerful mathematical framework to systematically explore the relationship between structure and function in biological, social, and technological networks. We introduce a new pipelining model of flow through networks where commodities must be transported over single paths rather than split over several paths and recombined. We show this notion of pipelined network flow is optimi…

    Submitted 1 November, 2019; originally announced November 2019.

  27. arXiv:1811.07323  [pdf, other]

    cs.RO

    Nonlinear control of a swinging pendulum on a wheeled mobile robot with nonholonomic constraints

    Authors: Nikhil Potu Surya Prakash

    Abstract: In this paper, we propose a nonlinear control strategy for swinging up a pendulum to its upright equilibrium position by shaping its swinging energy along with regulating the cart to a desired location. While the base of a usual cart-pole system is restricted to move in a straight line, the present system is allowed to move in the x-y plane with a nonholonomic constraint that its allowable velocity…

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: 8 pages, 3 figures

  28. arXiv:1805.03727  [pdf, other]

    cs.DC

    ARES: Adaptive, Reconfigurable, Erasure coded, atomic Storage

    Authors: Nicolas Nicolaou, Viveck Cadambe, N. Prakash, Andria Trigeorgi, Kishori M. Konwar, Nancy Lynch, Muriel Medard

    Abstract: Atomicity or strong consistency is one of the fundamental, most intuitive, and hardest to provide primitives in distributed shared memory emulations. To ensure survivability, scalability, and availability of a storage service in the presence of failures, traditional approaches for atomic memory emulation, in message passing environments, replicate the objects across multiple servers. Compared to r…

    Submitted 28 May, 2021; v1 submitted 9 May, 2018; originally announced May 2018.

  29. arXiv:1805.00396  [pdf, other]

    cs.IT

    Updating Content in Cache-Aided Coded Multicast

    Authors: Milad Mahdian, N. Prakash, Muriel Médard, Edmund Yeh

    Abstract: Motivated by applications to delivery of dynamically updated, but correlated data in settings such as content distribution networks, and distributed file sharing systems, we study a single source multiple destination network coded multicast problem in a cache-aided network. We focus on models where the caches are primarily located near the destinations, and where the source has no cache. The sourc…

    Submitted 1 May, 2018; originally announced May 2018.

    Comments: To Appear in IEEE Journal on Selected Areas in Communications: Special Issue on Caching for Communication Systems and Networks

  30. arXiv:1708.05474  [pdf, other]

    cs.IT

    The Storage vs Repair Bandwidth Trade-off for Multiple Failures in Clustered Storage Networks

    Authors: Vitaly Abdrashitov, N. Prakash, Muriel Médard

    Abstract: We study the trade-off between storage overhead and inter-cluster repair bandwidth in clustered storage systems, while recovering from multiple node failures within a cluster. A cluster is a collection of $m$ nodes, and there are $n$ clusters. For data collection, we download the entire content from any $k$ clusters. For repair of $t \geq 2$ nodes within a cluster, we take help from $\ell$ local n…

    Submitted 17 August, 2017; originally announced August 2017.

    Comments: Accepted to IEEE Information Theory Workshop (ITW) 2017

  31. arXiv:1703.01286  [pdf, other]

    cs.DC cs.IT

    A Layered Architecture for Erasure-Coded Consistent Distributed Storage

    Authors: Kishori M. Konwar, N. Prakash, Nancy Lynch, Muriel Medard

    Abstract: Motivated by emerging applications to the edge computing paradigm, we introduce a two-layer erasure-coded fault-tolerant distributed storage system offering atomic access for read and write operations. In edge computing, clients interact with an edge-layer of servers that is geographically near; the edge-layer in turn interacts with a back-end layer of servers. The edge-layer provides low latency…

    Submitted 30 May, 2017; v1 submitted 3 March, 2017; originally announced March 2017.

    Comments: To appear in ACM PODC 2017

  32. The Storage vs Repair-Bandwidth Trade-off for Clustered Storage Systems

    Authors: N. Prakash, Vitaly Abdrashitov, Muriel Medard

    Abstract: We study a generalization of the setting of regenerating codes, motivated by applications to storage systems consisting of clusters of storage nodes. There are $n$ clusters in total, with $m$ nodes per cluster. A data file is coded and stored across the $mn$ nodes, with each node storing $α$ symbols. For availability of data, we require that the file be retrievable by downloading the entire conten…

    Submitted 1 February, 2018; v1 submitted 17 January, 2017; originally announced January 2017.

    Comments: Accepted for publication in IEEE Transactions on Information Theory

    Journal ref: IEEE Transactions on Information Theory (Volume: 64, Issue: 8, Aug. 2018)

  33. arXiv:1606.04467  [pdf, other]

    cs.IT

    Outer Bounds on the Storage-Repair Bandwidth Tradeoff of Exact-Repair Regenerating Codes

    Authors: Birenjith Sasidharan, N. Prakash, M. Nikhil Krishnan, Myna Vajha, Kaushik Senthoor, P. Vijay Kumar

    Abstract: In this paper, three outer bounds on the normalized storage-repair bandwidth (S-RB) tradeoff of regenerating codes having parameter set $\{(n,k,d),(α,β)\}$ under the exact-repair (ER) setting are presented. The first outer bound is applicable for every parameter set $(n,k,d)$ and in conjunction with a code construction known as {\em improved layered codes}, it characterizes the normalized ER trade…

    Submitted 14 June, 2016; originally announced June 2016.

    Comments: Accepted for publication at International Journal of Information and Coding Theory (Special Issue on Information and Coding Theory for Data Storage)

  34. arXiv:1605.05717  [pdf, ps, other]

    cs.DC cs.IT

    RADON: Repairable Atomic Data Object in Networks

    Authors: Kishori M. Konwar, N. Prakash, Nancy Lynch, Muriel Medard

    Abstract: Erasure codes offer an efficient way to decrease storage and communication costs while implementing atomic memory service in asynchronous distributed storage systems. In this paper, we provide erasure-code-based algorithms having the additional ability to perform background repair of crashed nodes. A repair operation of a node in the crashed state is triggered externally, and is carried out by the…

    Submitted 21 November, 2016; v1 submitted 18 May, 2016; originally announced May 2016.

    Comments: To be presented at OPODIS 2016

  35. Storage-Optimized Data-Atomic Algorithms for Handling Erasures and Errors in Distributed Storage Systems

    Authors: Kishori M. Konwar, N. Prakash, Erez Kantor, Nancy Lynch, Muriel Medard, Alexander A. Schwarzmann

    Abstract: Erasure codes are increasingly being studied in the context of implementing atomic memory objects in large scale asynchronous distributed storage systems. When compared with the traditional replication based schemes, erasure codes have the potential of significantly lowering storage and communication costs while simultaneously guaranteeing the desired resiliency levels. In this work, we propose th…

    Submitted 5 May, 2016; originally announced May 2016.

    Comments: Accepted for Publication at IEEE IPDPS, 2016

  36. arXiv:1605.01105  [pdf, other]

    cs.IT

    Communication Cost for Updating Linear Functions when Message Updates are Sparse: Connections to Maximally Recoverable Codes

    Authors: N. Prakash, Muriel Medard

    Abstract: We consider a communication problem in which an update of the source message needs to be conveyed to one or more distant receivers that are interested in maintaining specific linear functions of the source message. The setting is one in which the updates are sparse in nature, and where neither the source nor the receiver(s) is aware of the exact {\em difference vector}, but only know the amount of…

    Submitted 5 August, 2018; v1 submitted 3 May, 2016; originally announced May 2016.

    Comments: To Appear in IEEE Transactions on Information Theory

  37. arXiv:1501.03983  [pdf, other]

    cs.IT

    The Storage-Repair-Bandwidth Trade-off of Exact Repair Linear Regenerating Codes for the Case $d = k = n-1$

    Authors: N. Prakash, M. Nikhil Krishnan

    Abstract: In this paper, we consider the setting of exact repair linear regenerating codes. Under this setting, we derive a new outer bound on the storage-repair-bandwidth trade-off for the case when $d = k = n -1$, where $(n, k, d)$ are parameters of the regenerating code, with their usual meaning. Taken together with the achievability result of Tian et al. [1], we show that the new outer bound derived he…

    Submitted 26 January, 2015; v1 submitted 16 January, 2015; originally announced January 2015.

    Comments: Corrected typos, minor editing for better readability

  38. arXiv:1406.6783  [pdf, other]

    cs.IT

    Evaluation of Codes with Inherent Double Replication for Hadoop

    Authors: M. Nikhil Krishnan, N. Prakash, V. Lalitha, Birenjith Sasidharan, P. Vijay Kumar, Srinivasan Narayanamurthy, Ranjit Kumar, Siddhartha Nandi

    Abstract: In this paper, we evaluate the efficacy, in a Hadoop setting, of two coding schemes, both possessing an inherent double replication of data. The two coding schemes belong to the class of regenerating and locally regenerating codes respectively, and these two classes are representative of recent advances made in designing codes for the efficient storage of data in a distributed setting. In comparis…

    Submitted 26 June, 2014; originally announced June 2014.

    Comments: in Proceedings of Usenix HotStorage, Philadelphia, PA, June 2014

  39. arXiv:1401.2422  [pdf, other]

    cs.IT

    Codes with Locality for Two Erasures

    Authors: N. Prakash, V. Lalitha, P. Vijay Kumar

    Abstract: In this paper, we study codes with locality that can recover from two erasures via a sequence of two local, parity-check computations. By a local parity-check computation, we mean recovery via a single parity-check equation associated to small Hamming weight. Earlier approaches considered recovery in parallel; the sequential approach allows us to potentially construct codes with improved minimum d…

    Submitted 27 January, 2014; v1 submitted 10 January, 2014; originally announced January 2014.

    Comments: 14 pages, 3 figures, Updated for improved readability

  40. Linear Coding Schemes for the Distributed Computation of Subspaces

    Authors: V. Lalitha, N. Prakash, K. Vinodh, P. Vijay Kumar, S. Sandeep Pradhan

    Abstract: Let $X_1, ..., X_m$ be a set of $m$ statistically dependent sources over the common alphabet $\mathbb{F}_q$, that are linearly independent when considered as functions over the sample space. We consider a distributed function computation setting in which the receiver is interested in the lossless computation of the elements of an $s$-dimensional subspace $W$ spanned by the elements of the row vect…

    Submitted 20 February, 2013; originally announced February 2013.

    Comments: To appear in IEEE Journal of Selected Areas in Communications (In-Network Computation: Exploring the Fundamental Limits), April 2013

  41. arXiv:1302.0744  [pdf, other]

    cs.IT

    Explicit MBR All-Symbol Locality Codes

    Authors: Govinda M. Kamath, Natalia Silberstein, N. Prakash, Ankit S. Rawat, V. Lalitha, O. Ozan Koyluoglu, P. Vijay Kumar, Sriram Vishwanath

    Abstract: Node failures are inevitable in distributed storage systems (DSS). To enable efficient repair when faced with such failures, two main techniques are known: Regenerating codes, i.e., codes that minimize the total repair bandwidth; and codes with locality, which minimize the number of nodes participating in the repair process. This paper focuses on regenerating codes with locality, using pre-coding…

    Submitted 27 May, 2013; v1 submitted 4 February, 2013; originally announced February 2013.

  42. arXiv:1211.1932  [pdf, other]

    cs.IT

    Codes with Local Regeneration

    Authors: Govinda M. Kamath, N. Prakash, V. Lalitha, P. Vijay Kumar

    Abstract: Regenerating codes and codes with locality are two schemes that have recently been proposed to ensure data collection and reliability in a distributed storage network. In a situation where one is attempting to repair a failed node, regenerating codes seek to minimize the amount of data downloaded for node repair, while codes with locality attempt to minimize the number of helper nodes accessed. In…

    Submitted 4 February, 2013; v1 submitted 8 November, 2012; originally announced November 2012.

    Comments: 44 pages, 7 figures. A class of codes termed Uniform Rank Accumulation (URA) codes is introduced and a minimum distance bound is derived when the local codes are URA codes. Also, the results of our earlier arXiv submission (arXiv:1202.2414 [cs.IT]) are included in Section 3 of this version

  43. arXiv:1202.2414  [pdf, ps, other]

    cs.IT

    Optimal Linear Codes with a Local-Error-Correction Property

    Authors: N. Prakash, Govinda M. Kamath, V. Lalitha, P. Vijay Kumar

    Abstract: Motivated by applications to distributed storage, Gopalan \textit{et al.} recently introduced the interesting notion of information-symbol locality in a linear code. By this it is meant that each message symbol appears in a parity-check equation associated with small Hamming weight, thereby enabling recovery of the message symbol by examining a small number of other code symbols. This notion is exp…

    Submitted 11 February, 2012; originally announced February 2012.

    Comments: 13 pages, Shorter version submitted to ISIT 2012