Showing 1–50 of 53 results for author: Ramakrishnan, R

  1. AutoComp: Automated Data Compaction for Log-Structured Tables in Data Lakes

    Authors: Anja Gruenheid, Jesús Camacho-Rodríguez, Carlo Curino, Raghu Ramakrishnan, Stanislav Pak, Sumedh Sakdeo, Lenisha Gandhi, Sandeep K. Singhal, Pooja Nilangekar, Daniel J. Abadi

    Abstract: The proliferation of small files in data lakes poses significant challenges, including degraded query performance, increased storage costs, and scalability bottlenecks in distributed storage systems. Log-structured table formats (LSTs) such as Delta Lake, Apache Iceberg, and Apache Hudi exacerbate this issue due to their append-only write patterns and metadata-intensive operations. While compactio…

    Submitted 5 April, 2025; originally announced April 2025.

    Journal ref: ACM SIGMOD 2025

  2. arXiv:2504.01793  [pdf, ps, other]

    cs.IT

    Optimal shift-invariant spaces from uniform measurements

    Authors: Rohan Joy, Radha Ramakrishnan

    Abstract: Let $m$ be a positive integer and $\mathcal{C}$ be a collection of closed subspaces in $L^2(\mathbb{R})$. Given the measurements $\mathcal{F}_Y=\left\lbrace \left\lbrace y_k^1 \right\rbrace_{k\in \mathbb{Z}},\ldots, \left\lbrace y_k^m \right\rbrace_{k\in \mathbb{Z}} \right\rbrace \subset \ell^2(\mathbb{Z})$ of unknown functions…

    Submitted 2 April, 2025; originally announced April 2025.

  3. arXiv:2503.14712  [pdf, other]

    quant-ph cs.NI

    Distribution and Purification of Entanglement States in Quantum Networks

    Authors: Xiaojie Fan, Yukun Yang, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: We consider problems of distributing high-fidelity entangled states across nodes of a quantum network. We consider a repeater-based network architecture with entanglement swapping (fusion) operations for generating long-distance entanglements, and purification operations that produce high-fidelity states from several lower-fidelity states. The contributions of this paper are two-fold: First, while…

    Submitted 23 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

    Comments: 9 pages, 8 figures

  4. arXiv:2503.10904  [pdf, other]

    cs.RO

    Transferring Kinesthetic Demonstrations across Diverse Objects for Manipulation Planning

    Authors: Dibyendu Das, Aditya Patankar, Nilanjan Chakraborty, C. R. Ramakrishnan, I. V. Ramakrishnan

    Abstract: Given a demonstration of a complex manipulation task such as pouring liquid from one container to another, we seek to generate a motion plan for a new task instance involving objects with different geometries. This is non-trivial since we need to simultaneously ensure that the implicit motion constraints are satisfied (glass held upright while moving), the motion is collision-free, and that the ta…

    Submitted 13 March, 2025; originally announced March 2025.

  5. arXiv:2502.03429  [pdf, other]

    cs.CL cs.AI

    On Fairness of Unified Multimodal Large Language Model for Image Generation

    Authors: Ming Liu, Hao Chen, Jindong Wang, Liwen Wang, Bhiksha Raj Ramakrishnan, Wensheng Zhang

    Abstract: Unified multimodal large language models (U-MLLMs) have demonstrated impressive performance in visual understanding and generation in an end-to-end pipeline. Compared with generation-only models (e.g., Stable Diffusion), U-MLLMs may raise new questions about bias in their outputs, which can be affected by their unified capabilities. This gap is particularly concerning given the under-explored risk…

    Submitted 5 February, 2025; originally announced February 2025.

  6. arXiv:2411.04036  [pdf, other]

    cs.LG

    Stepping Forward on the Last Mile

    Authors: Chen Feng, Shaojie Zhuo, Xiaopeng Zhang, Ramchalam Kinattinkara Ramakrishnan, Zhaocong Yuan, Andrew Zou Li

    Abstract: Continuously adapting pre-trained models to local data on resource-constrained edge devices is the $\emph{last mile}$ for model deployment. However, as models increase in size and depth, backpropagation requires a large amount of memory, which becomes prohibitive for edge devices. In addition, most existing low-power neural processing engines (e.g., NPUs, DSPs, MCUs, etc.) are designed as fixed-po…

    Submitted 6 November, 2024; originally announced November 2024.

  7. arXiv:2410.18275  [pdf, other]

    cs.RO cs.AI

    Screw Geometry Meets Bandits: Incremental Acquisition of Demonstrations to Generate Manipulation Plans

    Authors: Dibyendu Das, Aditya Patankar, Nilanjan Chakraborty, C. R. Ramakrishnan, I. V. Ramakrishnan

    Abstract: In this paper, we study the problem of methodically obtaining a sufficient set of kinesthetic demonstrations, one at a time, such that a robot can be confident of its ability to perform a complex manipulation task in a given region of its workspace. Although Learning from Demonstrations has been an active area of research, the problems of checking whether a set of demonstrations is sufficient, and…

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: 8 pages, 6 figures, under review in IEEE Robotics and Automation Letters

  8. arXiv:2410.18221  [pdf, other]

    cs.AI

    Data Augmentation for Automated Adaptive Rodent Training

    Authors: Dibyendu Das, Alfredo Fontanini, Joshua F. Kogan, Haibin Ling, C. R. Ramakrishnan, I. V. Ramakrishnan

    Abstract: Fully optimized automation of behavioral training protocols for lab animals like rodents has long been a coveted goal for researchers. It is an otherwise labor-intensive and time-consuming process that demands close interaction between the animal and the researcher. In this work, we used a data-driven approach to optimize the way rodents are trained in labs. In pursuit of our goal, we looked at da…

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: 5 pages, 3 figures

  9. arXiv:2410.14357  [pdf, other]

    quant-ph cs.DC hep-ph physics.chem-ph

    Efficient charge-preserving excited state preparation with variational quantum algorithms

    Authors: Zohim Chandani, Kazuki Ikeda, Zhong-Bo Kang, Dmitri E. Kharzeev, Alexander McCaskey, Andrea Palermo, C. R. Ramakrishnan, Pooja Rao, Ranjani G. Sundaram, Kwangmin Yu

    Abstract: Determining the spectrum and wave functions of excited states of a system is crucial in quantum physics and chemistry. Low-depth quantum algorithms, such as the Variational Quantum Eigensolver (VQE) and its variants, can be used to determine the ground-state energy. However, current approaches to computing excited states require numerous controlled unitaries, making the application of the original…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 20 pages, 6 figures, 1 table

  10. arXiv:2410.09135  [pdf, other]

    cs.CV cs.LG eess.IV

    Enabling Advanced Land Cover Analytics: An Integrated Data Extraction Pipeline for Predictive Modeling with the Dynamic World Dataset

    Authors: Victor Radermecker, Andrea Zanon, Nancy Thomas, Annita Vapsi, Saba Rahimi, Rama Ramakrishnan, Daniel Borrajo

    Abstract: Understanding land cover holds considerable potential for a myriad of practical applications, particularly as data accessibility transitions from being exclusive to governmental and commercial entities to now including the broader research community. Nevertheless, although the data is accessible to any community member interested in exploration, there exists a formidable learning curve and no stan…

    Submitted 11 October, 2024; originally announced October 2024.

  11. arXiv:2405.07499  [pdf, other]

    quant-ph cs.ET

    Distributed Quantum Computation with Minimum Circuit Execution Time over Quantum Networks

    Authors: Ranjani G Sundaram, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Present quantum computers are constrained by limited qubit capacity and restricted physical connectivity, leading to challenges in large-scale quantum computations. Distributing quantum computations across a network of quantum computers is a promising way to circumvent these challenges and facilitate large quantum computations. However, distributed quantum computations require entanglements (to ex…

    Submitted 13 May, 2024; originally announced May 2024.

  12. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli, et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  13. arXiv:2405.00222  [pdf, other]

    quant-ph cs.NI

    Optimized Distribution of Entanglement Graph States in Quantum Networks

    Authors: Xiaojie Fan, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantu…

    Submitted 18 March, 2025; v1 submitted 30 April, 2024; originally announced May 2024.

    Comments: 16 pages, 20 figures

  14. arXiv:2401.11162  [pdf]

    cs.DB

    Extending Polaris to Support Transactions

    Authors: Josep Aguilar-Saborit, Raghu Ramakrishnan, Kevin Bocksrocker, Alan Halverson, Konstantin Kosinsky, Ryan O'Connor, Nadejda Poliakova, Moe Shafiei, Taewoo Kim, Phil Kon-Kim, Haris Mahmud-Ansari, Blazej Matuszyk, Matt Miles, Sumin Mohanan, Cristian Petculescu, Ishan Rahesh-Madan, Emma Rose-Wirshing, Elias Yousefi

    Abstract: In Polaris, we introduced a cloud-native distributed query processor to perform analytics at scale. In this paper, we extend the underlying Polaris distributed computation framework, which can be thought of as a read-only transaction engine, to execute general transactions (including updates, deletes, inserts and bulk loads, in addition to queries) for Tier 1 warehousing workloads in a highly perf…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 12 pages, 12 Figures

  15. arXiv:2401.09621  [pdf, other]

    cs.DB

    XTable in Action: Seamless Interoperability in Data Lakes

    Authors: Ashvin Agrawal, Tim Brown, Anoop Johnson, Jesús Camacho-Rodríguez, Kyle Weller, Carlo Curino, Raghu Ramakrishnan

    Abstract: Contemporary approaches to data management are increasingly relying on unified analytics and AI platforms to foster collaboration, interoperability, seamless access to reliable data, and high performance. Data Lakes featuring open standard table formats such as Delta Lake, Apache Hudi, and Apache Iceberg are central components of these data architectures. Choosing the right format for managing a t…

    Submitted 17 January, 2024; originally announced January 2024.

  16. Application of AI in Nutrition

    Authors: Ritu Ramakrishnan, Tianxiang Xing, Tianfeng Chen, Ming-Hao Lee, Jinzhu Gao

    Abstract: In healthcare, artificial intelligence (AI) has been changing the way doctors and health experts take care of people. This paper will cover how AI is making major changes in the health care system, especially with nutrition. Various machine learning and deep learning algorithms have been developed to extract valuable information from healthcare data which help doctors, nutritionists, and health ex…

    Submitted 17 December, 2023; originally announced December 2023.

    Journal ref: Journal of Advances in Information Science and Technology, Volume 1, Issue 1, 2023, Pages 7-12

  17. arXiv:2312.08598  [pdf, other]

    cs.LG

    MotherNet: Fast Training and Inference via Hyper-Network Transformers

    Authors: Andreas Müller, Carlo Curino, Raghu Ramakrishnan

    Abstract: Foundation models are transforming machine learning across many modalities, with in-context learning replacing classical model training. Recent work on tabular data hints at a similar opportunity to build foundation models for classification for numerical data. However, existing meta-learning approaches cannot compete with tree-based methods in terms of inference time. In this paper, we propose M…

    Submitted 9 May, 2025; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 17 pages, 13 figures

    ACM Class: I.2.6

  18. arXiv:2311.09593  [pdf, other]

    cs.CL cs.AI

    Multi-Step Dialogue Workflow Action Prediction

    Authors: Ramya Ramakrishnan, Ethan R. Elenberg, Hashan Narangodage, Ryan McDonald

    Abstract: In task-oriented dialogue, a system often needs to follow a sequence of actions, called a workflow, that complies with a set of guidelines in order to complete a task. In this paper, we propose the novel problem of multi-step workflow action prediction, in which the system predicts multiple future workflow actions. Accurate prediction of multiple steps allows for multi-turn automation, which can f…

    Submitted 12 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  19. arXiv:2311.08300  [pdf, other]

    cs.CL cs.AI

    Workflow-Guided Response Generation for Task-Oriented Dialogue

    Authors: Do June Min, Paloma Sodhi, Ramya Ramakrishnan

    Abstract: Task-oriented dialogue (TOD) systems aim to achieve specific goals through interactive dialogue. Such tasks usually involve following specific workflows, i.e. executing a sequence of actions in a particular order. While prior work has focused on supervised learning methods to condition on past actions, they do not explicitly optimize for compliance to a desired workflow. In this paper, we propose…

    Submitted 14 November, 2023; originally announced November 2023.

  20. arXiv:2306.15758  [pdf, ps, other]

    cs.IT

    On the reconstruction of bandlimited signals from random samples quantized via noise-shaping

    Authors: Rohan Joy, Felix Krahmer, Alessandro Lupoli, Radha Ramakrishnan

    Abstract: Noise-shaping quantization techniques are widely used for converting bandlimited signals from the analog to the digital domain. They work by "shaping" the quantization noise so that it falls close to the reconstruction operator's null space. We investigate the compatibility of two such schemes, specifically $\Sigma\Delta$ quantization and distributed noise-shaping quantization, with random samples of bandli…

    Submitted 1 July, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 31 pages, 3 figures

    MSC Class: 94A20; 94A12; 42C15; 41A29

  21. LST-Bench: Benchmarking Log-Structured Tables in the Cloud

    Authors: Jesús Camacho-Rodríguez, Ashvin Agrawal, Anja Gruenheid, Ashit Gosalia, Cristian Petculescu, Josep Aguilar-Saborit, Avrilia Floratou, Carlo Curino, Raghu Ramakrishnan

    Abstract: Data processing engines increasingly leverage distributed file systems for scalable, cost-effective storage. While the Apache Parquet columnar format has become a popular choice for data storage and retrieval, the immutability of Parquet files renders it impractical to meet the demands of frequent updates in contemporary analytical workloads. Log-Structured Tables (LSTs), such as Delta Lake, Apach…

    Submitted 19 January, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

    Journal ref: Proceedings of the ACM on Management of Data (2024) Volume 2 Issue 1

  22. arXiv:2210.14047  [pdf, other]

    cs.DB

    OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance from Database Logs [Technical Report]

    Authors: Fotis Psallidas, Ashvin Agrawal, Chandru Sugunan, Khaled Ibrahim, Konstantinos Karanasos, Jesús Camacho-Rodríguez, Avrilia Floratou, Carlo Curino, Raghu Ramakrishnan

    Abstract: Provenance encodes information that connects datasets, their generation workflows, and associated metadata (e.g., who executed a query and when). As such, it is instrumental for a wide range of critical governance applications (e.g., observability and auditing). Unfortunately, in the context of database systems, extracting coarse-grained provenance is a long-standing problem due to the complexity a…

    Submitted 3 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    ACM Class: H.2

  23. arXiv:2206.06437  [pdf, other]

    cs.ET quant-ph

    Distribution of Quantum Circuits Over General Quantum Networks

    Authors: Ranjani G Sundaram, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Near-term quantum computers can hold only a small number of qubits. One way to facilitate large-scale quantum computations is through a distributed network of quantum computers. In this work, we consider the problem of distributing quantum programs represented as quantum circuits across a quantum network of heterogeneous quantum computers, in a way that minimizes the overall communication cost req…

    Submitted 13 June, 2022; originally announced June 2022.

  24. arXiv:2205.07352  [pdf, other]

    cs.CL cs.AI

    Long-term Control for Dialogue Generation: Methods and Evaluation

    Authors: Ramya Ramakrishnan, Hashan Buddhika Narangodage, Mauro Schilman, Kilian Q. Weinberger, Ryan McDonald

    Abstract: Current approaches for controlling dialogue response generation are primarily focused on high-level attributes like style, sentiment, or topic. In this work, we focus on constrained long-term dialogue generation, which involves more fine-grained control and requires a given set of control words to appear in generated responses. This setting requires a model to not only consider the generation of t…

    Submitted 15 May, 2022; originally announced May 2022.

  25. Pre-Distribution of Entanglements in Quantum Networks

    Authors: Mohammad Ghaderibaneh, Himanshu Gupta, C. R. Ramakrishnan, Ertai Luo

    Abstract: Quantum network communication is challenging, as the No-Cloning theorem in the quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the low probability…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 11 pages, 9 figures

  26. arXiv:2203.05492  [pdf, other]

    cs.LG

    An Empirical Study of Low Precision Quantization for TinyML

    Authors: Shaojie Zhuo, Hongyu Chen, Ramchalam Kinattinkara Ramakrishnan, Tommy Chen, Chen Feng, Yicheng Lin, Parker Zhang, Liang Shen

    Abstract: Tiny machine learning (tinyML) has emerged during the past few years aiming to deploy machine learning models to embedded AI processors with highly constrained memory and computation capacity. Low precision quantization is an important model compression technique that can greatly reduce both memory consumption and computation cost of model inference. In this study, we focus on post-training quanti…

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: tinyML Research Symposium 2022

  27. arXiv:2202.03487  [pdf]

    cs.LG

    Targeted-BEHRT: Deep learning for observational causal inference on longitudinal electronic health records

    Authors: Shishir Rao, Mohammad Mamouei, Gholamreza Salimi-Khorshidi, Yikuan Li, Rema Ramakrishnan, Abdelaali Hassaine, Dexter Canoy, Kazem Rahimi

    Abstract: Observational causal inference is useful for decision making in medicine when randomized clinical trials (RCT) are infeasible or non-generalizable. However, traditional approaches fail to deliver unconfounded causal conclusions in practice. The rise of "doubly robust" non-parametric tools coupled with the growth of deep learning for capturing rich representations of multimodal data offers a uniqu…

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  28. Efficient Quantum Network Communication using Optimized Entanglement-Swapping Trees

    Authors: Mohammad Ghaderibaneh, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Quantum network communication is challenging, as the No-cloning theorem in the quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable communication approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the lo…

    Submitted 4 April, 2024; v1 submitted 21 December, 2021; originally announced December 2021.

  29. arXiv:2103.15171  [pdf, other]

    cs.AI

    A Bayesian Approach to Identifying Representational Errors

    Authors: Ramya Ramakrishnan, Vaibhav Unhelkar, Ece Kamar, Julie Shah

    Abstract: Trained AI systems and expert decision makers can make errors that are often difficult to identify and understand. Determining the root cause for these errors can improve future decisions. This work presents Generative Error Model (GEM), a generative model for inferring representational errors based on observations of an actor's behavior (either simulated agent, robot, or human). The model conside…

    Submitted 28 March, 2021; originally announced March 2021.

  30. arXiv:2101.11359  [pdf]

    cs.LG

    An explainable Transformer-based deep learning model for the prediction of incident heart failure

    Authors: Shishir Rao, Yikuan Li, Rema Ramakrishnan, Abdelaali Hassaine, Dexter Canoy, John Cleland, Thomas Lukasiewicz, Gholamreza Salimi-Khorshidi, Kazem Rahimi

    Abstract: Predicting the incidence of complex chronic conditions such as heart failure is challenging. Deep learning models applied to rich electronic health records may improve prediction but remain unexplainable, hampering their wider use in medical practice. We developed a novel Transformer deep-learning model for more accurate and yet explainable prediction of incident heart failure involving 100,071 pat…

    Submitted 27 January, 2021; originally announced January 2021.

  31. arXiv:2006.05442  [pdf, other]

    cs.LG stat.ML

    Tensor train decompositions on recurrent networks

    Authors: Alejandro Murua, Ramchalam Ramakrishnan, Xinlin Li, Rui Heng Yang, Vahid Partovi Nia

    Abstract: Recurrent neural networks (RNN) such as long-short-term memory (LSTM) networks are essential in a multitude of daily life tasks such as speech, language, video, and multimodal learning. The shift from cloud to edge computation intensifies the need to contain the growth of RNN parameters. Current research on RNN shows that despite the performance obtained on convolutional neural networks (CNN), kee…

    Submitted 9 June, 2020; originally announced June 2020.

  32. arXiv:2003.10170  [pdf, other]

    cs.LG stat.ML

    Deep Bayesian Gaussian Processes for Uncertainty Estimation in Electronic Health Records

    Authors: Yikuan Li, Shishir Rao, Abdelaali Hassaine, Rema Ramakrishnan, Yajie Zhu, Dexter Canoy, Gholamreza Salimi-Khorshidi, Thomas Lukasiewicz, Kazem Rahimi

    Abstract: One major impediment to the wider use of deep learning for clinical decision making is the difficulty of assigning a level of confidence to model predictions. Currently, deep Bayesian neural networks and sparse Gaussian processes are the two main scalable uncertainty estimation methods. However, deep Bayesian neural networks suffer from a lack of expressiveness, and more expressive models such as de…

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: 21 pages

  33. arXiv:1911.00231  [pdf, other]

    cs.DB cs.LG

    Extending Relational Query Processing with ML Inference

    Authors: Konstantinos Karanasos, Matteo Interlandi, Doris Xin, Fotis Psallidas, Rathijit Sen, Kwanghyun Park, Ivan Popivanov, Supun Nakandal, Subru Krishnan, Markus Weimer, Yuan Yu, Raghu Ramakrishnan, Carlo Curino

    Abstract: The broadening adoption of machine learning in the enterprise is increasing the pressure for strict governance and cost-effective performance, in particular for the common and consequential steps of model storage and inference. The RDBMS provides a natural starting point, given its mature infrastructure for fast data access and processing, along with support for enterprise features (e.g., encrypti…

    Submitted 1 November, 2019; originally announced November 2019.

  34. Value of Information in Probabilistic Logic Programs

    Authors: Sarthak Ghosh, C. R. Ramakrishnan

    Abstract: In medical decision making, we have to choose among several expensive diagnostic tests such that the certainty about a patient's health is maximized while remaining within the bounds of resources like time and money. The expected increase in certainty in the patient's condition due to performing a test is called the value of information (VoI) for that test. In general, VoI relates to acquiring add…

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: In Proceedings ICLP 2019, arXiv:1909.07646

    ACM Class: I.2.4

    Journal ref: EPTCS 306, 2019, pp. 71-84

  35. arXiv:1909.04567  [pdf, other]

    cs.LG stat.ML

    Differentiable Mask for Pruning Convolutional and Recurrent Networks

    Authors: Ramchalam Kinattinkara Ramakrishnan, Eyyüb Sari, Vahid Partovi Nia

    Abstract: Pruning is one of the most effective model reduction techniques. Deep networks require massive computation and such models need to be compressed to bring them to edge devices. Most existing pruning techniques are focused on vision-based models like convolutional networks, while text-based models are still evolving. The emergence of multi-modal multi-task learning calls for a general method that wo…

    Submitted 29 April, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

  36. arXiv:1909.00084  [pdf, other]

    cs.DB cs.DC cs.LG

    Cloudy with high chance of DBMS: A 10-year prediction for Enterprise-Grade ML

    Authors: Ashvin Agrawal, Rony Chatterjee, Carlo Curino, Avrilia Floratou, Neha Gowdal, Matteo Interlandi, Alekh Jindal, Kostantinos Karanasos, Subru Krishnan, Brian Kroth, Jyoti Leeka, Kwanghyun Park, Hiren Patel, Olga Poppe, Fotis Psallidas, Raghu Ramakrishnan, Abhishek Roy, Karla Saur, Rathijit Sen, Markus Weimer, Travis Wright, Yiwen Zhu

    Abstract: Machine learning (ML) has proven itself in high-value web applications such as search ranking and is emerging as a powerful tool in a much broader range of enterprise scenarios including voice recognition and conversational understanding for customer support, autotuning for videoconferencing, intelligent feedback loops in large-scale sysops, manufacturing and autonomous vehicle management, complex…

    Submitted 27 December, 2019; v1 submitted 30 August, 2019; originally announced September 2019.

  37. arXiv:1904.00775  [pdf, other]

    cs.CV cs.LG stat.ML

    Deep Demosaicing for Edge Implementation

    Authors: Ramchalam Kinattinkara Ramakrishnan, Shangling Jui, Vahid Patrovi Nia

    Abstract: Most digital cameras use sensors coated with a Color Filter Array (CFA) to capture channel components at every pixel location, resulting in a mosaic image that does not contain pixel values in all channels. Current research on reconstructing these missing channels, also known as demosaicing, introduces many artifacts, such as zipper effect and false color. Many deep learning demosaicing techniques…

    Submitted 23 May, 2019; v1 submitted 26 March, 2019; originally announced April 2019.

    Comments: Accepted in the 16th International Conference of Image Analysis and Recognition (ICIAR 2019)

  38. arXiv:1806.07834  [pdf, other]

    cs.RO

    A Look at Motion Planning for Autonomous Vehicles at an Intersection

    Authors: Shravan Krishnan, Govind Aadithya R, Rahul Ramakrishnan, Vijay Arvindh, Sivanathan K

    Abstract: Autonomous Vehicles are currently being tested in a variety of scenarios. As we move towards Autonomous Vehicles, how should intersections look? To answer that question, we break down intersection management into the different conundrums and scenarios involved in the trajectory planning and current approaches to solve them. Then, a brief analysis of current works in autonomous intersection is c…

    Submitted 7 September, 2018; v1 submitted 20 June, 2018; originally announced June 2018.

    Comments: Accepted for presentation at ITSC 2018, Final Version

  39. arXiv:1805.08966  [pdf, other]

    cs.LG cs.AI stat.ML

    Discovering Blind Spots in Reinforcement Learning

    Authors: Ramya Ramakrishnan, Ece Kamar, Debadeepta Dey, Julie Shah, Eric Horvitz

    Abstract: Agents trained in simulation may make errors in the real world due to mismatches between training and execution environments. These mistakes can be dangerous and difficult to discover because the agent cannot predict them a priori. We propose using oracle feedback to learn a predictive model of these blind spots to reduce costly errors in real-world applications. We focus on blind spots in reinfor…

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: To appear at AAMAS 2018

  40. arXiv:1804.10237  [pdf, ps, other]

    cs.LO

    Constraint-Based Inference in Probabilistic Logic Programs

    Authors: Arun Nampally, Timothy Zhang, C. R. Ramakrishnan

    Abstract: Probabilistic Logic Programs (PLPs) generalize traditional logic programs and allow the encoding of models combining logical structure and uncertainty. In PLP, inference is performed by summarizing the possible worlds which entail the query in a suitable data structure, and using it to compute the answer probability. Systems such as ProbLog, PITA, etc., use propositional data structures like expla…

    Submitted 26 April, 2018; originally announced April 2018.

    Comments: Paper presented at the 34th International Conference on Logic Programming (ICLP 2018), Oxford, UK, July 14 to July 17, 2018. 18 pages, LaTeX, 5 PDF figures

  41. Convolutional Neural Networks for Passive Monitoring of a Shallow Water Environment using a Single Sensor

    Authors: Eric L. Ferguson, Rishi Ramakrishnan, Stefan B. Williams, Craig T. Jin

    Abstract: A cost-effective approach to remote monitoring of protected areas such as marine reserves and restricted naval waters is to use passive sonar to detect, classify, localize, and track marine vessel activity (including small boats and autonomous underwater vehicles). Cepstral analysis of underwater acoustic data enables the time delay between the direct path arrival and the first multipath arrival t…

    Submitted 11 December, 2016; originally announced December 2016.

    Comments: Final draft for IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017. 5 pages, 4 figures

  42. arXiv:1611.09007  [pdf, other]

    cs.CV

    Hyperspectral CNN Classification with Limited Training Samples

    Authors: Lloyd Windrim, Rishi Ramakrishnan, Arman Melkumyan, Richard Murphy

    Abstract: Hyperspectral imaging sensors are becoming increasingly popular in robotics applications such as agriculture and mining, and allow per-pixel thematic classification of materials in a scene based on their unique spectral signatures. Recently, convolutional neural networks have shown remarkable performance for classification tasks, but require substantial amounts of labelled training data. This data…

    Submitted 28 November, 2016; originally announced November 2016.

    Comments: 10 pages, 6 figures

  43. arXiv:1608.05763  [pdf, ps, other]

    cs.AI cs.LO

    Inference in Probabilistic Logic Programs using Lifted Explanations

    Authors: Arun Nampally, C. R. Ramakrishnan

    Abstract: In this paper, we consider the problem of lifted inference in the context of Prism-like probabilistic logic programming languages. Traditional inference in such languages involves the construction of an explanation graph for the query and computing probabilities over this graph. When evaluating queries over probabilistic logic programs with a large number of instances of random variables, traditio…

    Submitted 19 August, 2016; originally announced August 2016.

  44. arXiv:1604.06118  [pdf, other]

    cs.LO

    XPL: An extended probabilistic logic for probabilistic transition systems

    Authors: Andrey Gorlin, C. R. Ramakrishnan

    Abstract: Generalized Probabilistic Logic (GPL) is a temporal logic, based on the modal mu-calculus, for specifying properties of reactive probabilistic systems. We explore XPL, an extension to GPL allowing the semantics of nondeterminism present in Markov decision processes (MDPs). XPL is expressive enough that a number of independently studied problems, such as termination of Recursive MDPs (RMDPs), PCT…

    Submitted 9 May, 2017; v1 submitted 20 April, 2016; originally announced April 2016.

    ACM Class: I.2.3

  45. arXiv:1509.08439  [pdf, other]

    cs.CV

    Hyper-Fisher Vectors for Action Recognition

    Authors: Sanath Narayan, Kalpathi R. Ramakrishnan

    Abstract: In this paper, a novel encoding scheme combining Fisher vector and bag-of-words encodings is proposed for recognizing actions in videos. The proposed Hyper-Fisher vector encoding is the sum of local Fisher vectors, which are computed based on the traditional Bag-of-Words (BoW) encoding. Thus, the proposed encoding is a simple yet effective representation compared with the traditional Fisher Vector enc…

    Submitted 28 September, 2015; originally announced September 2015.

  46. arXiv:1501.03879  [pdf, other]

    cs.CV

    A new ADMM algorithm for the Euclidean median and its application to robust patch regression

    Authors: Kunal N. Chaudhury, K. R. Ramakrishnan

    Abstract: The Euclidean Median (EM) of a set of points $Ω$ in a Euclidean space is the point x minimizing the (weighted) sum of the Euclidean distances of x to the points in $Ω$. While there exists no closed-form expression for the EM, it can nevertheless be computed using iterative methods such as the Weiszfeld algorithm. The EM has classically been used as a robust estimator of centrality for multivariate…

    Submitted 16 January, 2015; originally announced January 2015.

    Comments: 5 pages, 3 figures, 1 table. To appear in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, April 19-24, 2015

  47. arXiv:1405.6341  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Efficient Model Learning for Human-Robot Collaborative Tasks

    Authors: Stefanos Nikolaidis, Keren Gu, Ramya Ramakrishnan, Julie Shah

    Abstract: We present a framework for learning human user models from joint-action demonstrations that enables the robot to compute a robust policy for a collaborative task with a human. The learning takes place completely automatically, without any human intervention. First, we describe the clustering of demonstrated action sequences into different human types using an unsupervised learning algorithm. These…

    Submitted 24 May, 2014; originally announced May 2014.

    ACM Class: I.2.6; I.2.8; I.2.9

    Journal ref: Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction (HRI 2015)

  48. arXiv:1403.6036  [pdf, other]

    cs.AI

    Adaptive MCMC-Based Inference in Probabilistic Logic Programs

    Authors: Arun Nampally, C. R. Ramakrishnan

    Abstract: Probabilistic Logic Programming (PLP) languages enable programmers to specify systems that combine logical models with statistical knowledge. The inference problem, to determine the probability of query answers in PLP, is intractable in general, thereby motivating the need for approximate techniques. In this paper, we present a technique for approximate inference of conditional probabilities for P…

    Submitted 24 March, 2014; originally announced March 2014.

  49. arXiv:1303.3517  [pdf, other]

    cs.DC cs.DB cs.LG

    Iterative MapReduce for Large Scale Machine Learning

    Authors: Joshua Rosen, Neoklis Polyzotis, Vinayak Borkar, Yingyi Bu, Michael J. Carey, Markus Weimer, Tyson Condie, Raghu Ramakrishnan

    Abstract: Large datasets ("Big Data") are becoming ubiquitous because the potential value in deriving insights from data, across a wide range of business and scientific applications, is increasingly recognized. In particular, machine learning (one of the foundational disciplines for data analysis, summarization and inference) on Big Data has become routine at most organizations that operate large clouds,…

    Submitted 13 March, 2013; originally announced March 2013.

  50. arXiv:1204.4736  [pdf, other]

    cs.LO

    Model Checking with Probabilistic Tabled Logic Programming

    Authors: Andrey Gorlin, C. R. Ramakrishnan, Scott A. Smolka

    Abstract: We present a formulation of the problem of probabilistic model checking as one of query evaluation over probabilistic logic programs. To the best of our knowledge, our formulation is the first of its kind, and it covers a rich class of probabilistic models and probabilistic temporal logics. The inference algorithms of existing probabilistic logic-programming systems are well defined only for queri…

    Submitted 20 April, 2012; originally announced April 2012.