Showing 1–24 of 24 results for author: Bhardwaj, M

Searching in archive cs.
  1. arXiv:2508.17275  [pdf, ps, other]

    cs.CV cs.AI

    Deep Learning-Assisted Detection of Sarcopenia in Cross-Sectional Computed Tomography Imaging

    Authors: Manish Bhardwaj, Huizhi Liang, Ashwin Sivaharan, Sandip Nandhra, Vaclav Snasel, Tamer El-Sayed, Varun Ojha

    Abstract: Sarcopenia is a progressive loss of muscle mass and function linked to poor surgical outcomes such as prolonged hospital stays, impaired mobility, and increased mortality. Although it can be assessed through cross-sectional imaging by measuring skeletal muscle area (SMA), the process is time-consuming and adds to clinical workloads, limiting timely detection and management; however, this process c…

    Submitted 24 August, 2025; originally announced August 2025.

    Journal ref: The 9th Euro-China Conference on Intelligent Data Analysis and Applications 2025

  2. arXiv:2505.02215  [pdf, other]

    cs.AI cs.CL

    Interpretable Emergent Language Using Inter-Agent Transformers

    Authors: Mannan Bhardwaj

    Abstract: This paper explores the emergence of language in multi-agent reinforcement learning (MARL) using transformers. Existing methods such as RIAL, DIAL, and CommNet enable agent communication but lack interpretability. We propose Differentiable Inter-Agent Transformers (DIAT), which leverage self-attention to learn symbolic, human-understandable communication protocols. Through experiments, DIAT demons…

    Submitted 4 May, 2025; originally announced May 2025.

  3. arXiv:2504.18511  [pdf, other]

    cs.SE

    Co-Change Graph Entropy: A New Process Metric for Defect Prediction

    Authors: Ethari Hrishikesh, Amit Kumar, Meher Bhardwaj, Sonali Agarwal

    Abstract: Process metrics, valued for their language independence and ease of collection, have been shown to outperform product metrics in defect prediction. Among these, change entropy (Hassan, 2009) is widely used at the file level and has proven highly effective. Additionally, past research suggests that co-change patterns provide valuable insights into software quality. Building on these findings, we in…

    Submitted 25 April, 2025; originally announced April 2025.

  4. arXiv:2412.00086  [pdf, other]

    cs.RO cs.LG

    Dynamic Non-Prehensile Object Transport via Model-Predictive Reinforcement Learning

    Authors: Neel Jawale, Byron Boots, Balakumar Sundaralingam, Mohak Bhardwaj

    Abstract: We investigate the problem of teaching a robot manipulator to perform dynamic non-prehensile object transport, also known as the 'robot waiter' task, from a limited set of real-world demonstrations. We propose an approach that combines batch reinforcement learning (RL) with model-predictive control (MPC) by pretraining an ensemble of value functions from demonstration data, and utilizing them onli…

    Submitted 26 November, 2024; originally announced December 2024.

    Comments: 11 pages

  5. arXiv:2411.18923  [pdf, other]

    cs.CL cs.AI

    EzSQL: An SQL intermediate representation for improving SQL-to-text Generation

    Authors: Meher Bhardwaj, Hrishikesh Ethari, Dennis Singh Moirangthem

    Abstract: The SQL-to-text generation task traditionally uses template-based, Seq2Seq, tree-to-sequence, and graph-to-sequence models. Recent models take advantage of pre-trained generative language models for this task in the Seq2Seq framework. However, treating SQL as a sequence of inputs to the pre-trained models is not optimal. In this work, we put forward a new SQL intermediate representation called EzSQ…

    Submitted 9 April, 2025; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: Under revision and review at the journal Expert Systems with Applications after first review

  6. arXiv:2402.06102  [pdf, other]

    cs.RO cs.LG

    Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning

    Authors: Mohak Bhardwaj, Thomas Lampe, Michael Neunert, Francesco Romano, Abbas Abdolmaleki, Arunkumar Byravan, Markus Wulfmeier, Martin Riedmiller, Jonas Buchli

    Abstract: Recent advances in real-world applications of reinforcement learning (RL) have relied on the ability to accurately simulate systems at scale. However, domains such as fluid dynamical systems exhibit complex dynamic phenomena that are hard to simulate at high integration rates, limiting the direct application of modern deep RL algorithms to often expensive or safety critical hardware. In this work,…

    Submitted 8 February, 2024; originally announced February 2024.

  7. arXiv:2304.09761  [pdf, other]

    cs.LG cs.AI q-fin.ST

    An innovative Deep Learning Based Approach for Accurate Agricultural Crop Price Prediction

    Authors: Mayank Ratan Bhardwaj, Jaydeep Pawar, Abhijnya Bhat, Deepanshu, Inavamsi Enaganti, Kartik Sagar, Y. Narahari

    Abstract: Accurate prediction of agricultural crop prices is a crucial input for decision-making by various stakeholders in agriculture: farmers, consumers, retailers, wholesalers, and the Government. These decisions have significant implications including, most importantly, the economic well-being of the farmers. In this paper, our objective is to accurately predict crop prices using historical price infor…

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: 9 pages, 3 figures, 3 tables

  8. arXiv:2304.07341  [pdf, other]

    cs.GT

    Designing Fair, Cost-optimal Auctions based on Deep Learning for Procuring Agricultural Inputs through Farmer Collectives

    Authors: Mayank Ratan Bhardwaj, Bazil Ahmed, Prathik Diwakar, Ganesh Ghalme, Y. Narahari

    Abstract: Procuring agricultural inputs (agri-inputs for short) such as seeds, fertilizers, and pesticides, at desired quality levels and at affordable cost, forms a critical component of agricultural input operations. This is a particularly challenging problem being faced by small and marginal farmers in any emerging economy. Farmer collectives (FCs), which are cooperative societies of farmers, offer an ex…

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 12 pages, 2 figures, 3 tables

  9. arXiv:2302.11048  [pdf, other]

    cs.LG cs.AI

    Adversarial Model for Offline Reinforcement Learning

    Authors: Mohak Bhardwaj, Tengyang Xie, Byron Boots, Nan Jiang, Ching-An Cheng

    Abstract: We propose a novel model-based offline Reinforcement Learning (RL) framework, called Adversarial Model for Offline Reinforcement Learning (ARMOR), which can robustly learn policies to improve upon an arbitrary reference policy regardless of data coverage. ARMOR is designed to optimize policies for the worst-case performance relative to the reference policy through adversarially training a Markov d…

    Submitted 24 December, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted at the Neural Information Processing Systems (NeurIPS), 2023. Mohak Bhardwaj and Tengyang Xie contributed equally to this work. arXiv admin note: text overlap with arXiv:2211.04538

  10. arXiv:2211.04538  [pdf, ps, other]

    cs.LG cs.AI

    ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data

    Authors: Tengyang Xie, Mohak Bhardwaj, Nan Jiang, Ching-An Cheng

    Abstract: We propose a new model-based offline RL framework, called Adversarial Models for Offline Reinforcement Learning (ARMOR), which can robustly learn policies to improve upon an arbitrary baseline policy regardless of data coverage. Based on the concept of relative pessimism, ARMOR is designed to optimize for the worst-case relative performance when facing uncertainty. In theory, we prove that the lea…

    Submitted 8 November, 2022; originally announced November 2022.

  11. arXiv:2204.13923  [pdf, other]

    cs.GT cs.AI cs.MA

    Maxmin Participatory Budgeting

    Authors: Gogulapati Sreedurga, Mayank Ratan Bhardwaj, Y. Narahari

    Abstract: Participatory Budgeting (PB) is a popular voting method by which a limited budget is divided among a set of projects, based on the preferences of voters over the projects. PB is broadly categorised as divisible PB (if the projects are fractionally implementable) and indivisible PB (if the projects are atomic). Egalitarianism, an important objective in PB, has not received much attention in the con…

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted for long oral presentation at IJCAI-2022 main track

  12. arXiv:2110.04669  [pdf, other]

    cs.RO cs.LG

    Leveraging Experience in Lazy Search

    Authors: Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots, Siddhartha Srinivasa

    Abstract: Lazy graph search algorithms are efficient at solving motion planning problems where edge evaluation is the computational bottleneck. These algorithms work by lazily computing the shortest potentially feasible path, evaluating edges along that path, and repeating until a feasible path is found. The order in which edges are selected is critical to minimizing the total number of edge evaluations: a…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Extended journal version accepted for publication at Autonomous Robots; 17 pages. arXiv admin note: substantial text overlap with arXiv:1907.07238

  13. arXiv:2104.13542  [pdf, other]

    cs.RO

    STORM: An Integrated Framework for Fast Joint-Space Model-Predictive Control for Reactive Manipulation

    Authors: Mohak Bhardwaj, Balakumar Sundaralingam, Arsalan Mousavian, Nathan Ratliff, Dieter Fox, Fabio Ramos, Byron Boots

    Abstract: Sampling-based model-predictive control (MPC) is a promising tool for feedback control of robots with complex, non-smooth dynamics, and cost functions. However, the computationally demanding nature of sampling-based MPC algorithms has been a key bottleneck in their application to high-dimensional robotic manipulation problems in the real world. Previous methods have addressed this issue by running…

    Submitted 14 September, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted for oral presentation at the Conference on Robot Learning (CoRL), 2021. Code available at: https://github.com/NVlabs/storm

    Journal ref: 5th Annual Conference on Robot Learning, 2021

  14. arXiv:2012.05909  [pdf, other]

    cs.LG cs.RO

    Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

    Authors: Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

    Abstract: Model-Predictive Control (MPC) is a powerful tool for controlling complex, real-world systems that uses a model to make predictions about future behavior. For each state encountered, MPC solves an online optimization problem to choose a control action that will minimize future cost. This is a surprisingly effective strategy, but real-time performance requirements warrant the use of simple models.…

    Submitted 13 April, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 15 pages

    Journal ref: International Conference on Learning Representations (ICLR), 2021

  15. arXiv:2011.03588  [pdf, other]

    cs.CL

    Hostility Detection Dataset in Hindi

    Authors: Mohit Bhardwaj, Md Shad Akhtar, Asif Ekbal, Amitava Das, Tanmoy Chakraborty

    Abstract: In this paper, we present a novel hostility detection dataset in the Hindi language. We collect and manually annotate ~8200 online posts. The annotated dataset covers four hostility dimensions: fake news, hate speech, offensive, and defamation posts, along with a non-hostile label. The hostile posts are also considered for multi-label tags due to a significant overlap among the hostile classes. We rel…

    Submitted 6 November, 2020; originally announced November 2020.

  16. arXiv:2010.06906  [pdf, other]

    cs.CL cs.LG cs.SI

    No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection

    Authors: Debanjana Kar, Mohit Bhardwaj, Suranjana Samanta, Amar Prakash Azad

    Abstract: The sudden widespread menace created by the present global pandemic COVID-19 has had an unprecedented effect on our lives. Mankind is going through humongous fear and dependence on social media like never before. Fear inevitably leads to panic, speculations, and the spread of misinformation. Many governments have taken measures to curb the spread of such misinformation for public well-being. Besi…

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: 6 pages, 4 figures

  17. arXiv:2001.02153  [pdf, other]

    cs.LG cs.RO stat.ML

    Information Theoretic Model Predictive Q-Learning

    Authors: Mohak Bhardwaj, Ankur Handa, Dieter Fox, Byron Boots

    Abstract: Model-free Reinforcement Learning (RL) works well when experience can be collected cheaply and model-based RL is effective when system dynamics can be modeled accurately. However, both assumptions can be violated in real world problems such as robotics, where querying the system can be expensive and real-world dynamics can be difficult to model. In contrast to RL, Model Predictive Control (MPC) al…

    Submitted 5 May, 2020; v1 submitted 30 December, 2019; originally announced January 2020.

    Comments: Extended version (15 pages) of paper accepted at the 2nd Learning for Dynamics and Control (L4DC) Conference, 2020

  18. arXiv:1907.09591  [pdf, other]

    cs.RO

    Differentiable Gaussian Process Motion Planning

    Authors: Mohak Bhardwaj, Byron Boots, Mustafa Mukadam

    Abstract: Modern trajectory optimization based approaches to motion planning are fast, easy to implement, and effective on a wide range of robotics tasks. However, trajectory optimization algorithms have parameters that are typically set in advance (and rarely discussed in detail). Setting these parameters properly can have a significant impact on the practical performance of the algorithm, sometimes making…

    Submitted 11 March, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: 7 pages, Proceedings of the IEEE Conference on Robotics and Automation (ICRA), 2020

  19. arXiv:1907.07238  [pdf, other]

    cs.RO cs.LG

    Leveraging Experience in Lazy Search

    Authors: Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots, Siddhartha Srinivasa

    Abstract: Lazy graph search algorithms are efficient at solving motion planning problems where edge evaluation is the computational bottleneck. These algorithms work by lazily computing the shortest potentially feasible path, evaluating edges along that path, and repeating until a feasible path is found. The order in which edges are selected is critical to minimizing the total number of edge evaluations: a…

    Submitted 16 July, 2019; originally announced July 2019.

    Comments: 9 pages

  20. arXiv:1711.06391  [pdf, other]

    cs.RO

    Data-driven Planning via Imitation Learning

    Authors: Sanjiban Choudhury, Mohak Bhardwaj, Sankalp Arora, Ashish Kapoor, Gireeja Ranade, Sebastian Scherer, Debadeepta Dey

    Abstract: Robot planning is the process of selecting a sequence of actions that optimize for a task specific objective. The optimal solutions to such tasks are heavily influenced by the implicit structure in the environment, i.e. the configuration of objects in the world. State-of-the-art planning approaches, however, do not exploit this structure, thereby expending valuable effort searching the action spac…

    Submitted 16 November, 2017; originally announced November 2017.

  21. arXiv:1707.03034  [pdf, other]

    cs.RO cs.AI cs.LG

    Learning Heuristic Search via Imitation

    Authors: Mohak Bhardwaj, Sanjiban Choudhury, Sebastian Scherer

    Abstract: Robotic motion planning problems are typically solved by constructing a search tree of valid maneuvers from a start to a goal configuration. Limited onboard computation and real-time planning constraints impose a limit on how large this search tree can grow. Heuristics play a crucial role in such situations by guiding the search towards potentially good directions and consequently minimizing searc…

    Submitted 10 July, 2017; originally announced July 2017.

    Comments: 14 pages

  22. arXiv:1609.09529  [pdf, other]

    stat.ME cs.SI math.PR

    Loss of information in feedforward social networks

    Authors: Simon Stolarczyk, Manisha Bhardwaj, Kevin E. Bassler, Wei Ji Ma, Kresimir Josic

    Abstract: We consider model social networks in which information propagates directionally across layers of rational agents. Each agent makes a locally optimal estimate of the state of the world, and communicates this estimate to agents downstream. When agents receive information from the same source their estimates are correlated. We show that the resulting redundancy can lead to the loss of information abo…

    Submitted 26 September, 2016; originally announced September 2016.

    Journal ref: Loss of information in feedforward social networks. Journal of Complex Networks, cnx032 (2017)

  23. arXiv:1305.1713  [pdf]

    cs.DB

    Optimization of stochastic database cracking

    Authors: Meenesh Bhardwaj

    Abstract: Variant stochastic cracking is a significantly more resilient approach to adaptive indexing. It was shown in [1] that stochastic cracking uses each query as a hint on how to reorganize data, but not blindly so; it gains resilience and avoids performance bottlenecks by deliberately applying certain arbitrary choices in its decision making. It therefore brings adaptive indexing forward to a mature formulation…

    Submitted 8 May, 2013; originally announced May 2013.

  24. arXiv:1209.6129  [pdf]

    cs.DS cs.CE q-bio.QM

    A New Middle Path Approach For Alignements In Blast

    Authors: Deepak Garg, S C Saxena, L M Bhardwaj

    Abstract: This paper deals with a new middle path approach developed for reducing alignment calculations in the BLAST algorithm. This is a new step introduced in the BLAST algorithm between the ungapped and gapped alignments. This middle path step reduces the number of sequences going for gapped alignment. This results in an improvement in speed fo…

    Submitted 27 September, 2012; originally announced September 2012.

    Journal ref: Journal of Biological Systems, Vol. 14, No. 4, pp. 567-581, ISSN 0218-3390, 2006