Showing 1–6 of 6 results for author: Mausam

Searching in archive stat.
  1. arXiv:2310.10399

    cs.LG stat.ML

    Towards Fair and Calibrated Models

    Authors: Anand Brahmbhatt, Vipul Rathore, Mausam, Parag Singla

    Abstract: Recent literature has seen a significant focus on building machine learning models with specific properties such as fairness, i.e., being non-biased with respect to a given set of attributes, calibration, i.e., model confidence being aligned with its predictive accuracy, and explainability, i.e., ability to be understandable to humans. While there has been work focusing on each of these aspects ind…

    Submitted 16 October, 2023; originally announced October 2023.

  2. arXiv:2008.11990

    cs.LG stat.ML

    Neural Learning of One-of-Many Solutions for Combinatorial Problems in Structured Output Spaces

    Authors: Yatin Nandwani, Deepanshu Jindal, Mausam, Parag Singla

    Abstract: Recent research has proposed neural architectures for solving combinatorial problems in structured output spaces. In many such problems, there may exist multiple solutions for a given input, e.g. a partially filled Sudoku puzzle may have many completions satisfying all constraints. Further, we are often interested in finding any one of the possible solutions, without any preference between them. E…

    Submitted 4 April, 2021; v1 submitted 27 August, 2020; originally announced August 2020.

  3. arXiv:2005.00123

    cs.LG cs.CL stat.ML

    Unsupervised Learning of KB Queries in Task-Oriented Dialogs

    Authors: Dinesh Raghu, Nikhil Gupta, Mausam

    Abstract: Task-oriented dialog (TOD) systems often need to formulate knowledge base (KB) queries corresponding to the user intent and use the query results to generate system responses. Existing approaches require dialog datasets to explicitly annotate these KB queries -- these annotations can be time-consuming and expensive. In response, we define the novel problems of predicting the KB query and training…

    Submitted 3 June, 2021; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: Presented at ACL 2021

    Journal ref: Transactions of the Association for Computational Linguistics (2021) 9: 374-390

  4. arXiv:2002.07375

    cs.LG cs.AI stat.ML

    Symbolic Network: Generalized Neural Policies for Relational MDPs

    Authors: Sankalp Garg, Aniket Bajpai, Mausam

    Abstract: A Relational Markov Decision Process (RMDP) is a first-order representation to express all instances of a single probabilistic planning domain with a possibly unbounded number of objects. Early work in RMDPs outputs generalized (instance-independent) first-order policies or value functions as a means to solve all instances of a domain at once. Unfortunately, this line of work met with limited succes…

    Submitted 29 June, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: In Proceedings of ICML 2020. Code can be found at https://github.com/dair-iitd/symnet

  5. arXiv:1902.03081

    cs.LG stat.ML

    Size Independent Neural Transfer for RDDL Planning

    Authors: Sankalp Garg, Aniket Bajpai, Mausam

    Abstract: Neural planners for RDDL MDPs produce deep reactive policies in an offline fashion. These scale well with large domains, but are sample inefficient and time-consuming to train from scratch for each new problem. To mitigate this, recent work has studied neural transfer learning, so that a generic planner trained on other problems of the same domain can rapidly transfer to a new problem. However, th…

    Submitted 4 April, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: Published in ICAPS 2019

  6. arXiv:1805.01216

    cs.LG cs.CL stat.ML

    Disentangling Language and Knowledge in Task-Oriented Dialogs

    Authors: Dinesh Raghu, Nikhil Gupta, Mausam

    Abstract: The Knowledge Base (KB) used for real-world applications, such as booking a movie or restaurant reservation, keeps changing over time. End-to-end neural networks trained for these task-oriented dialogs are expected to be immune to any changes in the KB. However, existing approaches break down when asked to handle such changes. We propose an encoder-decoder architecture (BoSsNet) with a novel Bag-of…

    Submitted 5 April, 2019; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: Published in NAACL-HLT 2019