Showing 1–29 of 29 results for author: Nigam, S

  1. arXiv:2504.04737  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context

    Authors: Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Shivam Mishra, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: In the landscape of Fact-based Judgment Prediction and Explanation (FJPE), reliance on factual data is essential for developing robust and realistic AI-driven decision-making tools. This paper introduces TathyaNyaya, the largest annotated dataset for FJPE tailored to the Indian legal context, encompassing judgments from the Supreme Court of India and various High Courts. Derived from the Hindi ter…

    Submitted 7 April, 2025; originally announced April 2025.

  2. arXiv:2504.03486  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej

    Authors: Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Ajay Varghese Thomas, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: Automating legal document drafting can significantly enhance efficiency, reduce manual effort, and streamline legal workflows. While prior research has explored tasks such as judgment prediction and case summarization, the structured generation of private legal documents in the Indian legal domain remains largely unaddressed. To bridge this gap, we introduce VidhikDastaavej, a novel, anonymized da…

    Submitted 4 April, 2025; originally announced April 2025.

  3. arXiv:2502.05836  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification

    Authors: Shubham Kumar Nigam, Tanmay Dubey, Govind Sharma, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: In this paper, we address the task of semantic segmentation of legal documents through rhetorical role classification, with a focus on Indian legal judgments. We introduce LegalSeg, the largest annotated dataset for this task, comprising over 7,000 documents and 1.4 million sentences, labeled with 7 rhetorical roles. To benchmark performance, we evaluate multiple state-of-the-art models, including…

    Submitted 9 February, 2025; originally announced February 2025.

    Comments: Accepted at NAACL 2025

  4. arXiv:2412.08385  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis

    Authors: Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Shivam Mishra, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: The integration of artificial intelligence (AI) in legal judgment prediction (LJP) has the potential to transform the legal landscape, particularly in jurisdictions like India, where a significant backlog of cases burdens the legal system. This paper introduces NyayaAnumana, the largest and most diverse corpus of Indian legal cases compiled for LJP, encompassing a total of 7,02,945 preprocessed ca…

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: Accepted at COLING 2025

  5. arXiv:2410.10542  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models

    Authors: Shubham Kumar Nigam, Aniket Deroy, Subhankar Maity, Arnab Bhattacharya

    Abstract: This study investigates judgment prediction in a realistic scenario within the context of Indian judgments, utilizing a range of transformer-based models, including InLegalBERT, BERT, and XLNet, alongside LLMs such as Llama-2 and GPT-3.5 Turbo. In this realistic scenario, we simulate how judgments are predicted at the point when a case is presented for a decision in court, using only the informati…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Accepted at NLLP at EMNLP 2024

  6. arXiv:2408.06693  [pdf, other]

    cs.CV cs.AI cs.CG

    DC3DO: Diffusion Classifier for 3D Objects

    Authors: Nursena Koprucu, Meher Shashwat Nigam, Shicheng Xu, Biruk Abere, Gabriele Dominici, Andrew Rodriguez, Sharvaree Vadgama, Berfin Inal, Alberto Tono

    Abstract: Inspired by Geoffrey Hinton's emphasis on generative modeling, "To recognize shapes, first learn to generate them," we explore the use of 3D diffusion models for object classification. Leveraging the density estimates from these models, our approach, the Diffusion Classifier for 3D Objects (DC3DO), enables zero-shot classification of 3D shapes without additional training. On average, our method achiev…

    Submitted 13 August, 2024; originally announced August 2024.

  7. arXiv:2406.04136  [pdf, other]

    cs.CL cs.AI cs.LG

    Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts

    Authors: Shubham Kumar Nigam, Anurag Sharma, Danush Khanna, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: In the era of Large Language Models (LLMs), predicting judicial outcomes poses significant challenges due to the complexity of legal proceedings and the scarcity of expert-annotated datasets. Addressing this, we introduce Prediction with Explanation (PredEx), the largest expert-annotated dataset for legal judgment prediction and explanation in the Indian context, featuri…

    Submitted 6 June, 2024; originally announced June 2024.

  8. arXiv:2311.13350  [pdf, ps, other]

    cs.CL cs.AI cs.IR cs.LG

    Fact-based Court Judgment Prediction

    Authors: Shubham Kumar Nigam, Aniket Deroy

    Abstract: This extended abstract extends the research presented in "ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation" (Malik et al., 2021), focusing on fact-based judgment prediction within the context of Indian legal documents. We introduce two distinct problem variations: one based solely on facts, and another combining facts with rulings from lower courts…

    Submitted 22 November, 2023; originally announced November 2023.

  9. arXiv:2310.11049  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation

    Authors: Shubham Kumar Nigam, Aniket Deroy, Noel Shallum, Ayush Kumar Mishra, Anup Roy, Shubham Kumar Mishra, Arnab Bhattacharya, Saptarshi Ghosh, Kripabandhu Ghosh

    Abstract: This paper describes our submission to the SemEval-2023 for Task 6 on LegalEval: Understanding Legal Texts. Our submission concentrated on three subtasks: Legal Named Entity Recognition (L-NER) for Task-B, Legal Judgment Prediction (LJP) for Task-C1, and Court Judgment Prediction with Explanation (CJPE) for Task-C2. We conducted various experiments on these subtasks and presented the results in de…

    Submitted 17 October, 2023; originally announced October 2023.

    Journal ref: https://aclanthology.org/2023.semeval-1.180

  10. arXiv:2309.14735  [pdf, other]

    cs.CL cs.AI

    Legal Question-Answering in the Indian Context: Efficacy, Challenges, and Potential of Modern AI Models

    Authors: Shubham Kumar Nigam, Shubham Kumar Mishra, Ayush Kumar Mishra, Noel Shallum, Arnab Bhattacharya

    Abstract: Legal QA platforms bear the promise to metamorphose the manner in which legal experts engage with jurisprudential documents. In this exposition, we embark on a comparative exploration of contemporary AI frameworks, gauging their adeptness in catering to the unique demands of the Indian legal milieu, with a keen emphasis on Indian Legal Question Answering (AILQA). Our discourse zeroes in on an arra…

    Submitted 16 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

  11. arXiv:2307.03891  [pdf, other]

    cs.RO cs.MA

    MARBLER: An Open Platform for Standardized Evaluation of Multi-Robot Reinforcement Learning Algorithms

    Authors: Reza Torbati, Shubham Lohiya, Shivika Singh, Meher Shashwat Nigam, Harish Ravichandar

    Abstract: Multi-Agent Reinforcement Learning (MARL) has enjoyed significant recent progress thanks, in part, to the integration of deep learning techniques for modeling interactions in complex environments. This is naturally starting to benefit multi-robot systems (MRS) in the form of multi-robot RL (MRRL). However, existing infrastructure to train and evaluate policies predominantly focuses on the challenges…

    Submitted 21 October, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 7 pages, 3 figures, accepted to MRS 2023, for the associated website, see https://shubhlohiya.github.io/MARBLER/, resubmitting the camera ready version of the paper

  12. arXiv:2211.16882  [pdf, other]

    cs.CV cs.RO

    MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves

    Authors: Pranjali Pathre, Anurag Sahu, Ashwin Rao, Avinash Prabhu, Meher Shashwat Nigam, Tanvi Karandikar, Harit Pandya, K. Madhava Krishna

    Abstract: In this paper, we propose and showcase, for the first time, monocular multi-view layout estimation for warehouse racks and shelves. Unlike typical layout estimation methods, MVRackLay estimates multi-layered layouts, wherein each layer corresponds to the layout of a shelf within a rack. Given a sequence of images of a warehouse scene, a dual-headed Convolutional-LSTM architecture outputs segmented…

    Submitted 30 November, 2022; originally announced November 2022.

    Journal ref: IEEE International Conference on Robotics and Biomimetics (ROBIO) 2022

  13. arXiv:2211.16360  [pdf, ps, other]

    cs.SI

    Is Twitter Enough? Investigating Situational Awareness in Social and Print Media during the Second COVID-19 Wave in India

    Authors: Ishita Vohra, Meher Shashwat Nigam, Aryan Sakaria, Amey Kudari, Nimmi Rangaswamy

    Abstract: The pandemic required efficient allocation of public resources and transforming existing ways of societal functions. To manage any crisis, governments and public health researchers exploit the information available to them in order to make informed decisions, also defined as situational awareness. Gathering situational awareness using social media has been functional to manage epidemics. Previous…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Published at 2022 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

  14. arXiv:2211.02246  [pdf]

    cs.CR

    DatChain -- Blockchain implementation in Data transfer for IoT Devices

    Authors: Om Rajput, Suyash Nigam, M. J. Chowdhury, Kayalvizhi Jayavel

    Abstract: Currently, the IoT ecosystem is comprised of fully connected smart devices that exchange data to provide more automated, precise, and fast decisions. This idealised situation can only be accomplished if a system for data transactions is processed efficiently and security is ensured with high scalability and practicability. The integrity of data must be maintained during the exchange or transfer of…

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: Keywords - Blockchain, Internet of Things, IOTA, Tangle, Data transfer, IoT Data Analytics

  15. arXiv:2204.07853  [pdf, other]

    cs.CL cs.IR cs.LG

    nigam@COLIEE-22: Legal Case Retrieval and Entailment using Cascading of Lexical and Semantic-based models

    Authors: Shubham Kumar Nigam, Navansh Goel

    Abstract: This paper describes our submission to the Competition on Legal Information Extraction/Entailment 2022 (COLIEE-2022) workshop on case law competition for tasks 1 and 2. Task 1 is a legal case retrieval task, which involves reading a new case and extracting supporting cases from the provided case law corpus to support the decision. Task 2 is the legal case entailment task, which involves the identi…

    Submitted 16 April, 2022; originally announced April 2022.

    Comments: COLIEE-2022 Workshop paper run in association with the International Workshop in Juris-Informatics (JURISIN 2022)

  16. arXiv:2203.04111  [pdf, other]

    cs.CL cs.AI cs.LG

    Plumeria at SemEval-2022 Task 6: Robust Approaches for Sarcasm Detection for English and Arabic Using Transformers and Data Augmentation

    Authors: Shubham Kumar Nigam, Mosab Shaheen

    Abstract: This paper describes our submission to SemEval-2022 Task 6 on sarcasm detection and its five subtasks for English and Arabic. Sarcasm conveys a meaning which contradicts the literal meaning, and it is mainly found on social networks. It has a significant role in understanding the intention of the user. For detecting sarcasm, we used deep learning techniques based on transformers due to its success…

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: SemEval-2022 workshop paper, submitted in NAACL-2022 conference. 8 figures and 29 tables. 8 main pages, 4 appendix pages

  17. arXiv:2112.01836  [pdf, other]

    cs.CL cs.AI cs.LG

    Semantic Segmentation of Legal Documents via Rhetorical Roles

    Authors: Vijit Malik, Rishabh Sanjay, Shouvik Kumar Guha, Angshuman Hazarika, Shubham Nigam, Arnab Bhattacharya, Ashutosh Modi

    Abstract: Legal documents are unstructured, use legal jargon, and have considerable length, making them difficult to process automatically via conventional text processing techniques. A legal document processing system would benefit substantially if the documents could be segmented into coherent information units. This paper proposes a new corpus of legal documents annotated (with the help of legal experts)…

    Submitted 7 November, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 19 pages, Accepted at Natural Legal Language Processing Workshop, EMNLP 2022

  18. arXiv:2105.13562  [pdf, other]

    cs.CL cs.AI

    ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation

    Authors: Vijit Malik, Rishabh Sanjay, Shubham Kumar Nigam, Kripa Ghosh, Shouvik Kumar Guha, Arnab Bhattacharya, Ashutosh Modi

    Abstract: An automated system that could assist a judge in predicting the outcome of a case would help expedite the judicial process. For such a system to be practically useful, predictions by the system should be explainable. To promote research in developing such a system, we introduce ILDC (Indian Legal Documents Corpus). ILDC is a large corpus of 35k Indian Supreme Court cases annotated with original co…

    Submitted 31 May, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted at ACL 2021, 17 Pages (9 Pages main paper, 4 pages references, 4 pages appendix)

  19. Electron trapping and detrapping in an oxide two-dimensional electron gas: The role of ferroelastic twin walls

    Authors: Shashank Kumar Ojha, Sankalpa Hazra, Prithwijit Mandal, Ranjan Kumar Patel, Shivam Nigam, Siddharth Kumar, S. Middey

    Abstract: The choice of electrostatic gating over the conventional chemical doping for phase engineering of quantum materials is attributed to the fact that the former can reversibly tune the carrier density without affecting the system's level of disorder. However, this proposition seems to break down in field-effect transistors involving SrTiO$_3$ (STO) based two-dimensional electron gases. Such peculiar…

    Submitted 23 May, 2021; originally announced May 2021.

    Comments: 5 Figures

    Journal ref: Physical Review Applied 15, 054008 (2021)

  20. Synthetic 3D Data Generation Pipeline for Geometric Deep Learning in Architecture

    Authors: Stanislava Fedorova, Alberto Tono, Meher Shashwat Nigam, Jiayao Zhang, Amirhossein Ahmadnia, Cecilia Bolognesi, Dominik L. Michels

    Abstract: With the growing interest in deep learning algorithms and computational design in the architectural field, the need for large, accessible and diverse architectural datasets increases. We decided to tackle this problem by constructing a field-specific synthetic data generation pipeline that generates an arbitrary amount of 3D data along with the associated 2D and 3D annotations. The variety of anno…

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: Project Page: https://cdinstitute.github.io/Building-Dataset-Generator/

  21. Monocular Multi-Layer Layout Estimation for Warehouse Racks

    Authors: Meher Shashwat Nigam, Avinash Prabhu, Anurag Sahu, Puru Gupta, Tanvi Karandikar, N. Sai Shankar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna

    Abstract: Given a monocular colour image of a warehouse rack, we aim to predict the bird's-eye view layout for each shelf in the rack, which we term as multi-layer layout prediction. To this end, we present RackLay, a deep neural network for real-time shelf layout estimation from a single image. Unlike previous layout estimation methods, which provide a single layout for the dominant ground plane alone, Rac…

    Submitted 28 October, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Visit our project repository at https://github.com/Avinash2468/RackLay

  22. arXiv:1904.07964  [pdf, other]

    cs.LG cs.CG cs.NE stat.ML

    3D Shape Synthesis for Conceptual Design and Optimization Using Variational Autoencoders

    Authors: Wentai Zhang, Zhangsihao Yang, Haoliang Jiang, Suyash Nigam, Soji Yamakawa, Tomotake Furuhata, Kenji Shimada, Levent Burak Kara

    Abstract: We propose a data-driven 3D shape design method that can learn a generative model from a corpus of existing designs, and use this model to produce a wide range of new designs. The approach learns an encoding of the samples in the training corpus using an unsupervised variational autoencoder-decoder architecture, without the need for an explicit parametric representation of the original designs. To…

    Submitted 16 April, 2019; originally announced April 2019.

    Comments: Preprint accepted by ASME IDETC/CIE 2019

  23. arXiv:1904.01580  [pdf, other]

    physics.flu-dyn

    Scaling of the Puffing Strouhal Number for Buoyant Jets

    Authors: Nicholas T. Wimer, Caelan Lapointe, Jason D. Christopher, Siddharth P. Nigam, Torrey R. S. Hayden, Aniruddha Upadhye, Mark Strobel, Gregory B. Rieker, Peter E. Hamlington

    Abstract: Prior research has shown that round and planar buoyant jets "puff" at a frequency that depends on the balance of momentum and buoyancy fluxes at the inlet, as parametrized by the Richardson number. Experiments have revealed the existence of scaling relations between the Strouhal number of the puffing and the inlet Richardson number, but geometry-specific relations are required when the characteris…

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: 4 figures

  24. arXiv:1505.00143  [pdf]

    cond-mat.mes-hall physics.atm-clus physics.chem-ph

    Raman and Infrared spectra of (BaF2)n (n=1-6) clusters

    Authors: Ratnesh K. Pandey, Kevin Waters, Sandeep Nigam, Ravindra Pandey, Avinash C. Pandey

    Abstract: The vibrational properties of alkaline-earth metal fluoride clusters (BaF2)n (n=1-6) are investigated in the framework of density functional theory. The calculated Raman and Infrared (IR) spectra reveal a shift in Raman and IR peak positions towards the lower frequency region with increasing cluster size. Further, the calculated spectra have been compared with the experimental vibrational spectr…

    Submitted 1 May, 2015; originally announced May 2015.

    Comments: 3 pages, 4 figures

  25. arXiv:1409.3942  [pdf]

    cs.CL cs.IR

    Polarity detection movie reviews in hindi language

    Authors: Richa Sharma, Shweta Nigam, Rekha Jain

    Abstract: Nowadays people are actively involved in giving comments and reviews on social networking websites and other websites such as shopping and news websites. A large number of people share their opinions on the web every day, and as a result a large amount of user data is collected. Users also find it a trivial task to read all the reviews and then reach a decision. It would be better if these r…

    Submitted 13 September, 2014; originally announced September 2014.

  26. arXiv:1408.3829  [pdf]

    cs.IR cs.CL

    Opinion mining of movie reviews at document level

    Authors: Richa Sharma, Shweta Nigam, Rekha Jain

    Abstract: The whole world is changing rapidly, and with current technologies the Internet has become an essential need for everyone. The Web is used in every field. Most people use the Web for common purposes such as online shopping and chatting. During online shopping, users give a large number of reviews/opinions that reflect whether the product is good or bad. These reviews need to be explored, ana…

    Submitted 17 August, 2014; originally announced August 2014.

    Comments: International Journal on Information Theory (IJIT), Vol.3, No.3, July 2014

  27. arXiv:1406.3714  [pdf]

    cs.CL cs.IR

    Mining of product reviews at aspect level

    Authors: Richa Sharma, Shweta Nigam, Rekha Jain

    Abstract: Today's world is a world of the Internet: almost all work, from a simple mobile phone recharge to the biggest business deals, can be done with its help. People spend most of their time surfing the Web, which has become a new source of entertainment, education, communication, shopping, etc. Users not only use these websites but also give their feedback and…

    Submitted 14 June, 2014; originally announced June 2014.

    Journal ref: International Journal in Foundations of Computer Science & Technology (IJFCST), Vol.4, No.3, May 2014

  28. arXiv:1404.4935  [pdf]

    cs.IR cs.CL

    Opinion Mining In Hindi Language: A Survey

    Authors: Richa Sharma, Shweta Nigam, Rekha Jain

    Abstract: Opinions are very important in the life of human beings. These opinions help humans make decisions. As the impact of the Web is increasing day by day, Web documents can be seen as a new source of opinion for human beings. The Web contains a huge amount of information generated by users through blogs, forum entries, social networking websites, and so on. To analyze this large am…

    Submitted 19 April, 2014; originally announced April 2014.

    Journal ref: International Journal in Foundations of Computer Science & Technology (IJFCST), Vol.4, No.2, March 2014

  29. arXiv:1205.4968  [pdf]

    cs.DS

    SubGraD- An Approach for Subgraph Detection

    Authors: Akshara Pande, Vivekanand Pant, S. Nigam

    Abstract: A new approach to graph matching is introduced in this paper, which efficiently solves the problem of graph isomorphism and subgraph isomorphism. In this paper we introduce a new approach called SubGraD for query graph detection in a source graph. First, consider the model graph (query graph) and form the possible sets, called model sets, starting from the chosen initial node or starter. Simil…

    Submitted 22 May, 2012; originally announced May 2012.