Showing 1–28 of 28 results for author: Bhattacharya, I

  1. arXiv:2506.08119  [pdf, ps, other]

    cs.AI

    SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents

    Authors: Subhrangshu Nandi, Arghya Datta, Nikhil Vichare, Indranil Bhattacharya, Huzefa Raja, Jing Xu, Shayan Ray, Giuseppe Carenini, Abhi Srivastava, Aaron Chan, Man Ho Woo, Amar Kandola, Brandon Theresa, Francesco Carbone

    Abstract: Large Language Models (LLMs) demonstrate impressive general-purpose reasoning and problem-solving abilities. However, they struggle with executing complex, long-horizon workflows that demand strict adherence to Standard Operating Procedures (SOPs), a critical requirement for real-world industrial automation. Despite this need, there is a lack of public benchmarks that reflect the complexity, struc…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Under review

  2. arXiv:2502.00366  [pdf]

    eess.IV cs.CV

    Prostate-Specific Foundation Models for Enhanced Detection of Clinically Significant Cancer

    Authors: Jeong Hoon Lee, Cynthia Xinran Li, Hassan Jahanandish, Indrani Bhattacharya, Sulaiman Vesal, Lichun Zhang, Shengtian Sang, Moon Hyung Choi, Simon John Christoph Soerensen, Steve Ran Zhou, Elijah Richard Sommer, Richard Fan, Pejman Ghanouni, Yuze Song, Tyler M. Seibert, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Accurate prostate cancer diagnosis remains challenging. Even when using MRI, radiologists exhibit low specificity and significant inter-observer variability, leading to potential delays or inaccuracies in identifying clinically significant cancers. This leads to numerous unnecessary biopsies and risks of missing clinically significant cancers. Here we present prostate vision contrastive network (P…

    Submitted 4 February, 2025; v1 submitted 1 February, 2025; originally announced February 2025.

    Comments: 44 pages

  3. arXiv:2502.00146  [pdf]

    eess.IV cs.AI cs.CV

    Multimodal MRI-Ultrasound AI for Prostate Cancer Detection Outperforms Radiologist MRI Interpretation: A Multi-Center Study

    Authors: Hassan Jahanandish, Shengtian Sang, Cynthia Xinran Li, Sulaiman Vesal, Indrani Bhattacharya, Jeong Hoon Lee, Richard Fan, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Pre-biopsy magnetic resonance imaging (MRI) is increasingly used to target suspicious prostate lesions. This has led to artificial intelligence (AI) applications improving MRI-based detection of clinically significant prostate cancer (CsPCa). However, MRI-detected lesions must still be mapped to transrectal ultrasound (TRUS) images during biopsy, which results in missing CsPCa. This study systemat…

    Submitted 31 January, 2025; originally announced February 2025.

  4. arXiv:2406.14313  [pdf, other]

    cs.CL cs.AI

    Iterative Repair with Weak Verifiers for Few-shot Transfer in KBQA with Unanswerability

    Authors: Riya Sawhney, Samrat Yadav, Indrajit Bhattacharya, Mausam

    Abstract: Real-world applications of KBQA require models to handle unanswerable questions with a limited volume of in-domain labeled training data. We propose the novel task of few-shot transfer for KBQA with unanswerable questions and contribute two new datasets for performance evaluation. We present FUn-FuSIC - a novel solution for our task that extends FuSIC KBQA, the state-of-the-art few-shot transfer m…

    Submitted 21 February, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2403.10849  [pdf, other]

    cs.CL

    RetinaQA: A Robust Knowledge Base Question Answering Model for both Answerable and Unanswerable Questions

    Authors: Prayushi Faldu, Indrajit Bhattacharya, Mausam

    Abstract: An essential requirement for a real-world Knowledge Base Question Answering (KBQA) system is the ability to detect the answerability of questions when generating logical forms. However, state-of-the-art KBQA models assume all questions to be answerable. Recent research has found that such models, when superficially adapted to detect answerability, struggle to satisfactorily identify the different…

    Submitted 2 November, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  6. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  7. arXiv:2312.05334  [pdf, other]

    eess.IV cs.CV

    ProsDectNet: Bridging the Gap in Prostate Cancer Detection via Transrectal B-mode Ultrasound Imaging

    Authors: Sulaiman Vesal, Indrani Bhattacharya, Hassan Jahanandish, Xinran Li, Zachary Kornberg, Steve Ran Zhou, Elijah Richard Sommer, Moon Hyung Choi, Richard E. Fan, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Interpreting traditional B-mode ultrasound images can be challenging due to image artifacts (e.g., shadowing, speckle), leading to low sensitivity and limited diagnostic accuracy. While Magnetic Resonance Imaging (MRI) has been proposed as a solution, it is expensive and not widely available. Furthermore, most biopsies are guided by Transrectal Ultrasound (TRUS) alone and can miss up to 52% cancer…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted in NeurIPS 2023 (Medical Imaging meets NeurIPS Workshop)

  8. arXiv:2311.08894  [pdf, other]

    cs.CL cs.AI

    Few-shot Transfer Learning for Knowledge Base Question Answering: Fusing Supervised Models with In-Context Learning

    Authors: Mayur Patidar, Riya Sawhney, Avinash Singh, Biswajit Chatterjee, Mausam, Indrajit Bhattacharya

    Abstract: Existing Knowledge Base Question Answering (KBQA) architectures are hungry for annotated data, which makes them costly and time-consuming to deploy. We introduce the problem of few-shot transfer learning for KBQA, where the target domain offers only a few labeled examples, but a large labeled training dataset is available in a source domain. We propose a novel KBQA architecture called FuSIC-KBQA th…

    Submitted 13 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: ACL-2024 camera-ready version

  9. arXiv:2311.02961  [pdf, other]

    cs.CL

    Adapting Pre-trained Generative Models for Extractive Question Answering

    Authors: Prabir Mallick, Tapas Nayak, Indrajit Bhattacharya

    Abstract: Pre-trained generative models such as BART, T5, etc. have gained prominence as a preferred method for text generation in various natural language processing tasks, including abstractive long-form question answering (QA) and summarization. However, the potential of generative models in extractive QA tasks, where discriminative models are commonly employed, remains largely unexplored. Discriminative…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted in GEM workshop @ EMNLP 2023

  10. arXiv:2310.00696  [pdf, other]

    cs.CL

    Do the Benefits of Joint Models for Relation Extraction Extend to Document-level Tasks?

    Authors: Pratik Saini, Tapas Nayak, Indrajit Bhattacharya

    Abstract: Two distinct approaches have been proposed for relational triple extraction - pipeline and joint. Joint models, which capture interactions across triples, are the more recent development, and have been shown to outperform pipeline models for sentence-level extraction tasks. Document-level extraction is a more challenging setting where interactions across triples can be long-range, and individual t…

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: Accepted in IJCNLP-AACL 2023 (Short)

  11. arXiv:2302.09887  [pdf, other]

    cs.CL

    90% F1 Score in Relational Triple Extraction: Is it Real ?

    Authors: Pratik Saini, Samiran Pal, Tapas Nayak, Indrajit Bhattacharya

    Abstract: Extracting relational triples from text is a crucial task for constructing knowledge bases. Recent advancements in joint entity and relation extraction models have demonstrated remarkable F1 scores ($\ge 90\%$) in accurately extracting relational triples from free text. However, these models have been evaluated under restrictive experimental settings and unrealistic datasets. They overlook sentenc…

    Submitted 27 October, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted in GenBench workshop @ EMNLP 2023

  12. arXiv:2212.10189  [pdf, other]

    cs.CL cs.AI

    Do I have the Knowledge to Answer? Investigating Answerability of Knowledge Base Questions

    Authors: Mayur Patidar, Prayushi Faldu, Avinash Singh, Lovekesh Vig, Indrajit Bhattacharya, Mausam

    Abstract: When answering natural language questions over knowledge bases, missing facts, incomplete schema and limited scope naturally lead to many questions being unanswerable. While answerability has been explored in other QA settings, it has not been studied for QA over knowledge bases (KBQA). We create GrailQAbility, a new benchmark KBQA dataset with unanswerability, by first identifying various forms o…

    Submitted 24 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  13. arXiv:2211.05100  [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  14. arXiv:2209.14657  [pdf, other]

    eess.IV cs.CV

    Correlated Feature Aggregation by Region Helps Distinguish Aggressive from Indolent Clear Cell Renal Cell Carcinoma Subtypes on CT

    Authors: Karin Stacke, Indrani Bhattacharya, Justin R. Tse, James D. Brooks, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Renal cell carcinoma (RCC) is a common cancer that varies in clinical behavior. Indolent RCC is often low-grade without necrosis and can be monitored without treatment. Aggressive RCC is often high-grade and can cause metastasis and death if not promptly detected and treated. While most kidney cancers are detected on CT scans, grading is based on histology from invasive biopsy or surgery. Determin…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Submitted to Medical Image Analysis

  15. arXiv:2209.02126  [pdf, other]

    eess.IV cs.CV

    Domain Generalization for Prostate Segmentation in Transrectal Ultrasound Images: A Multi-center Study

    Authors: Sulaiman Vesal, Iani Gayo, Indrani Bhattacharya, Shyam Natarajan, Leonard S. Marks, Dean C Barratt, Richard E. Fan, Yipeng Hu, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Prostate biopsy and image-guided treatment procedures are often performed under the guidance of ultrasound fused with magnetic resonance images (MRI). Accurate image fusion relies on accurate segmentation of the prostate on ultrasound images. Yet, the reduced signal-to-noise ratio and artifacts (e.g., speckle and shadowing) in ultrasound images limit the performance of automated prostate segmentat…

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: Accepted to the journal of Medical Image Analysis (MedIA)

  16. arXiv:2208.03609  [pdf, other]

    eess.IV cs.CV cs.LG

    Continual Learning for Tumor Classification in Histopathology Images

    Authors: Veena Kaustaban, Qinle Ba, Ipshita Bhattacharya, Nahil Sobh, Satarupa Mukherjee, Jim Martin, Mohammad Saleh Miri, Christoph Guetter, Amal Chaturvedi

    Abstract: Recent years have seen great advancements in the development of deep learning models for histopathology image analysis in digital pathology applications, evidenced by the increasingly common deployment of these models in both research and clinical settings. Although such models have shown unprecedented performance in solving fundamental computational tasks in DP applications, they suffer from cata…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: Accepted by MOVI, a MICCAI2022 workshop: https://sites.google.com/view/movi2022

  17. arXiv:2112.02164  [pdf, other]

    eess.IV cs.CV

    Bridging the gap between prostate radiology and pathology through machine learning

    Authors: Indrani Bhattacharya, David S. Lim, Han Lin Aung, Xingchen Liu, Arun Seetharaman, Christian A. Kunder, Wei Shao, Simon J. C. Soerensen, Richard E. Fan, Pejman Ghanouni, Katherine J. To'o, James D. Brooks, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Prostate cancer is the second deadliest cancer for American men. While Magnetic Resonance Imaging (MRI) is increasingly used to guide targeted biopsies for prostate cancer diagnosis, its utility remains limited due to high rates of false positives and false negatives as well as low inter-reader agreements. Machine learning methods to detect and localize cancer on prostate MRI can help standardize…

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: Indrani Bhattacharya and David S. Lim contributed equally as first authors. Geoffrey A. Sonn and Mirabela Rusu contributed equally as senior authors

  18. Your instruction may be crisp, but not clear to me!

    Authors: Pradip Pramanick, Chayan Sarkar, Indrajit Bhattacharya

    Abstract: The number of robots deployed in our daily surroundings is ever-increasing. Even in the industrial set-up, the use of coworker robots is increasing rapidly. These cohabitant robots perform various tasks as instructed by co-located human beings. Thus, a natural interaction mechanism plays a big role in the usability and acceptability of the robot, especially by a non-expert user. The recent develop…

    Submitted 23 August, 2020; originally announced August 2020.

    Journal ref: Published in: 2019 28th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)

  19. Enabling human-like task identification from natural conversation

    Authors: Pradip Pramanick, Chayan Sarkar, Balamuralidhar P, Ajay Kattepur, Indrajit Bhattacharya, Arpan Pal

    Abstract: A robot as a coworker or a cohabitant is becoming mainstream day-by-day with the development of low-cost sophisticated hardware. However, an accompanying software stack that can aid the usability of the robotic hardware remains the bottleneck of the process, especially if the robot is not dedicated to a single job. Programming a multi-purpose robot requires an on-the-fly mission scheduling capabil…

    Submitted 29 August, 2020; v1 submitted 23 August, 2020; originally announced August 2020.

    Journal ref: Published in: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  20. arXiv:2008.00119  [pdf, other]

    eess.IV cs.CV

    CorrSigNet: Learning CORRelated Prostate Cancer SIGnatures from Radiology and Pathology Images for Improved Computer Aided Diagnosis

    Authors: Indrani Bhattacharya, Arun Seetharaman, Wei Shao, Rewa Sood, Christian A. Kunder, Richard E. Fan, Simon John Christoph Soerensen, Jeffrey B. Wang, Pejman Ghanouni, Nikola C. Teslovich, James D. Brooks, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Magnetic Resonance Imaging (MRI) is widely used for screening and staging prostate cancer. However, many prostate cancers have subtle features which are not easily identifiable on MRI, resulting in missed diagnoses and alarming variability in radiologist interpretation. Machine learning models have been developed in an effort to improve cancer identification, but current models localize cancer usi…

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: Accepted to MICCAI 2020

  21. arXiv:1901.09854  [pdf, other]

    cs.CV

    Multi-modal dialog for browsing large visual catalogs using exploration-exploitation paradigm in a joint embedding space

    Authors: Indrani Bhattacharya, Arkabandhu Chowdhury, Vikas Raykar

    Abstract: We present a multi-modal dialog system to assist online shoppers in visually browsing through large catalogs. Visual browsing is different from visual search in that it allows the user to explore the wide range of products in a catalog, beyond the exact search matches. We focus on a slightly asymmetric version of the complete multi-modal dialog where the system can understand both text and image q…

    Submitted 29 January, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

    Comments: 10 pages including references, 8 figures. First two authors are equal contributors

  22. arXiv:1809.04487  [pdf, other]

    cs.LG stat.ML

    Discovering Topical Interactions in Text-based Cascades using Hidden Markov Hawkes Processes

    Authors: Srikanta Bedathur, Indrajit Bhattacharya, Jayesh Choudhari, Anirban Dasgupta

    Abstract: Social media conversations unfold based on complex interactions between users, topics and time. While recent models have been proposed to capture network strengths between users, users' topical preferences and temporal patterns between posting and response times, interaction patterns between topics have not been studied. We propose the Hidden Markov Hawkes Process (HMHP) that incorporates topical M…

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Accepted as a short paper at ICDM-2018

  23. arXiv:1705.05893  [pdf, other]

    cs.GR

    Computed Axial Lithography (CAL): Toward Single Step 3D Printing of Arbitrary Geometries

    Authors: Brett Kelly, Indrasen Bhattacharya, Maxim Shusteff, Robert M. Panas, Hayden K. Taylor, Christopher M. Spadaccini

    Abstract: Most additive manufacturing processes today operate by printing voxels (3D pixels) serially point-by-point to build up a 3D part. In some more recently-developed techniques, for example optical printing methods such as projection stereolithography [Zheng et al. 2012], [Tumbleston et al. 2015], parts are printed layer-by-layer by curing full 2D (very thin in one dimension) layers of the 3D part in…

    Submitted 16 May, 2017; originally announced May 2017.

    Comments: 10 pages, 17 figures, ACM SIGGRAPH format

    ACM Class: I.3.1; I.3.3; I.3.5; I.3.7; I.3.8; I.4.0

  24. arXiv:1606.05275  [pdf, other]

    stat.ML cs.CY

    Designing Intelligent Automation based Solutions for Complex Social Problems

    Authors: Sanjay Podder, Janardan Misra, Senthil Kumaresan, Neville Dubash, Indrani Bhattacharya

    Abstract: Deciding effective and timely preventive measures against complex social problems affecting relatively low income geographies is a difficult challenge. There is a strong need to adopt intelligent automation based solutions with low cost imprints to tackle these problems at larger scales. Starting with the hypothesis that analytical modelling and analysis of social phenomena with high accuracy is i…

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: presented at 2016 ICML Workshop on #Data4Good: Machine Learning in Social Good Applications, New York, NY

  25. arXiv:1508.06446  [pdf, ps, other]

    stat.ML cs.LG

    Nested Hierarchical Dirichlet Processes for Multi-Level Non-Parametric Admixture Modeling

    Authors: Lavanya Sita Tekumalla, Priyanka Agrawal, Indrajit Bhattacharya

    Abstract: The Dirichlet Process (DP) is a Bayesian non-parametric prior for infinite mixture modeling, where the number of mixture components grows with the number of data items. The Hierarchical Dirichlet Process (HDP) is an extension of DP for grouped data, often used for non-parametric topic modeling, where each group is a mixture over shared mixture densities. The Nested Dirichlet Process (nDP), on the othe…

    Submitted 27 August, 2015; v1 submitted 26 August, 2015; originally announced August 2015.

    Comments: Proceedings of European Conference of Machine Learning (ECML) 2013

  26. arXiv:1312.0790   

    cs.AI cs.LG stat.ML

    Test Set Selection using Active Information Acquisition for Predictive Models

    Authors: Sneha Chaudhari, Pankaj Dayama, Vinayaka Pandit, Indrajit Bhattacharya

    Abstract: In this paper, we consider active information acquisition when the prediction model is meant to be applied on a targeted subset of the population. The goal is to label a pre-specified fraction of customers in the target or test set by iteratively querying for information from the non-target or training set. The number of queries is limited by an overall budget. Arising in the context of two rather…

    Submitted 14 March, 2014; v1 submitted 3 December, 2013; originally announced December 2013.

    Comments: The paper has been withdrawn by the authors. The current version is incomplete and the work is still ongoing. The algorithm gives poor results for a particular setting and we are working on it. However, we are not planning to submit a revision of the paper. This work is going to take some time and we want to withdraw the current version since it is not in good shape and needs a lot more work to be in publishable condition

  27. arXiv:1205.1456  [pdf, ps, other]

    cs.SI cs.LG physics.soc-ph

    Dynamic Multi-Relational Chinese Restaurant Process for Analyzing Influences on Users in Social Media

    Authors: Himabindu Lakkaraju, Indrajit Bhattacharya, Chiranjib Bhattacharyya

    Abstract: We study the problem of analyzing influence of various factors affecting individual messages posted in social media. The problem is challenging because of various types of influences propagating through the social media network that act simultaneously on any user. Additionally, the topic composition of the influencing factors and the susceptibility of users to these influences evolve over time. Th…

    Submitted 7 May, 2012; originally announced May 2012.

    Comments: 9 pages

  28. arXiv:1111.0045  [pdf, ps, other]

    cs.DB cs.AI

    Query-time Entity Resolution

    Authors: I. Bhattacharya, L. Getoor

    Abstract: Entity resolution is the problem of reconciling database references corresponding to the same real-world entities. Given the abundance of publicly available databases that have unresolved entities, we motivate the problem of query-time entity resolution: quick and accurate resolution for answering queries over such unclean databases at query-time. Since collective entity resolution approaches ---…

    Submitted 31 October, 2011; originally announced November 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 30, pages 621-657, 2007