Showing 1–5 of 5 results for author: Kalra, J

  1. arXiv:2506.19548  [pdf, ps, other]

    cs.CL cs.IR

    Health Sentinel: An AI Pipeline For Real-time Disease Outbreak Detection

    Authors: Devesh Pant, Rishi Raj Grandhe, Vipin Samaria, Mukul Paul, Sudhir Kumar, Saransh Khanna, Jatin Agrawal, Jushaan Singh Kalra, Akhil VSSG, Satish V Khalikar, Vipin Garg, Himanshu Chauhan, Pranay Verma, Neha Khandelwal, Soma S Dhavala, Minesh Mathew

    Abstract: Early detection of disease outbreaks is crucial to ensure timely intervention by the health authorities. Due to the challenges associated with traditional indicator-based surveillance, monitoring informal sources such as online media has become increasingly popular. However, owing to the number of online articles getting published every day, manual screening of the articles is impractical. To addre…

    Submitted 24 June, 2025; originally announced June 2025.

  2. arXiv:2506.15862  [pdf, ps, other]

    cs.IR cs.AI cs.CL

    MoR: Better Handling Diverse Queries with a Mixture of Sparse, Dense, and Human Retrievers

    Authors: Jushaan Singh Kalra, Xinran Zhao, To Eun Kim, Fengyu Cai, Fernando Diaz, Tongshuang Wu

    Abstract: Retrieval-augmented Generation (RAG) is powerful, but its effectiveness hinges on which retrievers we use and how. Different retrievers offer distinct, often complementary signals: BM25 captures lexical matches; dense retrievers, semantic similarity. Yet in practice, we typically fix a single retriever based on heuristics, which fails to generalize across diverse information needs. Can we dynamica…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 19 pages, 3 figures

  3. arXiv:2310.13856  [pdf, other]

    cs.CL

    Implications of Annotation Artifacts in Edge Probing Test Datasets

    Authors: Sagnik Ray Choudhury, Jushaan Kalra

    Abstract: Edge probing tests are classification tasks that test for grammatical knowledge encoded in token representations coming from contextual encoders such as large language models (LLMs). Many LLM encoders have shown high performance in EP tests, leading to conjectures about their ability to encode linguistic knowledge. However, a large body of research claims that the tests necessarily do not measure…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted CoNLL 2023, code: https://github.com/Josh1108/EPtest.git

    ACM Class: I.2.7

  4. arXiv:2110.12780  [pdf, other]

    cs.CL

    Battling Hateful Content in Indic Languages HASOC '21

    Authors: Aditya Kadam, Anmol Goel, Jivitesh Jain, Jushaan Singh Kalra, Mallika Subramanian, Manvith Reddy, Prashant Kodali, T. H. Arjun, Manish Shrivastava, Ponnurangam Kumaraguru

    Abstract: The extensive rise in consumption of online social media (OSMs) by a large number of people poses a critical problem of curbing the spread of hateful content on these platforms. With the growing usage of OSMs in multiple languages, the task of detecting and characterizing hate becomes more complex. The subtle variations of code-mixed texts along with switching scripts only add to the complexity. T…

    Submitted 5 November, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: 12 pages, 6 figures, 2 tables, Accepted at FIRE 2021, CEUR Workshop Proceedings (http://fire.irsi.res.in/fire/2021/home)

  5. arXiv:2004.12283  [pdf, other]

    cs.SI physics.soc-ph

    Hierarchical Clustering of World Cuisines

    Authors: Tript Sharma, Utkarsh Upadhyay, Jushaan Kalra, Sakshi Arora, Saad Ahmad, Bhavay Aggarwal, Ganesh Bagler

    Abstract: Cultures across the world have evolved to have unique patterns despite shared ingredients and cooking techniques. Using data obtained from RecipeDB, an online resource for recipes, we extract patterns in 26 world cuisines and further probe for their inter-relatedness. By application of frequent itemset mining and ingredient authenticity we characterize the quintessential patterns in the cuisines a…

    Submitted 25 April, 2020; originally announced April 2020.

    Comments: 36th IEEE International Conference on Data Engineering (ICDE 2020), DECOR Workshop; 6 pages, 6 figures, 1 table