Skip to main content

Showing 1–10 of 10 results for author: Grover, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03194  [pdf, ps, other

    cs.CV cs.AI cs.LG

    HueManity: Probing Fine-Grained Visual Perception in MLLMs

    Authors: Rynaa Grover, Jayant Sravan Tamarapalli, Sahiti Yerramilli, Nilay Pande

    Abstract: Multimodal Large Language Models (MLLMs) excel at high-level visual reasoning, but their performance on nuanced perceptual tasks remains surprisingly limited. We present HueManity, a benchmark designed to assess visual perception in MLLMs. The dataset comprises 83,850 images featuring two-character alphanumeric strings embedded in Ishihara test style dot patterns, challenging models on precise pat… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  2. arXiv:2506.00785  [pdf, ps, other

    cs.AI cs.CV cs.LG

    GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning

    Authors: Sahiti Yerramilli, Nilay Pande, Rynaa Grover, Jayant Sravan Tamarapalli

    Abstract: This paper introduces GeoChain, a large-scale benchmark for evaluating step-by-step geographic reasoning in multimodal large language models (MLLMs). Leveraging 1.46 million Mapillary street-level images, GeoChain pairs each image with a 21-step chain-of-thought (CoT) question sequence (over 30 million Q&A pairs). These sequences guide models from coarse attributes to fine-grained localization acr… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  3. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  4. arXiv:2407.15227  [pdf, other

    cs.CL cs.SI

    A Community-Centric Perspective for Characterizing and Detecting Anti-Asian Violence-Provoking Speech

    Authors: Gaurav Verma, Rynaa Grover, Jiawei Zhou, Binny Mathew, Jordan Kraemer, Munmun De Choudhury, Srijan Kumar

    Abstract: Violence-provoking speech -- speech that implicitly or explicitly promotes violence against the members of the targeted community, contributed to a massive surge in anti-Asian crimes during the pandemic. While previous works have characterized and built tools for detecting other forms of harmful speech, like fear speech and hate speech, our work takes a community-centric approach to studying anti-… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2024 Main

  5. arXiv:2204.08653  [pdf, other

    cs.SE cs.CL cs.LG

    On The Cross-Modal Transfer from Natural Language to Code through Adapter Modules

    Authors: Divyam Goel, Ramansh Grover, Fatemeh H. Fard

    Abstract: Pre-trained neural Language Models (PTLM), such as CodeBERT, are recently used in software engineering as models pre-trained on large source code corpora. Their knowledge is transferred to downstream tasks (e.g. code clone detection) via fine-tuning. In natural language processing (NLP), other alternatives for transferring the knowledge of PTLMs are explored through using adapters, compact, parame… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: 11 pages, 6 figures, ICPC 2022. 30th International Conference on Program Comprehension (ICPC '22), May 16--17, 2022, Virtual Event, USA}

  6. arXiv:2203.09193  [pdf, other

    cs.SI cs.HC

    Database of Indian Social Media Influencers on Twitter

    Authors: Arshia Arya, Soham De, Dibyendu Mishra, Gazal Shekhawat, Ankur Sharma, Anmol Panda, Faisal Lalani, Parantak Singh, Ramaravind Kommiya Mothilal, Rynaa Grover, Sachita Nishal, Saloni Dash, Shehla Shora, Syeda Zainab Akbar, Joyojeet Pal

    Abstract: Databases of highly networked individuals have been indispensable in studying narratives and influence on social media. To support studies on Twitter in India, we present a systematically categorised database of accounts of influence on Twitter in India, identified and annotated through an iterative process of friends, networks, and self-described profile information, verified manually. We built a… ▽ More

    Submitted 5 May, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: 7 pages (incl. references), 2 figures, 3 tables

  7. arXiv:2111.03906  [pdf, other

    cs.SI

    Insights Into Incitement: A Computational Perspective on Dangerous Speech on Twitter in India

    Authors: Saloni Dash, Rynaa Grover, Gazal Shekhawat, Sukhnidh Kaur, Dibyendu Mishra, Joyojeet Pal

    Abstract: Dangerous speech on social media platforms can be framed as blatantly inflammatory, or be couched in innuendo. It is also centrally tied to who engages it - it can be driven by openly sectarian social media accounts, or through subtle nudges by influential accounts, allowing for complex means of reinforcing vilification of marginalized groups, an increasingly significant problem in the media envir… ▽ More

    Submitted 6 November, 2021; originally announced November 2021.

  8. arXiv:2102.04031  [pdf, other

    cs.SI cs.CY

    Rihanna versus Bollywood: Twitter Influencers and the Indian Farmers' Protest

    Authors: Dibyendu Mishra, Syeda Zainab Akbar, Arshia Arya, Saloni Dash, Rynaa Grover, Joyojeet Pal

    Abstract: A tweet from popular entertainer and businesswoman, Rihanna, bringing attention to farmers' protests around Delhi set off heightened activity on Indian social media. An immediate consequence was the weighing in by Indian politicians, entertainers, media and other influencers on the issue. In this paper, we use data from Twitter and an archive of debunked misinformation stories to understand some o… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: 13 pages, 12 figures

  9. arXiv:1407.0454  [pdf, other

    cs.DB

    AsterixDB: A Scalable, Open Source BDMS

    Authors: Sattam Alsubaiee, Yasser Altowim, Hotham Altwaijry, Alexander Behm, Vinayak Borkar, Yingyi Bu, Michael Carey, Inci Cetindil, Madhusudan Cheelangi, Khurram Faraaz, Eugenia Gabrielova, Raman Grover, Zachary Heilbron, Young-Seok Kim, Chen Li, Guangqiang Li, Ji Mahn Ok, Nicola Onose, Pouria Pirzadeh, Vassilis Tsotras, Rares Vernica, Jian Wen, Till Westmann

    Abstract: AsterixDB is a new, full-function BDMS (Big Data Management System) with a feature set that distinguishes it from other platforms in today's open source Big Data ecosystem. Its features make it well-suited to applications like web data warehousing, social data storage and analysis, and other use cases related to Big Data. AsterixDB has a flexible NoSQL style data model; a query language that suppo… ▽ More

    Submitted 2 July, 2014; originally announced July 2014.

  10. arXiv:1405.1705  [pdf, other

    cs.DB

    Scalable Fault-Tolerant Data Feeds in AsterixDB

    Authors: Raman Grover, Michael J. Carey

    Abstract: In this paper we describe the support for data feed ingestion in AsterixDB, an open-source Big Data Management System (BDMS) that provides a platform for storage and analysis of large volumes of semi-structured data. Data feeds are a mechanism for having continuous data arrive into a BDMS from external sources and incrementally populate a persisted dataset and associated indexes. The need to persi… ▽ More

    Submitted 7 May, 2014; originally announced May 2014.