Skip to main content

Showing 1–7 of 7 results for author: Sudhir, A

Searching in archive cs. Search in all archives.
.
  1. Tree Boosting Methods for Balanced andImbalanced Classification and their Robustness Over Time in Risk Assessment

    Authors: Gissel Velarde, Michael Weichert, Anuj Deshmunkh, Sanjay Deshmane, Anindya Sudhir, Khushboo Sharma, Vaibhav Joshi

    Abstract: Most real-world classification problems deal with imbalanced datasets, posing a challenge for Artificial Intelligence (AI), i.e., machine learning algorithms, because the minority class, which is of extreme interest, often proves difficult to be detected. This paper empirically evaluates tree boosting methods' performance given different dataset sizes and class distributions, from perfectly balanc… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: 14 pages. arXiv admin note: text overlap with arXiv:2303.15218

    Journal ref: Intelligent Systems with Applications 22 (2024) 200354

  2. arXiv:2504.03731  [pdf, other

    cs.AI

    A Benchmark for Scalable Oversight Protocols

    Authors: Abhimanyu Pallavi Sudhir, Jackson Kaunismaa, Arjun Panickssery

    Abstract: As AI agents surpass human capabilities, scalable oversight -- the problem of effectively supplying human feedback to potentially superhuman AI models -- becomes increasingly critical to ensure alignment. While numerous scalable oversight protocols have been proposed, they lack a systematic empirical framework to evaluate and compare them. While recent works have tried to empirically study scalabl… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

    Comments: Accepted at the ICLR 2025 Workshop on Bidirectional Human-AI Alignment (BiAlign)

  3. arXiv:2503.05828  [pdf, other

    cs.AI econ.TH

    Market-based Architectures in RL and Beyond

    Authors: Abhimanyu Pallavi Sudhir, Long Tran-Thanh

    Abstract: Market-based agents refer to reinforcement learning agents which determine their actions based on an internal market of sub-agents. We introduce a new type of market-based algorithm where the state itself is factored into several axes called ``goods'', which allows for greater specialization and parallelism than existing market-based RL algorithms. Furthermore, we argue that market-based algorithm… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Accepted at AAMAS 2025

  4. arXiv:2412.18544  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Consistency Checks for Language Model Forecasters

    Authors: Daniel Paleka, Abhimanyu Pallavi Sudhir, Alejandro Alvarez, Vineeth Bhat, Adam Shen, Evan Wang, Florian Tramèr

    Abstract: Forecasting is a task that is difficult to evaluate: the ground truth can only be known in the future. Recent work showing LLM forecasters rapidly approaching human-level performance begs the question: how can we benchmark and evaluate these forecasters instantaneously? Following the consistency check framework, we measure the performance of forecasters in terms of the consistency of their predict… ▽ More

    Submitted 9 January, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

    Comments: 55 pages, 25 figures. Submitted to ICLR 2025

  5. arXiv:2402.14021  [pdf, ps, other

    cs.GT cs.AI cs.LO

    Betting on what is neither verifiable nor falsifiable

    Authors: Abhimanyu Pallavi Sudhir, Long Tran-Thanh

    Abstract: Prediction markets are useful for estimating probabilities of claims whose truth will be revealed at some fixed time -- this includes questions about the values of real-world events (i.e. statistical uncertainty), and questions about the values of primitive recursive functions (i.e. logical or algorithmic uncertainty). However, they cannot be directly applied to questions without a fixed resolutio… ▽ More

    Submitted 29 January, 2024; originally announced February 2024.

    Comments: 15 pages, 4 figures

    MSC Class: 91B26 (Primary); 03F03 (Secondary) ACM Class: F.4.1; I.2.11

  6. arXiv:2303.15218  [pdf, other

    cs.LG cs.AI

    Evaluating XGBoost for Balanced and Imbalanced Data: Application to Fraud Detection

    Authors: Gissel Velarde, Anindya Sudhir, Sanjay Deshmane, Anuj Deshmunkh, Khushboo Sharma, Vaibhav Joshi

    Abstract: This paper evaluates XGboost's performance given different dataset sizes and class distributions, from perfectly balanced to highly imbalanced. XGBoost has been selected for evaluation, as it stands out in several benchmarks due to its detection performance and speed. After introducing the problem of fraud detection, the paper reviews evaluation metrics for detection systems or binary classifiers,… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: 17 pages, 8 figures, 9 tables, Presented at NVIDIA GTC, The Conference for the Era of AI and the Metaverse, March 23, 2023. [S51129]

  7. arXiv:2002.03058  [pdf, other

    cs.HC cs.CR

    Lessons Learned Developing and Extending a Visual Analytics Solution for Investigative Analysis of Scamming Activities

    Authors: Ronak Tanna, Shivam Dhar, Ashwin Sudhir, Shreyash Devan, Shubham Verma

    Abstract: Cybersecurity analysts work on large communication data sets to perform investigative analysis by painstakingly going over thousands of email conversations to find potential scamming activities and the network of cyber scammers. Traditionally,experts used email clients, database systems and text editors to perform this investigation. With the advent of technology,elaborate tools that summarize dat… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.