Skip to main content

Showing 1–50 of 96 results for author: Varshney, K R

.
  1. arXiv:2506.05586  [pdf, ps, other

    cs.LG cs.AI

    CoFrNets: Interpretable Neural Architecture Inspired by Continued Fractions

    Authors: Isha Puri, Amit Dhurandhar, Tejaswini Pedapati, Kartikeyan Shanmugam, Dennis Wei, Kush R. Varshney

    Abstract: In recent years there has been a considerable amount of research on local post hoc explanations for neural networks. However, work on building interpretable neural architectures has been relatively sparse. In this paper, we present a novel neural architecture, CoFrNet, inspired by the form of continued fractions which are known to have many attractive properties in number theory, such as fast conv… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 2021, vol 34, pp 21668-21690

  2. arXiv:2505.18422  [pdf

    cs.CY

    A Task-Driven Human-AI Collaboration: When to Automate, When to Collaborate, When to Challenge

    Authors: Saleh Afroogh, Kush R. Varshney, Jason DCruz

    Abstract: According to several empirical investigations, despite enhancing human capabilities, human-AI cooperation frequently falls short of expectations and fails to reach true synergy. We propose a task-driven framework that reverses prevalent approaches by assigning AI roles according to how the task's requirements align with the capabilities of AI technology. Three major AI roles are identified through… ▽ More

    Submitted 28 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  3. arXiv:2503.05780  [pdf, other

    cs.CY cs.HC

    AI Risk Atlas: Taxonomy and Tooling for Navigating AI Risks and Resources

    Authors: Frank Bagehorn, Kristina Brimijoin, Elizabeth M. Daly, Jessica He, Michael Hind, Luis Garces-Erice, Christopher Giblin, Ioana Giurgiu, Jacquelyn Martino, Rahul Nair, David Piorkowski, Ambrish Rawat, John Richards, Sean Rooney, Dhaval Salwala, Seshu Tirupathi, Peter Urbanetz, Kush R. Varshney, Inge Vejsbjerg, Mira L. Wolf-Bauwens

    Abstract: The rapid evolution of generative AI has expanded the breadth of risks associated with AI systems. While various taxonomies and frameworks exist to classify these risks, the lack of interoperability between them creates challenges for researchers, practitioners, and policymakers seeking to operationalise AI governance. To address this gap, we introduce the AI Risk Atlas, a structured taxonomy that… ▽ More

    Submitted 26 February, 2025; originally announced March 2025.

    Comments: 4.5 page main text, 22 page supporting material, 2 figures

  4. arXiv:2503.00237  [pdf, other

    cs.AI

    Agentic AI Needs a Systems Theory

    Authors: Erik Miehling, Karthikeyan Natesan Ramamurthy, Kush R. Varshney, Matthew Riemer, Djallel Bouneffouf, John T. Richards, Amit Dhurandhar, Elizabeth M. Daly, Michael Hind, Prasanna Sattigeri, Dennis Wei, Ambrish Rawat, Jasmina Gajcin, Werner Geyer

    Abstract: The endowment of AI with reasoning capabilities and some degree of agency is widely viewed as a path toward more capable and generalizable systems. Our position is that the current development of agentic AI requires a more holistic, systems-theoretic perspective in order to fully understand their capabilities and mitigate any emergent risks. The primary motivation for our position is that AI devel… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

  5. arXiv:2502.05148  [pdf, ps, other

    cs.CY cs.CL

    An Annotated Reading of 'The Singer of Tales' in the LLM Era

    Authors: Kush R. Varshney

    Abstract: The Parry-Lord oral-formulaic theory was a breakthrough in understanding how oral narrative poetry is learned, composed, and transmitted by illiterate bards. In this paper, we provide an annotated reading of the mechanism underlying this theory from the lens of large language models (LLMs) and generative artificial intelligence (AI). We point out the the similarities and differences between oral c… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  6. arXiv:2501.12405  [pdf, other

    cs.CY cs.AI cs.CL

    Scopes of Alignment

    Authors: Kush R. Varshney, Zahra Ashktorab, Djallel Bouneffouf, Matthew Riemer, Justin D. Weisz

    Abstract: Much of the research focus on AI alignment seeks to align large language models and other foundation models to the context-less and generic values of helpfulness, harmlessness, and honesty. Frontier model providers also strive to align their models with these values. In this paper, we motivate why we need to move beyond such a limited conception and propose three dimensions for doing so. The first… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: The 2nd International Workshop on AI Governance (AIGOV) held in conjunction with AAAI 2025

  7. arXiv:2412.07724  [pdf, other

    cs.CL

    Granite Guardian

    Authors: Inkit Padhi, Manish Nagireddy, Giandomenico Cornacchia, Subhajit Chaudhury, Tejaswini Pedapati, Pierre Dognin, Keerthiram Murugesan, Erik Miehling, Martín Santillán Cooper, Kieran Fraser, Giulio Zizzo, Muhammad Zaid Hameed, Mark Purcell, Michael Desmond, Qian Pan, Zahra Ashktorab, Inge Vejsbjerg, Elizabeth M. Daly, Michael Hind, Werner Geyer, Ambrish Rawat, Kush R. Varshney, Prasanna Sattigeri

    Abstract: We introduce the Granite Guardian models, a suite of safeguards designed to provide risk detection for prompts and responses, enabling safe and responsible use in combination with any large language model (LLM). These models offer comprehensive coverage across multiple risk dimensions, including social bias, profanity, violence, sexual content, unethical behavior, jailbreaking, and hallucination-r… ▽ More

    Submitted 16 December, 2024; v1 submitted 10 December, 2024; originally announced December 2024.

  8. arXiv:2410.15467  [pdf, other

    cs.CL cs.AI cs.HC

    Hey GPT, Can You be More Racist? Analysis from Crowdsourced Attempts to Elicit Biased Content from Generative AI

    Authors: Hangzhi Guo, Pranav Narayanan Venkit, Eunchae Jang, Mukund Srinath, Wenbo Zhang, Bonam Mingole, Vipul Gupta, Kush R. Varshney, S. Shyam Sundar, Amulya Yadav

    Abstract: The widespread adoption of large language models (LLMs) and generative AI (GenAI) tools across diverse applications has amplified the importance of addressing societal biases inherent within these technologies. While the NLP community has extensively studied LLM bias, research investigating how non-expert users perceive and interact with biases from these systems remains limited. As these technolo… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  9. arXiv:2409.15398  [pdf, other

    cs.CR cs.AI cs.LG

    Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI

    Authors: Ambrish Rawat, Stefan Schoepf, Giulio Zizzo, Giandomenico Cornacchia, Muhammad Zaid Hameed, Kieran Fraser, Erik Miehling, Beat Buesser, Elizabeth M. Daly, Mark Purcell, Prasanna Sattigeri, Pin-Yu Chen, Kush R. Varshney

    Abstract: As generative AI, particularly large language models (LLMs), become increasingly integrated into production applications, new attack surfaces and vulnerabilities emerge and put a focus on adversarial threats in natural language and multi-modal systems. Red-teaming has gained importance in proactively identifying weaknesses in these systems, while blue-teaming works to protect against such adversar… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  10. arXiv:2408.10392  [pdf, other

    cs.CL cs.LG

    Value Alignment from Unstructured Text

    Authors: Inkit Padhi, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Manish Nagireddy, Pierre Dognin, Kush R. Varshney

    Abstract: Aligning large language models (LLMs) to value systems has emerged as a significant area of research within the fields of AI and NLP. Currently, this alignment process relies on the availability of high-quality supervised and preference data, which can be both time-consuming and expensive to curate or annotate. In this paper, we introduce a systematic end-to-end methodology for aligning LLMs to th… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  11. arXiv:2408.08846  [pdf

    cs.CY

    When Trust is Zero Sum: Automation Threat to Epistemic Agency

    Authors: Emmie Malone, Saleh Afroogh, Jason DCruz, Kush R Varshney

    Abstract: AI researchers and ethicists have long worried about the threat that automation poses to human dignity, autonomy, and to the sense of personal value that is tied to work. Typically, proposed solutions to this problem focus on ways in which we can reduce the number of job losses which result from automation, ways to retrain those that lose their jobs, or ways to mitigate the social consequences of… ▽ More

    Submitted 18 August, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

  12. arXiv:2403.12805  [pdf, other

    cs.AI cs.CL

    Contextual Moral Value Alignment Through Context-Based Aggregation

    Authors: Pierre Dognin, Jesus Rios, Ronny Luss, Inkit Padhi, Matthew D Riemer, Miao Liu, Prasanna Sattigeri, Manish Nagireddy, Kush R. Varshney, Djallel Bouneffouf

    Abstract: Developing value-aligned AI agents is a complex undertaking and an ongoing challenge in the field of AI. Specifically within the domain of Large Language Models (LLMs), the capability to consolidate multiple independently trained dialogue agents, each aligned with a distinct moral value, into a unified system that can adapt to and be aligned with multiple moral values is of paramount importance. I… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  13. arXiv:2403.10638  [pdf, other

    cs.LG cs.CY stat.ML

    A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food

    Authors: Conor M. Artman, Aditya Mate, Ezinne Nwankwo, Aliza Heching, Tsuyoshi Idé, Jiří Navrátil, Karthikeyan Shanmugam, Wei Sun, Kush R. Varshney, Lauri Goldkind, Gidi Kroch, Jaclyn Sawyer, Ian Watson

    Abstract: We developed a common algorithmic solution addressing the problem of resource-constrained outreach encountered by social change organizations with different missions and operations: Breaking Ground -- an organization that helps individuals experiencing homelessness in New York transition to permanent housing and Leket -- the national food bank of Israel that rescues food from farms and elsewhere t… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  14. arXiv:2403.09704  [pdf, other

    cs.CL cs.AI cs.LG

    Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations

    Authors: Swapnaja Achintalwar, Ioana Baldini, Djallel Bouneffouf, Joan Byamugisha, Maria Chang, Pierre Dognin, Eitan Farchi, Ndivhuwo Makondo, Aleksandra Mojsilovic, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Inkit Padhi, Orna Raz, Jesus Rios, Prasanna Sattigeri, Moninder Singh, Siphiwe Thwala, Rosario A. Uceda-Sosa, Kush R. Varshney

    Abstract: The alignment of large language models is usually done by model providers to add or control behaviors that are common or universally understood across use cases and contexts. In contrast, in this article, we present an approach and architecture that empowers application developers to tune a model to their particular values, social norms, laws and other regulations, and orchestrate between potentia… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures

  15. arXiv:2403.06009  [pdf, other

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy , et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen… ▽ More

    Submitted 19 August, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  16. arXiv:2402.08787  [pdf, other

    cs.LG cs.CL

    Rethinking Machine Unlearning for Large Language Models

    Authors: Sijia Liu, Yuanshun Yao, Jinghan Jia, Stephen Casper, Nathalie Baracaldo, Peter Hase, Yuguang Yao, Chris Yuhao Liu, Xiaojun Xu, Hang Li, Kush R. Varshney, Mohit Bansal, Sanmi Koyejo, Yang Liu

    Abstract: We explore machine unlearning (MU) in the domain of large language models (LLMs), referred to as LLM unlearning. This initiative aims to eliminate undesirable data influence (e.g., sensitive or illegal information) and the associated model capabilities, while maintaining the integrity of essential knowledge generation and not affecting causally unrelated information. We envision LLM unlearning bec… ▽ More

    Submitted 6 December, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted by Nature Machine Intelligence

  17. arXiv:2401.14523  [pdf, ps, other

    cs.CY cs.AI cs.CL

    Empathy and the Right to Be an Exception: What LLMs Can and Cannot Do

    Authors: William Kidder, Jason D'Cruz, Kush R. Varshney

    Abstract: Advances in the performance of large language models (LLMs) have led some researchers to propose the emergence of theory of mind (ToM) in artificial intelligence (AI). LLMs can attribute beliefs, desires, intentions, and emotions, and they will improve in their accuracy. Rather than employing the characteristically human method of empathy, they learn to attribute mental states by recognizing lingu… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  18. arXiv:2309.05030  [pdf, other

    cs.CY cs.AI stat.ML

    Decolonial AI Alignment: Openness, Viśe\d{s}a-Dharma, and Including Excluded Knowledges

    Authors: Kush R. Varshney

    Abstract: Prior work has explicated the coloniality of artificial intelligence (AI) development and deployment through mechanisms such as extractivism, automation, sociological essentialism, surveillance, and containment. However, that work has not engaged much with alignment: teaching behaviors to a large language model (LLM) in line with desired values, and has not considered a mechanism that arises withi… ▽ More

    Submitted 2 May, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

  19. arXiv:2305.12620  [pdf, other

    cs.CL

    Keeping Up with the Language Models: Systematic Benchmark Extension for Bias Auditing

    Authors: Ioana Baldini, Chhavi Yadav, Manish Nagireddy, Payel Das, Kush R. Varshney

    Abstract: Bias auditing of language models (LMs) has received considerable attention as LMs are becoming widespread. As such, several benchmarks for bias auditing have been proposed. At the same time, the rapid evolution of LMs can make these benchmarks irrelevant in no time. Bias auditing is further complicated by LM brittleness: when a presumably biased outcome is observed, is it due to model bias or mode… ▽ More

    Submitted 25 September, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

  20. arXiv:2304.00416  [pdf, other

    cs.AI cs.CL cs.CY cs.HC cs.LG

    Towards Healthy AI: Large Language Models Need Therapists Too

    Authors: Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Kush R. Varshney

    Abstract: Recent advances in large language models (LLMs) have led to the development of powerful AI chatbots capable of engaging in natural and human-like conversations. However, these chatbots can be potentially harmful, exhibiting manipulative, gaslighting, and narcissistic behaviors. We define Healthy AI to be safe, trustworthy and ethical. To create healthy AI systems, we present the SafeguardGPT frame… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

  21. arXiv:2302.09190  [pdf, other

    cs.LG cs.CY

    Function Composition in Trustworthy Machine Learning: Implementation Choices, Insights, and Questions

    Authors: Manish Nagireddy, Moninder Singh, Samuel C. Hoffman, Evaline Ju, Karthikeyan Natesan Ramamurthy, Kush R. Varshney

    Abstract: Ensuring trustworthiness in machine learning (ML) models is a multi-dimensional task. In addition to the traditional notion of predictive performance, other notions such as privacy, fairness, robustness to distribution shift, adversarial robustness, interpretability, explainability, and uncertainty quantification are important considerations to evaluate and improve (if deficient). However, these s… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  22. arXiv:2212.06803  [pdf, other

    cs.LG cs.CY stat.ML

    Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting

    Authors: Prasanna Sattigeri, Soumya Ghosh, Inkit Padhi, Pierre Dognin, Kush R. Varshney

    Abstract: In consequential decision-making applications, mitigating unwanted biases in machine learning models that yield systematic disadvantage to members of groups delineated by sensitive attributes such as race and gender is one key intervention to strive for equity. Focusing on demographic parity and equality of opportunity, in this paper we propose an algorithm that improves the fairness of a pre-trai… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted at Neurips 2022

  23. arXiv:2211.01498  [pdf, other

    cs.LG stat.ML

    On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach

    Authors: Dennis Wei, Rahul Nair, Amit Dhurandhar, Kush R. Varshney, Elizabeth M. Daly, Moninder Singh

    Abstract: Interpretable and explainable machine learning has seen a recent surge of interest. We focus on safety as a key motivation behind the surge and make the relationship between interpretability and safety more quantitative. Toward assessing safety, we introduce the concept of maximum deviation via an optimization problem to find the largest deviation of a supervised learning model from a reference mo… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: Published at NeurIPS 2022

  24. arXiv:2210.06475  [pdf, other

    cs.LG cs.CL

    Equi-Tuning: Group Equivariant Fine-Tuning of Pretrained Models

    Authors: Sourya Basu, Prasanna Sattigeri, Karthikeyan Natesan Ramamurthy, Vijil Chenthamarakshan, Kush R. Varshney, Lav R. Varshney, Payel Das

    Abstract: We introduce equi-tuning, a novel fine-tuning method that transforms (potentially non-equivariant) pretrained models into group equivariant models while incurring minimum $L_2$ loss between the feature representations of the pretrained and the equivariant models. Large pretrained models can be equi-tuned for different groups to satisfy the needs of various downstream tasks. Equi-tuned models benef… ▽ More

    Submitted 4 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Journal ref: AAAI 2023

  25. arXiv:2208.10451  [pdf, other

    cs.LG cs.CY stat.ML

    Minimax AUC Fairness: Efficient Algorithm with Provable Convergence

    Authors: Zhenhuan Yang, Yan Lok Ko, Kush R. Varshney, Yiming Ying

    Abstract: The use of machine learning models in consequential decision making often exacerbates societal inequity, in particular yielding disparate impact on members of marginalized groups defined by race and gender. The area under the ROC curve (AUC) is widely used to evaluate the performance of a scoring function in machine learning, but is studied in algorithmic fairness less than other performance metri… ▽ More

    Submitted 28 November, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

  26. arXiv:2208.01305  [pdf, other

    cs.CY

    Humble Machines: Attending to the Underappreciated Costs of Misplaced Distrust

    Authors: Bran Knowles, Jason D'Cruz, John T. Richards, Kush R. Varshney

    Abstract: It is curious that AI increasingly outperforms human decision makers, yet much of the public distrusts AI to make decisions affecting their lives. In this paper we explore a novel theory that may explain one reason for this. We propose that public distrust of AI is a moral consequence of designing systems that prioritize reduction of costs of false positives over less tangible costs of false negat… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    ACM Class: K.4.m

  27. arXiv:2201.09046  [pdf, other

    cs.LG cs.CR

    Differentially Private SGDA for Minimax Problems

    Authors: Zhenhuan Yang, Shu Hu, Yunwen Lei, Kush R. Varshney, Siwei Lyu, Yiming Ying

    Abstract: Stochastic gradient descent ascent (SGDA) and its variants have been the workhorse for solving minimax problems. However, in contrast to the well-studied stochastic gradient descent (SGD) with differential privacy (DP) constraints, there is little work on understanding the generalization (utility) of SGDA with DP constraints. In this paper, we use the algorithmic stability approach to establish th… ▽ More

    Submitted 29 July, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

    Comments: To appear in UAI 2022

  28. arXiv:2110.10790  [pdf, other

    cs.AI cs.HC

    Human-Centered Explainable AI (XAI): From Algorithms to User Experiences

    Authors: Q. Vera Liao, Kush R. Varshney

    Abstract: In recent years, the field of explainable AI (XAI) has produced a vast collection of algorithms, providing a useful toolbox for researchers and practitioners to build XAI applications. With the rich application opportunities, explainability is believed to have moved beyond a demand by data scientists or researchers to comprehend the models they develop, to an essential requirement for people to tr… ▽ More

    Submitted 19 April, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: draft for a book chapter

  29. arXiv:2109.14653  [pdf, other

    cs.LG cs.CY

    An Empirical Study of Accuracy, Fairness, Explainability, Distributional Robustness, and Adversarial Robustness

    Authors: Moninder Singh, Gevorg Ghalachyan, Kush R. Varshney, Reginald E. Bryant

    Abstract: To ensure trust in AI models, it is becoming increasingly apparent that evaluation of models must be extended beyond traditional performance metrics, like accuracy, to other dimensions, such as fairness, explainability, adversarial robustness, and distribution shift. We describe an empirical study to evaluate multiple model types on various metrics along these dimensions on several datasets. Our r… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Journal ref: presented at the 2021 KDD Workshop on Measures and Best Practices for Responsible AI

  30. arXiv:2109.12151  [pdf, other

    cs.LG cs.AI

    AI Explainability 360: Impact and Design

    Authors: Vijay Arya, Rachel K. E. Bellamy, Pin-Yu Chen, Amit Dhurandhar, Michael Hind, Samuel C. Hoffman, Stephanie Houde, Q. Vera Liao, Ronny Luss, Aleksandra Mojsilovic, Sami Mourad, Pablo Pedemonte, Ramya Raghavendra, John Richards, Prasanna Sattigeri, Karthikeyan Shanmugam, Moninder Singh, Kush R. Varshney, Dennis Wei, Yunfeng Zhang

    Abstract: As artificial intelligence and machine learning algorithms become increasingly prevalent in society, multiple stakeholders are calling for these algorithms to provide explanations. At the same time, these stakeholders, whether they be affected citizens, government regulators, domain experts, or system developers, have different explanation needs. To address these needs, in 2019, we created AI Expl… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: text overlap with arXiv:1909.03012

    Journal ref: IAAI 2022

  31. arXiv:2108.08077  [pdf, other

    q-bio.QM cs.LG

    Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences With Attention

    Authors: Kahini Wadhawan, Payel Das, Barbara A. Han, Ilya R. Fischhoff, Adrian C. Castellanos, Arvind Varsani, Kush R. Varshney

    Abstract: Current methods for viral discovery target evolutionarily conserved proteins that accurately identify virus families but remain unable to distinguish the zoonotic potential of newly discovered viruses. Here, we apply an attention-enhanced long-short-term memory (LSTM) deep neural net classifier to a highly conserved viral protein target to predict zoonotic potential across betacoronaviruses. The c… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: 11 pages, 8 figures, 1 table, accepted at ICLR 2021 workshop Machine learning for preventing and combating pandemics

  32. arXiv:2106.09502  [pdf, other

    cs.CL cs.LG

    Biomedical Interpretable Entity Representations

    Authors: Diego Garcia-Olano, Yasumasa Onoe, Ioana Baldini, Joydeep Ghosh, Byron C. Wallace, Kush R. Varshney

    Abstract: Pre-trained language models induce dense entity representations that offer strong performance on entity-centric NLP tasks, but such representations are not immediately interpretable. This can be a barrier to model uptake in important domains such as biomedicine. There has been recent work on general interpretable representation learning (Onoe and Durrett, 2020), but these domain-agnostic represent… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted into Findings of ACL-IJCNLP 2021

  33. arXiv:2106.01410  [pdf, other

    cs.AI

    Uncertainty Quantification 360: A Holistic Toolkit for Quantifying and Communicating the Uncertainty of AI

    Authors: Soumya Ghosh, Q. Vera Liao, Karthikeyan Natesan Ramamurthy, Jiri Navratil, Prasanna Sattigeri, Kush R. Varshney, Yunfeng Zhang

    Abstract: In this paper, we describe an open source Python toolkit named Uncertainty Quantification 360 (UQ360) for the uncertainty quantification of AI models. The goal of this toolkit is twofold: first, to provide a broad range of capabilities to streamline as well as foster the common practices of quantifying, evaluating, improving, and communicating uncertainty in the AI application development lifecycl… ▽ More

    Submitted 3 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Added references

  34. arXiv:2104.04633  [pdf, other

    cs.CY

    Automated Meta-Analysis: A Causal Learning Perspective

    Authors: Lu Cheng, Dmitriy A. Katz-Rogozhnikov, Kush R. Varshney, Ioana Baldini

    Abstract: Meta-analysis is a systematic approach for understanding a phenomenon by analyzing the results of many previously published experimental studies. It is central to deriving conclusions about the summary effect of treatments and interventions in medicine, poverty alleviation, and other applications with social impact. Unfortunately, meta-analysis involves great human effort, rendering a process that… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: 11 pages, 6 figures

  35. arXiv:2102.02279  [pdf, other

    cs.CY

    Insiders and Outsiders in Research on Machine Learning and Society

    Authors: Yu Tao, Kush R. Varshney

    Abstract: A subset of machine learning research intersects with societal issues, including fairness, accountability and transparency, as well as the use of machine learning for social good. In this work, we analyze the scholars contributing to this research at the intersection of machine learning and society through the lens of the sociology of science. By analyzing the authorship of all machine learning pa… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

  36. Disparate Impact Diminishes Consumer Trust Even for Advantaged Users

    Authors: Tim Draws, Zoltán Szlávik, Benjamin Timmermans, Nava Tintarev, Kush R. Varshney, Michael Hind

    Abstract: Systems aiming to aid consumers in their decision-making (e.g., by implementing persuasive techniques) are more likely to be effective when consumers trust them. However, recent research has demonstrated that the machine learning algorithms that often underlie such technology can act unfairly towards specific groups (e.g., by making more favorable predictions for men than for women). An undesired… ▽ More

    Submitted 5 July, 2021; v1 submitted 29 January, 2021; originally announced January 2021.

    Journal ref: Persuasive Technology, Cham, 2021, p. 135-149

  37. arXiv:2101.02032  [pdf, other

    cs.CY cs.AI

    Socially Responsible AI Algorithms: Issues, Purposes, and Challenges

    Authors: Lu Cheng, Kush R. Varshney, Huan Liu

    Abstract: In the current era, people and society have grown increasingly reliant on artificial intelligence (AI) technologies. AI has the potential to drive us towards a future in which all of humanity flourishes. It also comes with substantial risks for oppression and calamity. Discussions about whether we should (re)trust AI have repeatedly emerged in recent years and in many quarters, including industry,… ▽ More

    Submitted 21 August, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: 45 pages, 8 figures

    Journal ref: Journal of Artificial Intelligence Research 71 (2021) 1137-1181

  38. arXiv:2012.12141  [pdf, other

    cs.LG stat.ML

    Learning to Initialize Gradient Descent Using Gradient Descent

    Authors: Kartik Ahuja, Amit Dhurandhar, Kush R. Varshney

    Abstract: Non-convex optimization problems are challenging to solve; the success and computational expense of a gradient descent algorithm or variant depend heavily on the initialization strategy. Often, either random initialization is used or initialization rules are carefully designed by exploiting the nature of the problem class. As a simple alternative to hand-crafted initialization rules, we propose an… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

  39. arXiv:2010.16412  [pdf, other

    cs.LG stat.ML

    Empirical or Invariant Risk Minimization? A Sample Complexity Perspective

    Authors: Kartik Ahuja, Jun Wang, Amit Dhurandhar, Karthikeyan Shanmugam, Kush R. Varshney

    Abstract: Recently, invariant risk minimization (IRM) was proposed as a promising solution to address out-of-distribution (OOD) generalization. However, it is unclear when IRM should be preferred over the widely-employed empirical risk minimization (ERM) framework. In this work, we analyze both these frameworks from the perspective of sample complexity, thus taking a firm step towards answering this importa… ▽ More

    Submitted 19 August, 2022; v1 submitted 30 October, 2020; originally announced October 2020.

  40. arXiv:2010.07938  [pdf, other

    cs.HC cs.LG

    Deciding Fast and Slow: The Role of Cognitive Biases in AI-assisted Decision-making

    Authors: Charvi Rastogi, Yunfeng Zhang, Dennis Wei, Kush R. Varshney, Amit Dhurandhar, Richard Tomsett

    Abstract: Several strands of research have aimed to bridge the gap between artificial intelligence (AI) and human decision-makers in AI-assisted decision-making, where humans are the consumers of AI model predictions and the ultimate decision-makers in high-stakes applications. However, people's perception and understanding are often distorted by their cognitive biases, such as confirmation bias, anchoring… ▽ More

    Submitted 4 April, 2022; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 22 pages, 4 figures

  41. arXiv:2006.11356  [pdf, ps, other

    cs.CY cs.CR

    Trust and Transparency in Contact Tracing Applications

    Authors: Stacy Hobson, Michael Hind, Aleksandra Mojsilovic, Kush R. Varshney

    Abstract: The global outbreak of COVID-19 has led to focus on efforts to manage and mitigate the continued spread of the disease. One of these efforts include the use of contact tracing to identify people who are at-risk of developing the disease through exposure to an infected person. Historically, contact tracing has been primarily manual but given the exponential spread of the virus that causes COVID-19,… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: 9 pages

  42. arXiv:2006.06053  [pdf, other

    cs.LG cs.CY cs.DB stat.ML

    Causal Feature Selection for Algorithmic Fairness

    Authors: Sainyam Galhotra, Karthikeyan Shanmugam, Prasanna Sattigeri, Kush R. Varshney

    Abstract: The use of machine learning (ML) in high-stakes societal decisions has encouraged the consideration of fairness throughout the ML lifecycle. Although data integration is one of the primary steps to generate high quality training data, most of the fairness literature ignores this stage. In this work, we consider fairness in the integration component of data management, aiming to identify features t… ▽ More

    Submitted 31 March, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Full version of the paper at SIGMOD 2022

  43. arXiv:2002.04692  [pdf, other

    cs.LG stat.ML

    Invariant Risk Minimization Games

    Authors: Kartik Ahuja, Karthikeyan Shanmugam, Kush R. Varshney, Amit Dhurandhar

    Abstract: The standard risk minimization paradigm of machine learning is brittle when operating in environments whose test distributions are different from the training distribution due to spurious correlations. Training on data from many environments and finding invariant predictors reduces the effect of spurious features by concentrating models on features that have a causal relationship with the outcome.… ▽ More

    Submitted 18 March, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  44. Joint Optimization of AI Fairness and Utility: A Human-Centered Approach

    Authors: Yunfeng Zhang, Rachel K. E. Bellamy, Kush R. Varshney

    Abstract: Today, AI is increasingly being used in many high-stakes decision-making applications in which fairness is an important concern. Already, there are many examples of AI being biased and making questionable and unfair decisions. The AI research community has proposed many methods to measure and mitigate unwanted biases, but few of them involve inputs from human policy makers. We argue that because d… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: To appear in AIES 2020 proceedings

  45. arXiv:1911.08293  [pdf, ps, other

    cs.CY cs.HC

    Experiences with Improving the Transparency of AI Models and Services

    Authors: Michael Hind, Stephanie Houde, Jacquelyn Martino, Aleksandra Mojsilovic, David Piorkowski, John Richards, Kush R. Varshney

    Abstract: AI models and services are used in a growing number of highstakes areas, resulting in a need for increased transparency. Consistent with this, several proposals for higher quality and more consistent documentation of AI data, models, and systems have emerged. Little is known, however, about the needs of those who would produce or consume these new forms of documentation. Through semi-structured de… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

  46. arXiv:1911.07819  [pdf, other

    cs.CL cs.LG stat.ML

    Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies

    Authors: Shivashankar Subramanian, Ioana Baldini, Sushma Ravichandran, Dmitriy A. Katz-Rogozhnikov, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Kush R. Varshney, Annmarie Wang, Pradeep Mangalath, Laura B. Kleiman

    Abstract: More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the effica… ▽ More

    Submitted 5 December, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

  47. arXiv:1911.03674  [pdf, other

    cs.LG cs.CR stat.ML

    Preservation of Anomalous Subgroups On Machine Learning Transformed Data

    Authors: Samuel C. Maina, Reginald E. Bryant, William O. Goal, Robert-Florian Samoilescu, Kush R. Varshney, Komminist Weldemariam

    Abstract: In this paper, we investigate the effect of machine learning based anonymization on anomalous subgroup preservation. In particular, we train a binary classifier to discover the most anomalous subgroup in a dataset by maximizing the bias between the group's predicted odds ratio from the model and observed odds ratio from the data. We then perform anonymization using a variational autoencoder (VAE)… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: 5 pages, 3 figures, 2 tables, submitted to icassp 2019

  48. arXiv:1910.13983  [pdf, other

    cs.LG cs.CY stat.ML

    DADI: Dynamic Discovery of Fair Information with Adversarial Reinforcement Learning

    Authors: Michiel A. Bakker, Duy Patrick Tu, Humberto Riverón Valdés, Krishna P. Gummadi, Kush R. Varshney, Adrian Weller, Alex Pentland

    Abstract: We introduce a framework for dynamic adversarial discovery of information (DADI), motivated by a scenario where information (a feature set) is used by third parties with unknown objectives. We train a reinforcement learning agent to sequentially acquire a subset of the information while balancing accuracy and fairness of predictors downstream. Based on the set of already acquired features, the age… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019 HCML Workshop

  49. arXiv:1910.13268  [pdf, other

    cs.CV cs.CY stat.ML

    Estimating Skin Tone and Effects on Classification Performance in Dermatology Datasets

    Authors: Newton M. Kinyanjui, Timothy Odonga, Celia Cintas, Noel C. F. Codella, Rameswar Panda, Prasanna Sattigeri, Kush R. Varshney

    Abstract: Recent advances in computer vision and deep learning have led to breakthroughs in the development of automated skin image analysis. In particular, skin cancer classification models have achieved performance higher than trained expert dermatologists. However, no attempt has been made to evaluate the consistency in performance of machine learning models across populations with varying skin tones. In… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 Workshop on Fair ML for Health

  50. arXiv:1910.07870  [pdf, other

    stat.ML cs.CY cs.IT cs.LG

    Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing

    Authors: Sanghamitra Dutta, Dennis Wei, Hazar Yueksel, Pin-Yu Chen, Sijia Liu, Kush R. Varshney

    Abstract: A trade-off between accuracy and fairness is almost taken as a given in the existing literature on fairness in machine learning. Yet, it is not preordained that accuracy should decrease with increased fairness. Novel to this work, we examine fair classification through the lens of mismatched hypothesis testing: trying to find a classifier that distinguishes between two ideal distributions when giv… ▽ More

    Submitted 10 December, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: This paper appears in the Proceedings of the 37th International Conference on Machine Learning, pp. 2803--2813, 2020