Skip to main content

Showing 1–43 of 43 results for author: Caruana, R

.
  1. arXiv:2410.04560  [pdf, other

    cs.LG stat.ML

    GAMformer: In-Context Learning for Generalized Additive Models

    Authors: Andreas Mueller, Julien Siems, Harsha Nori, David Salinas, Arber Zela, Rich Caruana, Frank Hutter

    Abstract: Generalized Additive Models (GAMs) are widely recognized for their ability to create fully interpretable machine learning models for tabular data. Traditionally, training GAMs involves iterative learning algorithms, such as splines, boosted trees, or neural networks, which refine the additive components through repeated error reduction. In this paper, we introduce GAMformer, the first method to le… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 20 pages, 12 figures

  2. arXiv:2404.06209  [pdf, other

    cs.LG cs.AI cs.CL

    Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

    Authors: Sebastian Bordt, Harsha Nori, Vanessa Rodrigues, Besmira Nushi, Rich Caruana

    Abstract: While many have shown how Large Language Models (LLMs) can be applied to a diverse set of tasks, the critical issues of data contamination and memorization are often glossed over. In this work, we address this concern for tabular data. Specifically, we introduce a variety of different techniques to assess whether a language model has seen a tabular dataset during training. This investigation revea… ▽ More

    Submitted 4 December, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: COLM camera ready, fix typo

  3. arXiv:2403.06644  [pdf, other

    cs.LG cs.CL

    Elephants Never Forget: Testing Language Models for Memorization of Tabular Data

    Authors: Sebastian Bordt, Harsha Nori, Rich Caruana

    Abstract: While many have shown how Large Language Models (LLMs) can be applied to a diverse set of tasks, the critical issues of data contamination and memorization are often glossed over. In this work, we address this concern for tabular data. Starting with simple qualitative tests for whether an LLM knows the names and values of features, we introduce a variety of different techniques to assess the degre… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Table Representation Learning Workshop at NeurIPS 2023

  4. arXiv:2402.14474  [pdf, other

    cs.LG cs.CL

    Data Science with LLMs and Interpretable Models

    Authors: Sebastian Bordt, Ben Lengerich, Harsha Nori, Rich Caruana

    Abstract: Recent years have seen important advances in the building of interpretable models, machine learning models that are designed to be easily understood by humans. In this work, we show that large language models (LLMs) are remarkably good at working with interpretable models, too. In particular, we show that LLMs can describe, interpret, and debug Generalized Additive Models (GAMs). Combining the fle… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: XAI4Sci Workshop at AAAI-24

  5. arXiv:2402.01761  [pdf, other

    cs.CL cs.AI cs.LG

    Rethinking Interpretability in the Era of Large Language Models

    Authors: Chandan Singh, Jeevana Priya Inala, Michel Galley, Rich Caruana, Jianfeng Gao

    Abstract: Interpretable machine learning has exploded as an area of interest over the last decade, sparked by the rise of increasingly large datasets and deep neural networks. Simultaneously, large language models (LLMs) have demonstrated remarkable capabilities across a wide array of tasks, offering a chance to rethink opportunities in interpretable machine learning. Notably, the capability to explain in n… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: 7 pages

  6. arXiv:2311.13454  [pdf, other

    cs.LG cs.CR cs.NE stat.ML

    Explaining high-dimensional text classifiers

    Authors: Odelia Melamed, Rich Caruana

    Abstract: Explainability has become a valuable tool in the last few years, helping humans better understand AI-guided decisions. However, the classic explainability tools are sometimes quite limited when considering high-dimensional inputs and neural network classifiers. We present a new explainability method using theoretically proven high-dimensional properties in neural network classifiers. We present tw… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Accepted to "XAI in Action" workshop @ NeurIPS 2023

  7. Interpretable Predictive Models to Understand Risk Factors for Maternal and Fetal Outcomes

    Authors: Tomas M. Bosschieter, Zifei Xu, Hui Lan, Benjamin J. Lengerich, Harsha Nori, Ian Painter, Vivienne Souter, Rich Caruana

    Abstract: Although most pregnancies result in a good outcome, complications are not uncommon and can be associated with serious implications for mothers and babies. Predictive modeling has the potential to improve outcomes through better understanding of risk factors, heightened surveillance for high risk patients, and more timely and appropriate interventions, thereby helping obstetricians deliver better c… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 25 pages (including appendix and references), 12 figures, 2 tables. J Healthc Inform Res (2023)

  8. arXiv:2308.01157  [pdf, other

    stat.ML cs.AI cs.LG

    LLMs Understand Glass-Box Models, Discover Surprises, and Suggest Repairs

    Authors: Benjamin J. Lengerich, Sebastian Bordt, Harsha Nori, Mark E. Nunnally, Yin Aphinyanaphongs, Manolis Kellis, Rich Caruana

    Abstract: We show that large language models (LLMs) are remarkably good at working with interpretable models that decompose complex outcomes into univariate graph-represented components. By adopting a hierarchical approach to reasoning, LLMs can provide comprehensive model-level summaries without ever requiring the entire model to fit in context. This approach enables LLMs to apply their extensive backgroun… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  9. arXiv:2306.17337  [pdf, other

    cs.AI

    Diagnosis Uncertain Models For Medical Risk Prediction

    Authors: Alexander Peysakhovich, Rich Caruana, Yin Aphinyanaphongs

    Abstract: We consider a patient risk models which has access to patient features such as vital signs, lab values, and prior history but does not have access to a patient's diagnosis. For example, this occurs in a model deployed at intake time for triage purposes. We show that such `all-cause' risk models have good generalization across diagnoses but have a predictable failure mode. When the same lab/vital/h… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  10. arXiv:2305.16526  [pdf, other

    cs.CV cond-mat.quant-gas cs.LG

    Extending Explainable Boosting Machines to Scientific Image Data

    Authors: Daniel Schug, Sai Yerramreddy, Rich Caruana, Craig Greenberg, Justyna P. Zwolak

    Abstract: As the deployment of computer vision technology becomes increasingly common in science, the need for explanations of the system and its output has become a focus of great concern. Driven by the pressing need for interpretable models in science, we propose the use of Explainable Boosting Machines (EBMs) for scientific image data. Inspired by an important application underpinning the development of… ▽ More

    Submitted 30 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 7 pages, 2 figures

    Journal ref: Proceedings of the Machine Learning and the Physical Sciences Workshop at NeurIPS 2023, New Orleans, LA

  11. arXiv:2304.11749  [pdf, other

    cs.LG

    Missing Values and Imputation in Healthcare Data: Can Interpretable Machine Learning Help?

    Authors: Zhi Chen, Sarah Tan, Urszula Chajewska, Cynthia Rudin, Rich Caruana

    Abstract: Missing values are a fundamental problem in data science. Many datasets have missing values that must be properly handled because the way missing values are treated can have large impact on the resulting machine learning model. In medical applications, the consequences may affect healthcare decisions. There are many methods in the literature for dealing with missing values, including state-of-the-… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: Preprint of a paper accepted by CHIL 2023

  12. arXiv:2302.14165  [pdf, other

    cs.LG cs.AI cs.HC

    GAM Coach: Towards Interactive and User-centered Algorithmic Recourse

    Authors: Zijie J. Wang, Jennifer Wortman Vaughan, Rich Caruana, Duen Horng Chau

    Abstract: Machine learning (ML) recourse techniques are increasingly used in high-stakes domains, providing end users with actions to alter ML predictions, but they assume ML developers understand what input variables can be changed. However, a recourse plan's actionability is subjective and unlikely to match developers' expectations completely. We present GAM Coach, a novel open-source system that adapts i… ▽ More

    Submitted 28 February, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted to CHI 2023. 20 pages, 12 figures. For a demo video, see https://youtu.be/ubacP34H9XE. For a live demo, visit https://poloclub.github.io/gam-coach/

  13. arXiv:2211.08991  [pdf, other

    cs.LG cs.CY

    Estimating Discontinuous Time-Varying Risk Factors and Treatment Benefits for COVID-19 with Interpretable ML

    Authors: Benjamin Lengerich, Mark E. Nunnally, Yin Aphinyanaphongs, Rich Caruana

    Abstract: Treatment protocols, disease understanding, and viral characteristics changed over the course of the COVID-19 pandemic; as a result, the risks associated with patient comorbidities and biomarkers also changed. We add to the conversation regarding inflammation, hemostasis and vascular function in COVID-19 by performing a time-varying observational analysis of over 4000 patients hospitalized for COV… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 14 pages

  14. arXiv:2209.11799  [pdf, other

    cs.AI cs.CL cs.LG stat.ME

    Augmenting Interpretable Models with LLMs during Training

    Authors: Chandan Singh, Armin Askari, Rich Caruana, Jianfeng Gao

    Abstract: Recent large language models (LLMs) have demonstrated remarkable prediction performance for a growing array of tasks. However, their proliferation into high-stakes domains (e.g. medicine) and compute-limited settings has created a burgeoning need for interpretability and efficiency. We address this need by proposing Augmented Interpretable Models (Aug-imodels), a framework for leveraging the knowl… ▽ More

    Submitted 24 April, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Journal ref: Nature Communications, 2023

  15. arXiv:2207.05322  [pdf, other

    cs.LG stat.AP

    Using Interpretable Machine Learning to Predict Maternal and Fetal Outcomes

    Authors: Tomas M. Bosschieter, Zifei Xu, Hui Lan, Benjamin J. Lengerich, Harsha Nori, Kristin Sitcov, Vivienne Souter, Rich Caruana

    Abstract: Most pregnancies and births result in a good outcome, but complications are not uncommon and when they do occur, they can be associated with serious implications for mothers and babies. Predictive modeling has the potential to improve outcomes through better understanding of risk factors, heightened surveillance, and more timely and appropriate interventions, thereby helping obstetricians deliver… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: DSHealth at SIGKDD 2022, 5 pages, 3 figures

  16. arXiv:2206.15465  [pdf, other

    cs.LG cs.AI cs.HC

    Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values

    Authors: Zijie J. Wang, Alex Kale, Harsha Nori, Peter Stella, Mark E. Nunnally, Duen Horng Chau, Mihaela Vorvoreanu, Jennifer Wortman Vaughan, Rich Caruana

    Abstract: Machine learning (ML) interpretability techniques can reveal undesirable patterns in data that models exploit to make predictions--potentially causing harms once deployed. However, how to take action to address these patterns is not always clear. In a collaboration between ML and human-computer interaction researchers, physicians, and data scientists, we develop GAM Changer, the first interactive… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Accepted at KDD 2022. 11 pages, 19 figures. For a demo video, see https://youtu.be/D6whtfInqTc. For a live demo, visit https://interpret.ml/gam-changer

  17. arXiv:2202.11043  [pdf, other

    stat.ML cs.CR cs.LG econ.EM

    Differentially Private Estimation of Heterogeneous Causal Effects

    Authors: Fengshi Niu, Harsha Nori, Brian Quistorff, Rich Caruana, Donald Ngwe, Aadharsh Kannan

    Abstract: Estimating heterogeneous treatment effects in domains such as healthcare or social science often involves sensitive data where protecting privacy is important. We introduce a general meta-algorithm for estimating conditional average treatment effects (CATE) with differential privacy (DP) guarantees. Our meta-algorithm can work with simple, single-stage CATE estimators such as S-learner and more co… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  18. arXiv:2112.03245  [pdf, other

    cs.LG cs.AI cs.HC

    GAM Changer: Editing Generalized Additive Models with Interactive Visualization

    Authors: Zijie J. Wang, Alex Kale, Harsha Nori, Peter Stella, Mark Nunnally, Duen Horng Chau, Mihaela Vorvoreanu, Jennifer Wortman Vaughan, Rich Caruana

    Abstract: Recent strides in interpretable machine learning (ML) research reveal that models exploit undesirable patterns in the data to make predictions, which potentially causes harms in deployment. However, it is unclear how we can fix these models. We present our ongoing work, GAM Changer, an open-source interactive system to help data scientists and domain experts easily and responsibly edit their Gener… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 7 pages, 15 figures, accepted to the Research2Clinics workshop at NeurIPS 2021. For a demo video, see https://youtu.be/2gVSoPoSeJ8. For a live demo, visit https://interpret.ml/gam-changer/

  19. arXiv:2110.15165  [pdf, other

    cs.LG

    Extracting Expert's Goals by What-if Interpretable Modeling

    Authors: Chun-Hao Chang, George Alexandru Adam, Rich Caruana, Anna Goldenberg

    Abstract: Although reinforcement learning (RL) has tremendous success in many fields, applying RL to real-world settings such as healthcare is challenging when the reward is hard to specify and no exploration is allowed. In this work, we focus on recovering clinicians' rewards in treating patients. We incorporate the what-if reasoning to explain the clinician's treatments based on their potential future out… ▽ More

    Submitted 13 June, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

  20. arXiv:2106.09680  [pdf, other

    cs.LG cs.CR

    Accuracy, Interpretability, and Differential Privacy via Explainable Boosting

    Authors: Harsha Nori, Rich Caruana, Zhiqi Bu, Judy Hanwen Shen, Janardhan Kulkarni

    Abstract: We show that adding differential privacy to Explainable Boosting Machines (EBMs), a recent method for training interpretable ML models, yields state-of-the-art accuracy while protecting privacy. Our experiments on multiple classification and regression datasets show that DP-EBM models suffer surprisingly little accuracy loss even with strong differential privacy guarantees. In addition to high acc… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: To be published in ICML 2021. 12 pages, 6 figures

  21. arXiv:2106.01613  [pdf, other

    cs.LG

    NODE-GAM: Neural Generalized Additive Model for Interpretable Deep Learning

    Authors: Chun-Hao Chang, Rich Caruana, Anna Goldenberg

    Abstract: Deployment of machine learning models in real high-risk settings (e.g. healthcare) often depends not only on the model's accuracy but also on its fairness, robustness, and interpretability. Generalized Additive Models (GAMs) are a class of interpretable models with a long history of use in these high-risk domains, but they lack desirable features of deep learning such as differentiability and scal… ▽ More

    Submitted 16 March, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: 2022 ICLR Spotlight paper

  22. arXiv:2102.05107  [pdf, other

    physics.ao-ph cs.LG

    Sub-seasonal forecasting with a large ensemble of deep-learning weather prediction models

    Authors: Jonathan A. Weyn, Dale R. Durran, Rich Caruana, Nathaniel Cresswell-Clay

    Abstract: We present an ensemble prediction system using a Deep Learning Weather Prediction (DLWP) model that recursively predicts key atmospheric variables with six-hour time resolution. This model uses convolutional neural networks (CNNs) on a cubed sphere grid to produce global forecasts. The approach is computationally efficient, requiring just three minutes on a single GPU to produce a 320-member set o… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: Submitted to Journal of Advances in Modeling Earth Systems

    Journal ref: Sub-Seasonal Forecasting With a Large Ensemble of Deep-Learning Weather Prediction Models. Journal of Advances in Modeling Earth Systems, 2021

  23. arXiv:2007.00823  [pdf, other

    cs.LG stat.ML

    Dropout as a Regularizer of Interaction Effects

    Authors: Benjamin Lengerich, Eric P. Xing, Rich Caruana

    Abstract: We examine Dropout through the perspective of interactions. This view provides a symmetry to explain Dropout: given $N$ variables, there are ${N \choose k}$ possible sets of $k$ variables to form an interaction (i.e. $\mathcal{O}(N^k)$); conversely, the probability an interaction of $k$ variables survives Dropout at rate $p$ is $(1-p)^k$ (decaying with $k$). These rates effectively cancel, and so… ▽ More

    Submitted 17 October, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

  24. arXiv:2006.06466  [pdf, other

    cs.LG stat.ML

    How Interpretable and Trustworthy are GAMs?

    Authors: Chun-Hao Chang, Sarah Tan, Ben Lengerich, Anna Goldenberg, Rich Caruana

    Abstract: Generalized additive models (GAMs) have become a leading modelclass for interpretable machine learning. However, there are many algorithms for training GAMs, and these can learn different or even contradictory models, while being equally accurate. Which GAM should we trust? In this paper, we quantitatively and qualitatively investigate a variety of GAM algorithms on real and simulated datasets. We… ▽ More

    Submitted 6 June, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted in 2021 KDD

  25. arXiv:2004.13912  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Additive Models: Interpretable Machine Learning with Neural Nets

    Authors: Rishabh Agarwal, Levi Melnick, Nicholas Frosst, Xuezhou Zhang, Ben Lengerich, Rich Caruana, Geoffrey Hinton

    Abstract: Deep neural networks (DNNs) are powerful black-box predictors that have achieved impressive performance on a wide variety of tasks. However, their accuracy comes at the cost of intelligibility: it is usually unclear how they make their decisions. This hinders their applicability to high stakes decision-making domains such as healthcare. We propose Neural Additive Models (NAMs) which combine some o… ▽ More

    Submitted 24 October, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: Spotlight (Top 3%) at NeurIPS 2021

  26. arXiv:2003.11927  [pdf, other

    physics.ao-ph cs.LG stat.ML

    Improving data-driven global weather prediction using deep convolutional neural networks on a cubed sphere

    Authors: Jonathan A. Weyn, Dale R. Durran, Rich Caruana

    Abstract: We present a significantly-improved data-driven global weather forecasting framework using a deep convolutional neural network (CNN) to forecast several basic atmospheric variables on a global grid. New developments in this framework include an offline volume-conservative mapping to a cubed-sphere grid, improvements to the CNN architecture, and the minimization of the loss function over multiple s… ▽ More

    Submitted 15 March, 2020; originally announced March 2020.

    Comments: Manuscript submitted to Journal of Advances in Modeling Earth Systems

  27. arXiv:1911.04974  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Purifying Interaction Effects with the Functional ANOVA: An Efficient Algorithm for Recovering Identifiable Additive Models

    Authors: Benjamin Lengerich, Sarah Tan, Chun-Hao Chang, Giles Hooker, Rich Caruana

    Abstract: Models which estimate main effects of individual variables alongside interaction effects have an identifiability challenge: effects can be freely moved between main effects and interaction effects without changing the model prediction. This is a critical problem for interpretability because it permits "contradictory" models to represent the same function. To solve this problem, we propose pure int… ▽ More

    Submitted 1 May, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: AISTATS 2020

  28. arXiv:1909.09223  [pdf, other

    cs.LG stat.ML

    InterpretML: A Unified Framework for Machine Learning Interpretability

    Authors: Harsha Nori, Samuel Jenkins, Paul Koch, Rich Caruana

    Abstract: InterpretML is an open-source Python package which exposes machine learning interpretability algorithms to practitioners and researchers. InterpretML exposes two types of interpretability - glassbox models, which are machine learning models designed for interpretability (ex: linear models, rule lists, generalized additive models), and blackbox explainability techniques for explaining existing syst… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

  29. arXiv:1905.13360  [pdf, other

    cs.LG stat.ML

    Efficient Forward Architecture Search

    Authors: Hanzhang Hu, John Langford, Rich Caruana, Saurajit Mukherjee, Eric Horvitz, Debadeepta Dey

    Abstract: We propose a neural architecture search (NAS) algorithm, Petridish, to iteratively add shortcut connections to existing network layers. The added shortcut connections effectively perform gradient boosting on the augmented layers. The proposed algorithm is motivated by the feature selection algorithm forward stage-wise linear regression, since we consider NAS as a generalization of feature selectio… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: preprint

  30. arXiv:1810.09092  [pdf, other

    cs.LG stat.ML

    Axiomatic Interpretability for Multiclass Additive Models

    Authors: Xuezhou Zhang, Sarah Tan, Paul Koch, Yin Lou, Urszula Chajewska, Rich Caruana

    Abstract: Generalized additive models (GAMs) are favored in many regression and binary classification problems because they are able to fit complex, nonlinear functions while still remaining interpretable. In the first part of this paper, we generalize a state-of-the-art GAM learning algorithm based on boosted trees to the multiclass setting, and show that this multiclass algorithm outperforms existing GAM… ▽ More

    Submitted 30 May, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: KDD 2019

  31. Considerations When Learning Additive Explanations for Black-Box Models

    Authors: Sarah Tan, Giles Hooker, Paul Koch, Albert Gordo, Rich Caruana

    Abstract: Many methods to explain black-box models, whether local or global, are additive. In this paper, we study global additive explanations for non-additive models, focusing on four explanation methods: partial dependence, Shapley explanations adapted to a global setting, distilled additive explanations, and gradient-based explanations. We show that different explanation methods characterize non-additiv… ▽ More

    Submitted 31 July, 2023; v1 submitted 25 January, 2018; originally announced January 2018.

    Comments: Published at Machine Learning (2023). Previously titled "Learning Global Additive Explanations for Neural Nets Using Model Distillation". A short version was presented at NeurIPS 2018 Machine Learning for Health Workshop

  32. arXiv:1711.09889   

    stat.ML

    Proceedings of NIPS 2017 Symposium on Interpretable Machine Learning

    Authors: Andrew Gordon Wilson, Jason Yosinski, Patrice Simard, Rich Caruana, William Herlands

    Abstract: This is the Proceedings of NIPS 2017 Symposium on Interpretable Machine Learning, held in Long Beach, California, USA on December 7, 2017

    Submitted 11 December, 2017; v1 submitted 26 November, 2017; originally announced November 2017.

    Comments: 25 papers

  33. arXiv:1710.06169  [pdf, other

    stat.ML cs.AI cs.LG

    Distill-and-Compare: Auditing Black-Box Models Using Transparent Model Distillation

    Authors: Sarah Tan, Rich Caruana, Giles Hooker, Yin Lou

    Abstract: Black-box risk scoring models permeate our lives, yet are typically proprietary or opaque. We propose Distill-and-Compare, a model distillation and comparison approach to audit such models. To gain insight into black-box models, we treat them as teachers, training transparent student models to mimic the risk scores assigned by black-box models. We compare the student model trained with distillatio… ▽ More

    Submitted 11 October, 2018; v1 submitted 17 October, 2017; originally announced October 2017.

    Comments: Camera-ready version for AAAI/ACM AIES 2018. Data and pseudocode at https://github.com/shftan/auditblackbox. Previously titled "Detecting Bias in Black-Box Models Using Transparent Model Distillation". A short version was presented at NIPS 2017 Symposium on Interpretable Machine Learning

  34. arXiv:1707.01154  [pdf, other

    cs.AI

    Interpretable & Explorable Approximations of Black Box Models

    Authors: Himabindu Lakkaraju, Ece Kamar, Rich Caruana, Jure Leskovec

    Abstract: We propose Black Box Explanations through Transparent Approximations (BETA), a novel model agnostic framework for explaining the behavior of any black-box classifier by simultaneously optimizing for fidelity to the original model and interpretability of the explanation. To this end, we develop a novel objective function which allows us to learn (with optimality guarantees), a small number of compa… ▽ More

    Submitted 4 July, 2017; originally announced July 2017.

    Comments: Presented as a poster at the 2017 Workshop on Fairness, Accountability, and Transparency in Machine Learning

  35. arXiv:1610.09064  [pdf, other

    cs.AI

    Identifying Unknown Unknowns in the Open World: Representations and Policies for Guided Exploration

    Authors: Himabindu Lakkaraju, Ece Kamar, Rich Caruana, Eric Horvitz

    Abstract: Predictive models deployed in the real world may assign incorrect labels to instances with high confidence. Such errors or unknown unknowns are rooted in model incompleteness, and typically arise because of the mismatch between training data and the cases encountered at test time. As the models are blind to such errors, input from an oracle is needed to identify these failures. In this paper, we f… ▽ More

    Submitted 10 December, 2016; v1 submitted 27 October, 2016; originally announced October 2016.

    Comments: To appear in AAAI 2017; Presented at NIPS Workshop on Reliability in ML, 2016

  36. arXiv:1603.05691  [pdf, other

    stat.ML cs.LG

    Do Deep Convolutional Nets Really Need to be Deep and Convolutional?

    Authors: Gregor Urban, Krzysztof J. Geras, Samira Ebrahimi Kahou, Ozlem Aslan, Shengjie Wang, Rich Caruana, Abdelrahman Mohamed, Matthai Philipose, Matt Richardson

    Abstract: Yes, they do. This paper provides the first empirical demonstration that deep convolutional models really need to be both deep and convolutional, even when trained with methods such as distillation that allow small or shallow models of high accuracy to be trained. Although previous research showed that shallow feed-forward nets sometimes can learn the complex functions previously learned by deep n… ▽ More

    Submitted 3 March, 2017; v1 submitted 17 March, 2016; originally announced March 2016.

  37. arXiv:1602.01137  [pdf, other

    cs.IR

    A Dual Embedding Space Model for Document Ranking

    Authors: Bhaskar Mitra, Eric Nalisnick, Nick Craswell, Rich Caruana

    Abstract: A fundamental goal of search engines is to identify, given a query, documents that have relevant text. This is intrinsically difficult because the query and the document may use different vocabulary, or the document may contain query words without being relevant. We investigate neural word embeddings as a source of evidence in document ranking. We train a word2vec embedding model on a large unlabe… ▽ More

    Submitted 2 February, 2016; originally announced February 2016.

    Comments: This paper is an extended evaluation and analysis of the model proposed in a poster to appear in WWW'16, April 11 - 15, 2016, Montreal, Canada

  38. arXiv:1511.06433  [pdf, other

    cs.LG

    Blending LSTMs into CNNs

    Authors: Krzysztof J. Geras, Abdel-rahman Mohamed, Rich Caruana, Gregor Urban, Shengjie Wang, Ozlem Aslan, Matthai Philipose, Matthew Richardson, Charles Sutton

    Abstract: We consider whether deep convolutional networks (CNNs) can represent decision functions with similar accuracy as recurrent networks such as LSTMs. First, we show that a deep CNN with an architecture inspired by the models recently introduced in image recognition can yield better accuracy than previous convolutional and LSTM networks on the standard 309h Switchboard automatic speech recognition tas… ▽ More

    Submitted 14 September, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

  39. arXiv:1407.4729  [pdf, other

    stat.ME cs.LG stat.ML

    Sparse Partially Linear Additive Models

    Authors: Yin Lou, Jacob Bien, Rich Caruana, Johannes Gehrke

    Abstract: The generalized partially linear additive model (GPLAM) is a flexible and interpretable approach to building predictive models. It combines features in an additive manner, allowing each to have either a linear or nonlinear effect on the response. However, the choice of which features to treat as linear or nonlinear is typically assumed known. Thus, to make a GPLAM a viable approach in situations i… ▽ More

    Submitted 27 March, 2018; v1 submitted 17 July, 2014; originally announced July 2014.

    Comments: Corrected typos

  40. arXiv:1312.6184  [pdf, other

    cs.LG cs.NE

    Do Deep Nets Really Need to be Deep?

    Authors: Lei Jimmy Ba, Rich Caruana

    Abstract: Currently, deep neural networks are the state of the art on problems such as speech recognition and computer vision. In this extended abstract, we show that shallow feed-forward networks can learn the complex functions previously learned by deep nets and achieve accuracies previously only achievable with deep models. Moreover, in some cases the shallow neural nets can learn these deep functions us… ▽ More

    Submitted 10 October, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: final revision coming soon

  41. arXiv:1311.7184  [pdf, other

    stat.ML cs.LG

    Using Multiple Samples to Learn Mixture Models

    Authors: Jason D Lee, Ran Gilad-Bachrach, Rich Caruana

    Abstract: In the mixture models problem it is assumed that there are $K$ distributions $θ_{1},\ldots,θ_{K}$ and one gets to observe a sample from a mixture of these distributions with unknown coefficients. The goal is to associate instances with their generating distributions, or to identify the parameters of the hidden distributions. In this work we make the assumption that we have access to several sample… ▽ More

    Submitted 27 November, 2013; originally announced November 2013.

    Comments: Published in Neural Information Processing Systems (NIPS) 2013

  42. arXiv:1304.2733  [pdf

    cs.AI

    The Automatic Training of Rule Bases that Use Numerical Uncertainty Representations

    Authors: Richard A. Caruana

    Abstract: The use of numerical uncertainty representations allows better modeling of some aspects of human evidential reasoning. It also makes knowledge acquisition and system development, test, and modification more difficult. We propose that where possible, the assignment and/or refinement of rule weights should be performed automatically. We present one approach to performing this training - numerical op… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Third Conference on Uncertainty in Artificial Intelligence (UAI1987)

    Report number: UAI-P-1987-PG-198-204

  43. arXiv:1207.1403  [pdf

    cs.LG stat.ML

    Obtaining Calibrated Probabilities from Boosting

    Authors: Alexandru Niculescu-Mizil, Richard A. Caruana

    Abstract: Boosted decision trees typically yield good accuracy, precision, and ROC area. However, because the outputs from boosting are not well calibrated posterior probabilities, boosting yields poor squared error and cross-entropy. We empirically demonstrate why AdaBoost predicts distorted probabilities and examine three calibration methods for correcting this distortion: Platt Scaling, Isotonic Regressi… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-413-420