Skip to main content

Showing 1–10 of 10 results for author: Krauth, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2207.01616  [pdf, other

    cs.IR cs.LG stat.ML

    Breaking Feedback Loops in Recommender Systems with Causal Inference

    Authors: Karl Krauth, Yixin Wang, Michael I. Jordan

    Abstract: Recommender systems play a key role in shaping modern web ecosystems. These systems alternate between (1) making recommendations (2) collecting user responses to these recommendations, and (3) retraining the recommendation algorithm based on this feedback. During this process the recommender system influences the user behavioral data that is subsequently used to update it, thus creating a feedback… ▽ More

    Submitted 14 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

  2. arXiv:2207.01609  [pdf, other

    cs.IR cs.LG stat.ML

    Recommendation Systems with Distribution-Free Reliability Guarantees

    Authors: Anastasios N. Angelopoulos, Karl Krauth, Stephen Bates, Yixin Wang, Michael I. Jordan

    Abstract: When building recommendation systems, we seek to output a helpful set of items to the user. Under the hood, a ranking model predicts which of two candidate items is better, and we must distill these pairwise comparisons into the user-facing output. However, a learned ranking model is never perfect, so taking its predictions at face value gives no guarantee that the user-facing output is reliable.… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  3. arXiv:2206.13102  [pdf, other

    cs.GT cs.CY cs.IR cs.LG stat.ML

    Modeling Content Creator Incentives on Algorithm-Curated Platforms

    Authors: Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus, Sarah Dean

    Abstract: Content creators compete for user attention. Their reach crucially depends on algorithmic choices made by developers on online platforms. To maximize exposure, many creators adapt strategically, as evidenced by examples like the sprawling search engine optimization industry. This begets competition for the finite user attention pool. We formalize these dynamics in what we call an exposure game, a… ▽ More

    Submitted 6 July, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: presented at ICLR 2023 (top 5%)

  4. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, AdriĆ  Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  5. arXiv:2106.14979  [pdf, other

    cs.IR cs.LG stat.ML

    On component interactions in two-stage recommender systems

    Authors: Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus

    Abstract: Thanks to their scalability, two-stage recommenders are used by many of today's largest online platforms, including YouTube, LinkedIn, and Pinterest. These systems produce recommendations in two steps: (i) multiple nominators, tuned for low prediction latency, preselect a small subset of candidates from the whole item pool; (ii) a slower but more accurate ranker further narrows down the nominated… ▽ More

    Submitted 12 January, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Appears in the proceedings of the NeurIPS 2021 conference

  6. arXiv:2009.08956  [pdf, other

    cs.IR cs.LG stat.ML

    Exploration in two-stage recommender systems

    Authors: Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus

    Abstract: Two-stage recommender systems are widely adopted in industry due to their scalability and maintainability. These systems produce recommendations in two steps: (i) multiple nominators preselect a small number of items from a large pool using cheap-to-compute item embeddings; (ii) with a richer set of features, a ranker rearranges the nominated items and serves them to the user. A key challenge of t… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    Comments: Published at the REVEAL 2020 workshop (RecSys 2020)

  7. arXiv:2004.14444  [pdf, other

    cs.LG cs.CL stat.ML

    The Effect of Natural Distribution Shift on Question Answering Models

    Authors: John Miller, Karl Krauth, Benjamin Recht, Ludwig Schmidt

    Abstract: We build four new test sets for the Stanford Question Answering Dataset (SQuAD) and evaluate the ability of question-answering systems to generalize to new data. Our first test set is from the original Wikipedia domain and measures the extent to which existing systems overfit the original test set. Despite several years of heavy test set re-use, we find no evidence of adaptive overfitting. The rem… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

  8. arXiv:1905.12842  [pdf, other

    cs.LG math.OC stat.ML

    Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator

    Authors: Karl Krauth, Stephen Tu, Benjamin Recht

    Abstract: We study the sample complexity of approximate policy iteration (PI) for the Linear Quadratic Regulator (LQR), building on a recent line of work using LQR as a testbed to understand the limits of reinforcement learning (RL) algorithms on continuous control tasks. Our analysis quantifies the tension between policy improvement and policy evaluation, and suggests that policy evaluation is the dominant… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

  9. arXiv:1610.05392  [pdf, other

    stat.ML

    AutoGP: Exploring the Capabilities and Limitations of Gaussian Process Models

    Authors: Karl Krauth, Edwin V. Bonilla, Kurt Cutajar, Maurizio Filippone

    Abstract: We investigate the capabilities and limitations of Gaussian process models by jointly exploring three complementary directions: (i) scalable and statistically efficient inference; (ii) flexible kernels; and (iii) objective functions for hyperparameter learning alternative to the marginal likelihood. Our approach outperforms all previously reported GP methods on the standard MNIST dataset; performs… ▽ More

    Submitted 5 March, 2017; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: Edited results on RECTANGLES-IMAGE and related comments; minor additional edits

  10. arXiv:1609.00577  [pdf, other

    stat.ML

    Generic Inference in Latent Gaussian Process Models

    Authors: Edwin V. Bonilla, Karl Krauth, Amir Dezfouli

    Abstract: We develop an automated variational method for inference in models with Gaussian process (GP) priors and general likelihoods. The method supports multiple outputs and multiple latent functions and does not require detailed knowledge of the conditional likelihood, only needing its evaluation as a black-box function. Using a mixture of Gaussians as the variational distribution, we show that the evid… ▽ More

    Submitted 5 November, 2018; v1 submitted 2 September, 2016; originally announced September 2016.

    Comments: 61 pages