Skip to main content

Showing 1–4 of 4 results for author: Manning, C D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, AdriĆ  Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  2. arXiv:2004.01291  [pdf

    cs.DL stat.AP

    Mapping Three Decades of Intellectual Change in Academia

    Authors: Daniel Ramage, Christopher D. Manning, Daniel A. McFarland

    Abstract: Research on the development of science has focused on the creation of multidisciplinary teams. However, while this coming together of people is symmetrical, the ideas, methods, and vocabulary of science have a directional flow. We present a statistical model of the text of dissertation abstracts from 1980 to 2010, revealing for the first time the large-scale flow of language across fields. Results… ▽ More

    Submitted 18 June, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: 10 pages and 6 figures plus appendix of 5 pages and 1 figure

  3. arXiv:1312.6205  [pdf, other

    stat.ML cs.LG

    Relaxations for inference in restricted Boltzmann machines

    Authors: Sida I. Wang, Roy Frostig, Percy Liang, Christopher D. Manning

    Abstract: We propose a relaxation-based approximate inference algorithm that samples near-MAP configurations of a binary pairwise Markov random field. We experiment on MAP inference tasks in several restricted Boltzmann machines. We also use our underlying sampler to estimate the log-partition function of restricted Boltzmann machines and compare against other sampling-based methods.

    Submitted 2 January, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: ICLR 2014 workshop track submission

  4. arXiv:1305.4987  [pdf, other

    cs.AI cs.LG stat.ML

    Robust Logistic Regression using Shift Parameters (Long Version)

    Authors: Julie Tibshirani, Christopher D. Manning

    Abstract: Annotation errors can significantly hurt classifier performance, yet datasets are only growing noisier with the increased use of Amazon Mechanical Turk and techniques like distant supervision that automatically generate labels. In this paper, we present a robust extension of logistic regression that incorporates the possibility of mislabelling directly into the objective. Our model can be trained… ▽ More

    Submitted 29 April, 2014; v1 submitted 21 May, 2013; originally announced May 2013.