Showing 1–13 of 13 results for author: Hancock, B

Searching in archive cs.
  1. arXiv:2407.21783  [pdf, other]

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, et al. (536 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical…

    Submitted 23 November, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  2. arXiv:2311.13028  [pdf, other]

    cs.LG cs.AI cs.DC eess.SP

    DMLR: Data-centric Machine Learning Research -- Past, Present and Future

    Authors: Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, et al. (13 additional authors not shown)

    Abstract: Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods tow…

    Submitted 1 June, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Published in the Journal of Data-centric Machine Learning Research (DMLR) at https://data.mlr.press/assets/pdf/v01-5.pdf

  3. arXiv:2205.02318  [pdf, other]

    cs.LG cs.CL

    Language Models in the Loop: Incorporating Prompting into Weak Supervision

    Authors: Ryan Smith, Jason A. Fries, Braden Hancock, Stephen H. Bach

    Abstract: We propose a new strategy for applying large pre-trained language models to novel tasks when labeled training data is limited. Rather than apply the model in a typical zero-shot or few-shot fashion, we treat the model as the basis for labeling functions in a weak supervision framework. To create a classifier, we first prompt the model to answer multiple distinct queries about an example and define…

    Submitted 4 May, 2022; originally announced May 2022.

  4. arXiv:2004.04069  [pdf]

    cs.CV

    Convolutional neural net face recognition works in non-human-like ways

    Authors: P. J. B. Hancock, R. S. Somai, V. R. Mileva

    Abstract: Convolutional neural networks (CNNs) give state of the art performance in many pattern recognition problems but can be fooled by carefully crafted patterns of noise. We report that CNN face recognition systems also make surprising "errors". We tested six commercial face recognition CNNs and found that they outperform typical human participants on standard face matching tasks. However, they also de…

    Submitted 23 June, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: 8 pages, 2 figures. Submitted to Royal Society Open Science

    ACM Class: I.4; I.5

  5. arXiv:1901.05415  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG stat.ML

    Learning from Dialogue after Deployment: Feed Yourself, Chatbot!

    Authors: Braden Hancock, Antoine Bordes, Pierre-Emmanuel Mazaré, Jason Weston

    Abstract: The majority of conversations a dialogue agent sees over its lifetime occur after it has already been trained and deployed, leaving a vast store of potential training signal untapped. In this work, we propose the self-feeding chatbot, a dialogue agent with the ability to extract new training examples from the conversations it participates in. As our agent engages in conversation, it also estimates…

    Submitted 13 June, 2019; v1 submitted 16 January, 2019; originally announced January 2019.

    Comments: ACL 2019

  6. arXiv:1812.00417  [pdf, other]

    cs.LG stat.ML

    Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale

    Authors: Stephen H. Bach, Daniel Rodriguez, Yintao Liu, Chong Luo, Haidong Shao, Cassandra Xia, Souvik Sen, Alexander Ratner, Braden Hancock, Houman Alborzi, Rahul Kuchhal, Christopher Ré, Rob Malkin

    Abstract: Labeling training data is one of the most costly bottlenecks in developing machine learning-based applications. We present a first-of-its-kind study showing how existing knowledge resources from across an organization can be used as weak supervision in order to bring development time and cost down by an order of magnitude, and introduce Snorkel DryBell, a new weak supervision management system for…

    Submitted 3 June, 2019; v1 submitted 2 December, 2018; originally announced December 2018.

    Journal ref: Proceedings of the International Conference on Management of Data (SIGMOD), 2019

  7. arXiv:1810.02840  [pdf, other]

    stat.ML cs.LG

    Training Complex Models with Multi-Task Weak Supervision

    Authors: Alexander Ratner, Braden Hancock, Jared Dunnmon, Frederic Sala, Shreyash Pandey, Christopher Ré

    Abstract: As machine learning models continue to increase in complexity, collecting large hand-labeled training sets has become one of the biggest roadblocks in practice. Instead, weaker forms of supervision that provide noisier but cheaper labels are often used. However, these weak supervision sources have diverse and unknown accuracies, may output correlated labels, and may label different tasks or apply…

    Submitted 7 December, 2018; v1 submitted 5 October, 2018; originally announced October 2018.

  8. arXiv:1807.00099  [pdf, other]

    cs.CL cs.LG stat.ML

    Generating Titles for Web Tables

    Authors: Braden Hancock, Hongrae Lee, Cong Yu

    Abstract: Descriptive titles provide crucial context for interpreting tables that are extracted from web pages and are a key component of table-based web applications. Prior approaches have attempted to produce titles by selecting existing text snippets associated with the table. These approaches, however, are limited by their dependence on suitable titles existing a priori. In our user study, we observe th…

    Submitted 4 June, 2019; v1 submitted 29 June, 2018; originally announced July 2018.

    Comments: WWW 2019

  9. arXiv:1805.03818  [pdf, other]

    cs.CL

    Training Classifiers with Natural Language Explanations

    Authors: Braden Hancock, Paroma Varma, Stephanie Wang, Martin Bringmann, Percy Liang, Christopher Ré

    Abstract: Training accurate classifiers requires many labels, but each label provides only limited information (one bit for binary classification). In this work, we propose BabbleLabble, a framework for training classifiers in which an annotator provides a natural language explanation for each labeling decision. A semantic parser converts these explanations into programmatic labeling functions that generate…

    Submitted 25 August, 2018; v1 submitted 10 May, 2018; originally announced May 2018.

    Comments: ACL 2018; v4 adds references and link to code

  10. Fonduer: Knowledge Base Construction from Richly Formatted Data

    Authors: Sen Wu, Luke Hsiao, Xiao Cheng, Braden Hancock, Theodoros Rekatsinas, Philip Levis, Christopher Ré

    Abstract: We focus on knowledge base construction (KBC) from richly formatted data. In contrast to KBC from text or tabular data, KBC from richly formatted data aims to extract relations conveyed jointly via textual, structural, tabular, and visual expressions. We introduce Fonduer, a machine-learning-based KBC system for richly formatted data. Fonduer presents a new data model that accounts for three chall…

    Submitted 1 March, 2018; v1 submitted 15 March, 2017; originally announced March 2017.

    Journal ref: SIGMOD 2018

  11. arXiv:1610.02060  [pdf, other]

    cs.SI cs.CY

    After Sandy Hook Elementary: A Year in the Gun Control Debate on Twitter

    Authors: Adrian Benton, Braden Hancock, Glen Coppersmith, John W. Ayers, Mark Dredze

    Abstract: The mass shooting at Sandy Hook elementary school on December 14, 2012 catalyzed a year of active debate and legislation on gun control in the United States. Social media hosted an active public discussion where people expressed their support and opposition to a variety of issues surrounding gun legislation. In this paper, we show how a content-based analysis of Twitter data can provide insights a…

    Submitted 6 October, 2016; originally announced October 2016.

    Comments: Presented at the Data For Good Exchange 2016

  12. Parallel Vectorized Algebraic AES in MATLAB for Rapid Prototyping of Encrypted Sensor Processing Algorithms and Database Analytics

    Authors: Jeremy Kepner, Vijay Gadepally, Braden Hancock, Peter Michaleas, Elizabeth Michel, Mayank Varia

    Abstract: The increasing use of networked sensor systems and networked databases has led to an increased interest in incorporating encryption directly into sensor algorithms and database analytics. MATLAB is the dominant tool for rapid prototyping of sensor algorithms and has extensive database analytics capabilities. The advent of high level and high performance Galois Field mathematical environments allow…

    Submitted 29 June, 2015; originally announced June 2015.

    Comments: 6 pages; accepted to IEEE High Performance Extreme Computing Conference (HPEC) 2015

  13. Computing on Masked Data to improve the Security of Big Data

    Authors: Vijay Gadepally, Braden Hancock, Benjamin Kaiser, Jeremy Kepner, Pete Michaleas, Mayank Varia, Arkady Yerukhimovich

    Abstract: Organizations that make use of large quantities of information require the ability to store and process data from central locations so that the product can be shared or distributed across a heterogeneous group of users. However, recent events underscore the need for improving the security of data stored in such untrusted servers or databases. Advances in cryptographic techniques and database techn…

    Submitted 6 April, 2015; originally announced April 2015.

    Comments: 6 pages; accepted to IEEE HST Conference