Showing 1–4 of 4 results for author: Buchanan, E K

  1. arXiv:2412.17227  [pdf, other]

    cs.CL cs.LG q-bio.NC

    Brain-to-Text Benchmark '24: Lessons Learned

    Authors: Francis R. Willett, Jingyuan Li, Trung Le, Chaofei Fan, Mingfei Chen, Eli Shlizerman, Yue Chen, Xin Zheng, Tatsuo S. Okubo, Tyler Benster, Hyun Dong Lee, Maxwell Kounga, E. Kelly Buchanan, David Zoltowski, Scott W. Linderman, Jaimie M. Henderson

    Abstract: Speech brain-computer interfaces aim to decipher what a person is trying to say from neural activity alone, restoring communication to people with paralysis who have lost the ability to speak intelligibly. The Brain-to-Text Benchmark '24 and associated competition was created to foster the advancement of decoding algorithms that convert neural activity to text. Here, we summarize the lessons learn…

    Submitted 22 December, 2024; originally announced December 2024.

  2. arXiv:2409.15254  [pdf, other]

    cs.LG cs.AI cs.CL

    Archon: An Architecture Search Framework for Inference-Time Techniques

    Authors: Jon Saad-Falcon, Adrian Gamarra Lafuente, Shlok Natarajan, Nahum Maru, Hristo Todorov, Etash Guha, E. Kelly Buchanan, Mayee Chen, Neel Guha, Christopher Ré, Azalia Mirhoseini

    Abstract: Inference-time techniques are emerging as highly effective tools to enhance large language model (LLM) capabilities. However, best practices for developing systems that combine these techniques remain underdeveloped due to our limited understanding of the utility of individual inference-time techniques and the interactions between them. Additionally, efficiently and automatically searching the spa…

    Submitted 3 October, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

  3. arXiv:2302.00704  [pdf, other]

    cs.LG stat.ML

    Pathologies of Predictive Diversity in Deep Ensembles

    Authors: Taiga Abe, E. Kelly Buchanan, Geoff Pleiss, John P. Cunningham

    Abstract: Classic results establish that encouraging predictive diversity improves performance in ensembles of low-capacity models, e.g. through bagging or boosting. Here we demonstrate that these intuitions do not apply to high-capacity neural network ensembles (deep ensembles), and in fact the opposite is often true. In a large scale study of nearly 600 neural network classification ensembles, we examine…

    Submitted 9 January, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Now published in Transactions on Machine Learning Research

  4. arXiv:2202.06985  [pdf, other]

    cs.LG stat.ML

    Deep Ensembles Work, But Are They Necessary?

    Authors: Taiga Abe, E. Kelly Buchanan, Geoff Pleiss, Richard Zemel, John P. Cunningham

    Abstract: Ensembling neural networks is an effective way to increase accuracy, and can often match the performance of individual larger models. This observation poses a natural question: given the choice between a deep ensemble and a single neural network with similar accuracy, is one preferable over the other? Recent work suggests that deep ensembles may offer distinct benefits beyond predictive power: nam…

    Submitted 13 October, 2022; v1 submitted 14 February, 2022; originally announced February 2022.