Skip to main content

Showing 1–10 of 10 results for author: Denton, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16436  [pdf

    cs.SD cs.AI cs.LG eess.AS

    Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics

    Authors: Ben Williams, Bart van Merriënboer, Vincent Dumoulin, Jenny Hamer, Eleni Triantafillou, Abram B. Fleishman, Matthew McKown, Jill E. Munger, Aaron N. Rice, Ashlee Lillis, Clemency E. White, Catherine A. D. Hobbs, Tries B. Razak, Kate E. Jones, Tom Denton

    Abstract: Machine learning has the potential to revolutionize passive acoustic monitoring (PAM) for ecological assessments. However, high annotation and compute costs limit the field's efficacy. Generalizable pretrained networks can overcome these costs, but high-quality pretraining requires vast annotated libraries, limiting its current applicability primarily to bird taxa. Here, we identify the optimum pr… ▽ More

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures

  2. arXiv:2404.06498  [pdf, other

    cs.LG stat.ML

    Simultaneous linear connectivity of neural networks modulo permutation

    Authors: Ekansh Sharma, Devin Kwok, Tom Denton, Daniel M. Roy, David Rolnick, Gintare Karolina Dziugaite

    Abstract: Neural networks typically exhibit permutation symmetries which contribute to the non-convexity of the networks' loss landscapes, since linearly interpolating between two permuted versions of a trained network tends to encounter a high loss barrier. Recent work has argued that permutation symmetries are the only sources of non-convexity, meaning there are essentially no such barriers between traine… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 11 pages, 6 figures

  3. arXiv:2402.15360  [pdf, other

    q-bio.QM cs.LG cs.SD eess.AS

    All Thresholds Barred: Direct Estimation of Call Density in Bioacoustic Data

    Authors: Amanda K. Navine, Tom Denton, Matthew J. Weldy, Patrick J. Hart

    Abstract: Passive acoustic monitoring (PAM) studies generate thousands of hours of audio, which may be used to monitor specific animal populations, conduct broad biodiversity surveys, detect threats such as poachers, and more. Machine learning classifiers for species identification are increasingly being used to process the vast amount of audio generated by bioacoustic surveys, expediting analysis and incre… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 14 pages, 6 figures, 3 tables; submitted to Frontiers in Bird Science; Our Hawaiian PAM dataset and classifier scores, as well as annotation information for the three study species, can be found on Zenodo at https://doi.org/10.5281/zenodo.10581530. The fully annotated Powdermill dataset assembled by Chronister et al. that was used in this study is available at https://doi.org/10.1002/ecy.3329

  4. arXiv:2312.07439  [pdf, other

    cs.LG

    BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics

    Authors: Jenny Hamer, Eleni Triantafillou, Bart van Merriënboer, Stefan Kahl, Holger Klinck, Tom Denton, Vincent Dumoulin

    Abstract: The ability for a machine learning model to cope with differences in training and deployment conditions--e.g. in the presence of distribution shift or the generalization to new classes altogether--is crucial for real-world use cases. However, most empirical work in this area has focused on the image domain with artificial benchmarks constructed to measure individual aspects of generalization. We p… ▽ More

    Submitted 13 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

  5. arXiv:2308.05619  [pdf, other

    stat.ML cs.AI cs.LG

    Updating Clinical Risk Stratification Models Using Rank-Based Compatibility: Approaches for Evaluating and Optimizing Clinician-Model Team Performance

    Authors: Erkin Ötleş, Brian T. Denton, Jenna Wiens

    Abstract: As data shift or new data become available, updating clinical machine learning models may be necessary to maintain or improve performance over time. However, updating a model can introduce compatibility issues when the behavior of the updated model does not align with user expectations, resulting in poor user-model team performance. Existing compatibility measures depend on model decision threshol… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: Conference paper accepted at the 2023 Machine Learning for Healthcare Conference Includes supplemental: 32 pages, 17 figures

  6. Global birdsong embeddings enable superior transfer learning for bioacoustic classification

    Authors: Burooj Ghani, Tom Denton, Stefan Kahl, Holger Klinck

    Abstract: Automated bioacoustic analysis aids understanding and protection of both marine and terrestrial animals and their habitats across extensive spatiotemporal scales, and typically involves analyzing vast collections of acoustic data. With the advent of deep learning models, classification of important signals from these datasets has markedly improved. These models power critical data analyses for res… ▽ More

    Submitted 17 November, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  7. arXiv:2302.06658  [pdf, other

    cs.LG

    In Search for a Generalizable Method for Source Free Domain Adaptation

    Authors: Malik Boudiaf, Tom Denton, Bart van Merriënboer, Vincent Dumoulin, Eleni Triantafillou

    Abstract: Source-free domain adaptation (SFDA) is compelling because it allows adapting an off-the-shelf model to a new domain using only unlabelled data. In this work, we apply existing SFDA techniques to a challenging set of naturally-occurring distribution shifts in bioacoustics, which are very different from the ones commonly studied in computer vision. We find existing methods perform differently relat… ▽ More

    Submitted 24 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  8. arXiv:2207.02262  [pdf, other

    cs.SD cs.LG eess.AS

    Ultra-Low-Bitrate Speech Coding with Pretrained Transformers

    Authors: Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund

    Abstract: Speech coding facilitates the transmission of speech over low-bandwidth networks with minimal distortion. Neural-network based speech codecs have recently demonstrated significant improvements in quality over traditional approaches. While this new generation of codecs is capable of synthesizing high-fidelity speech, their use of recurrent or convolutional layers often restricts their effective rec… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: Proceedings of INTERSPEECH 2022

  9. arXiv:2102.11906  [pdf, other

    eess.AS cs.SD

    Handling Background Noise in Neural Speech Generation

    Authors: Tom Denton, Alejandro Luebs, Felicia S. C. Lim, Andrew Storus, Hengchin Yeh, W. Bastiaan Kleijn, Jan Skoglund

    Abstract: Recent advances in neural-network based generative modeling of speech has shown great potential for speech coding. However, the performance of such models drops when the input is not clean speech, e.g., in the presence of background noise, preventing its use in practical applications. In this paper we examine the reason and discuss methods to overcome this issue. Placing a denoising preprocessing… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: 5 pages, 3 figures, presented at the Asilomar Conference on Signals, Systems, and Computers 2020

  10. arXiv:2102.09660  [pdf, other

    eess.AS cs.SD

    Generative Speech Coding with Predictive Variance Regularization

    Authors: W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh

    Abstract: The recent emergence of machine-learning based generative models for speech suggests a significant reduction in bit rate for speech codecs is possible. However, the performance of generative models deteriorates significantly with the distortions present in real-world input signals. We argue that this deterioration is due to the sensitivity of the maximum likelihood criterion to outliers and the in… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

    MSC Class: 94 ACM Class: I.m