Skip to main content

Showing 1–8 of 8 results for author: Cusimano, M

.
  1. arXiv:2504.08764  [pdf

    cs.IR cs.CL

    Evaluation of the phi-3-mini SLM for identification of texts related to medicine, health, and sports injuries

    Authors: Chris Brogly, Saif Rjaibi, Charlotte Liang, Erica Lam, Edward Wang, Adam Levitan, Sarah Paleczny, Michael Cusimano

    Abstract: Small Language Models (SLMs) have potential to be used for automatically labelling and identifying aspects of text data for medicine/health-related purposes from documents and the web. As their resource requirements are significantly lower than Large Language Models (LLMs), these can be deployed potentially on more types of devices. SLMs often are benchmarked on health/medicine-related tasks, such… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  2. arXiv:2503.02389  [pdf, other

    cs.SD cs.LG eess.AS

    Robust detection of overlapping bioacoustic sound events

    Authors: Louis Mahon, Benjamin Hoffman, Logan S James, Maddie Cusimano, Masato Hagiwara, Sarah C Woolley, Olivier Pietquin

    Abstract: We propose a method for accurately detecting bioacoustic sound events that is robust to overlapping events, a common issue in domains such as ethology, ecology and conservation. While standard methods employ a frame-based, multi-label approach, we introduce an onset-based detection method which we name Voxaboxen. It takes inspiration from object detection methods in computer vision, but simultaneo… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  3. arXiv:2410.03427  [pdf, ps, other

    cs.SD eess.AS

    Biodenoising: Animal Vocalization Denoising without Access to Clean Data

    Authors: Marius Miron, Sara Keen, Jen-Yu Liu, Benjamin Hoffman, Masato Hagiwara, Olivier Pietquin, Felix Effenberger, Maddie Cusimano

    Abstract: Animal vocalization denoising is a task similar to human speech enhancement, which is relatively well-studied. In contrast to the latter, it comprises a higher diversity of sound production mechanisms and recording environments, and this higher diversity is a challenge for existing models. Adding to the challenge and in contrast to speech, we lack large and diverse datasets comprising clean vocali… ▽ More

    Submitted 10 March, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: 5 pages, 2 tables

  4. A benchmark for computational analysis of animal behavior, using animal-borne tags

    Authors: Benjamin Hoffman, Maddie Cusimano, Vittorio Baglione, Daniela Canestrari, Damien Chevallier, Dominic L. DeSantis, Lorène Jeantet, Monique A. Ladds, Takuya Maekawa, Vicente Mata-Silva, Víctor Moreno-González, Anthony Pagano, Eva Trapote, Outi Vainio, Antti Vehkaoja, Ken Yoda, Katherine Zacarian, Ari Friedlaender

    Abstract: Animal-borne sensors (`bio-loggers') can record a suite of kinematic and environmental data, which are used to elucidate animal ecophysiology and improve conservation efforts. Machine learning techniques are used for interpreting the large amounts of data recorded by bio-loggers, but there exists no common framework for comparing the different machine learning techniques in this domain. This makes… ▽ More

    Submitted 27 September, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: For associated code repositories, see https://github.com/earthspecies/BEBE/ and https://github.com/earthspecies/BEBE-datasets/ . For data repository, see https://zenodo.org/record/7947104

  5. arXiv:2210.12300  [pdf, other

    cs.SD eess.AS

    BEANS: The Benchmark of Animal Sounds

    Authors: Masato Hagiwara, Benjamin Hoffman, Jen-Yu Liu, Maddie Cusimano, Felix Effenberger, Katie Zacarian

    Abstract: The use of machine learning (ML) based techniques has become increasingly popular in the field of bioacoustics over the last years. Fundamental requirements for the successful application of ML based techniques are curated, agreed upon, high-quality datasets and benchmark tasks to be learned on a given dataset. However, the field of bioacoustics so far lacks such public benchmarks which cover mult… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  6. arXiv:2210.10857  [pdf, other

    cs.SD eess.AS

    Modeling Animal Vocalizations through Synthesizers

    Authors: Masato Hagiwara, Maddie Cusimano, Jen-Yu Liu

    Abstract: Modeling real-world sound is a fundamental problem in the creative use of machine learning and many other fields, including human speech processing and bioacoustics. Transformer-based generative models and some prior work (e.g., DDSP) are known to produce realistic sound, although they have limited control and are hard to interpret. As an alternative, we aim to use modular synthesizers, i.e., comp… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

  7. arXiv:2112.08984  [pdf, other

    eess.AS cs.SD eess.SP physics.app-ph

    Object-based synthesis of scraping and rolling sounds based on non-linear physical constraints

    Authors: Vinayak Agarwal, Maddie Cusimano, James Traer, Josh McDermott

    Abstract: Sustained contact interactions like scraping and rolling produce a wide variety of sounds. Previous studies have explored ways to synthesize these sounds efficiently and intuitively but could not fully mimic the rich structure of real instances of these sounds. We present a novel source-filter model for realistic synthesis of scraping and rolling sounds with physically and perceptually relevant co… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Journal ref: Proceeding of the 24th International Conference on Digital Audio Effects (DAFx-20in21), 2021

  8. arXiv:2012.10517  [pdf, ps, other

    cs.LG

    Machine learning applications using diffusion tensor imaging of human brain: A PubMed literature review

    Authors: Ashirbani Saha, Pantea Fadaiefard, Jessica E. Rabski, Alireza Sadeghian, Michael D. Cusimano

    Abstract: We performed a PubMed search to find 148 papers published between January 2010 and December 2019 related to human brain, Diffusion Tensor Imaging (DTI), and Machine Learning (ML). The studies focused on healthy cohorts (n = 15), mental health disorders (n = 25), tumor (n = 19), trauma (n = 5), dementia (n = 24), developmental disorders (n = 5), movement disorders (n = 9), other neurological disord… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 20 pages, 1 figure, 1 Supplementary file