-
Speaker Diarization for Low-Resource Languages Through Wav2vec Fine-Tuning
Authors:
Abdulhady Abas Abdullah,
Sarkhel H. Taher Karim,
Sara Azad Ahmed,
Kanar R. Tariq,
Tarik A. Rashid
Abstract:
Speaker diarization is a fundamental task in speech processing that involves dividing an audio stream by speaker. Although state-of-the-art models have advanced performance in high-resource languages, low-resource languages such as Kurdish pose unique challenges due to limited annotated data, multiple dialects and frequent code-switching. In this study, we address these issues by training the Wav2…
▽ More
Speaker diarization is a fundamental task in speech processing that involves dividing an audio stream by speaker. Although state-of-the-art models have advanced performance in high-resource languages, low-resource languages such as Kurdish pose unique challenges due to limited annotated data, multiple dialects and frequent code-switching. In this study, we address these issues by training the Wav2Vec 2.0 self-supervised learning model on a dedicated Kurdish corpus. By leveraging transfer learning, we adapted multilingual representations learned from other languages to capture the phonetic and acoustic characteristics of Kurdish speech. Relative to a baseline method, our approach reduced the diarization error rate by seven point two percent and improved cluster purity by thirteen percent. These findings demonstrate that enhancements to existing models can significantly improve diarization performance for under-resourced languages. Our work has practical implications for developing transcription services for Kurdish-language media and for speaker segmentation in multilingual call centers, teleconferencing and video-conferencing systems. The results establish a foundation for building effective diarization systems in other understudied languages, contributing to greater equity in speech technology.
△ Less
Submitted 23 April, 2025;
originally announced April 2025.
-
Portus: Linking Alloy with SMT-based Finite Model Finding
Authors:
Ryan Dancy,
Nancy A. Day,
Owen Zila,
Khadija Tariq,
Joseph Poremba
Abstract:
Alloy is a well-known, formal, declarative language for modelling systems early in the software development process. Currently, it uses the Kodkod library as a back-end for finite model finding. Kodkod translates the model to a SAT problem; however, this method can often handle only problems of fairly low-size sets and is inherently finite. We present Portus, a method for translating Alloy into an…
▽ More
Alloy is a well-known, formal, declarative language for modelling systems early in the software development process. Currently, it uses the Kodkod library as a back-end for finite model finding. Kodkod translates the model to a SAT problem; however, this method can often handle only problems of fairly low-size sets and is inherently finite. We present Portus, a method for translating Alloy into an equivalent many-sorted first-order logic problem (MSFOL). Once in MSFOL, the problem can be evaluated by an SMT-based finite model finding method implemented in the Fortress library, creating an alternative back-end for the Alloy Analyzer. Fortress converts the MSFOL finite model finding problem into the logic of uninterpreted functions with equality (EUF), a decidable fragment of first-order logic that is well-supported in many SMT solvers. We compare the performance of Portus with Kodkod on a corpus of 64 Alloy models written by experts. Our method is fully integrated into the Alloy Analyzer.
△ Less
Submitted 23 May, 2025; v1 submitted 24 November, 2024;
originally announced November 2024.
-
Measurements of the Higgs Boson Coupling Properties to Fermions with the ATLAS Detector
Authors:
Khuram Tariq
Abstract:
Testing the Yukawa couplings of the Higgs boson to quarks and leptons is important to understand the origin of fermion masses. These proceedings will review several measurements of Higgs boson decays to two bottom quarks or two tau leptons, searches for Higgs boson decays to two charm quarks or two muons, as well as direct constraints on the charm-Yukawa coupling. The production of Higgs boson in…
▽ More
Testing the Yukawa couplings of the Higgs boson to quarks and leptons is important to understand the origin of fermion masses. These proceedings will review several measurements of Higgs boson decays to two bottom quarks or two tau leptons, searches for Higgs boson decays to two charm quarks or two muons, as well as direct constraints on the charm-Yukawa coupling. The production of Higgs boson in association with top quarks will also be discussed. These analyses are based on 139 fb$^{-1}$ of Run-2 data from proton-proton collisions collected by the ATLAS experiment at the Large Hadron Collider (LHC) with a center-of-mass energy of 13 TeV.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.