Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

Liu, Yin-Long; Feng, Rui; Yuan, Jia-Hong; Ling, Zhen-Hua

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2412.06259 (eess)

[Submitted on 9 Dec 2024]

Title:Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

Authors:Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

View PDF HTML (experimental)

Abstract:Compared to other clinical screening techniques, speech-and-language-based automated Alzheimer's disease (AD) detection methods are characterized by their non-invasiveness, cost-effectiveness, and convenience. Previous studies have demonstrated the efficacy of fine-tuning pre-trained language models (PLMs) for AD detection. However, the objective of this traditional fine-tuning method, which involves inputting only transcripts, is inconsistent with the masked language modeling (MLM) task used during the pre-training phase of PLMs. In this paper, we investigate prompt-based fine-tuning of PLMs, converting the classification task into a MLM task by inserting prompt templates into the transcript inputs. We also explore the impact of incorporating pause information from forced alignment into manual transcripts. Additionally, we compare the performance of various automatic speech recognition (ASR) models and select the Whisper model to generate ASR-based transcripts for comparison with manual transcripts. Furthermore, majority voting and ensemble techniques are applied across different PLMs (BERT and RoBERTa) using different random seeds. Ultimately, we obtain maximum detection accuracy of 95.8% (with mean 87.9%, std 3.3%) using manual transcripts, achieving state-of-the-art performance for AD detection using only transcripts on the ADReSS test set.

Comments:	Accepted by ISCSLP 2024
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2412.06259 [eess.AS]
	(or arXiv:2412.06259v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2412.06259

Submission history

From: Yinlong Liu [view email]
[v1] Mon, 9 Dec 2024 07:18:29 UTC (883 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators