T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events

Wang, Shu; Hu, Yuhuang; Liu, Shih-Chii

arXiv:2202.03204 (eess)

[Submitted on 7 Feb 2022]

Abstract: Spiking silicon cochlea sensors encode sound as an asynchronous stream of spikes from different frequency channels. The lack of labeled training datasets for spiking cochleas makes it difficult to train deep neural networks on the outputs of these sensors. This work proposes a self-supervised method called Temporal Network Grafting Algorithm (T-NGA), which grafts a recurrent network pretrained on spectrogram features so that the network works with the cochlea event features. T-NGA training requires only temporally aligned audio spectrograms and event features. Our experiments show that the accuracy of the grafted network was similar to the accuracy of a supervised network trained from scratch on a speech recognition task using events from a software spiking cochlea model. Despite the circuit non-idealities of the spiking silicon cochlea, the grafted network accuracy on the silicon cochlea spike recordings was only about 5% lower than the supervised network accuracy using the N-TIDIGITS18 dataset. T-NGA can train networks to process spiking audio sensor events in the absence of large labeled spike datasets.
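The abstract states that grafting needs only temporally aligned spectrogram and event features, with no labels. The toy sketch below illustrates that general idea under strong assumptions that are not taken from the paper: the "pretrained front end" is a fixed linear map, the "grafted front end" is a trainable linear map, the event features are a synthetic rescaled copy of the spectrogram frame, and the training signal is a plain mean squared error between the two front-end outputs. All function names (`make_aligned_pairs`, `feature_loss`, `train_grafted_front_end`) are hypothetical, and the actual T-NGA layers and loss terms may differ.

```python
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def make_aligned_pairs(num_frames, dim, rng):
    """Toy stand-in for temporally aligned (spectrogram, event) frames."""
    pairs = []
    for _ in range(num_frames):
        spec = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        # Assumption: the event feature is a channel-reversed, rescaled
        # copy of the spectrogram frame (purely synthetic).
        event = [0.5 * (i + 1) * s for i, s in enumerate(reversed(spec))]
        pairs.append((spec, event))
    return pairs


def feature_loss(W_graft, W_pre, pairs):
    """Mean squared distance between grafted and pretrained features."""
    total = 0.0
    for spec, event in pairs:
        target = matvec(W_pre, spec)   # frozen pretrained front end
        pred = matvec(W_graft, event)  # grafted front end
        total += sum((p - t) ** 2 for p, t in zip(pred, target))
    return total / len(pairs)


def train_grafted_front_end(W_pre, pairs, in_dim, lr=0.05, epochs=200):
    """Fit the grafted front end by per-frame gradient descent, no labels."""
    out_dim = len(W_pre)
    W = [[0.0] * in_dim for _ in range(out_dim)]
    for _ in range(epochs):
        for spec, event in pairs:
            target = matvec(W_pre, spec)
            pred = matvec(W, event)
            for i in range(out_dim):
                err = pred[i] - target[i]
                for j in range(in_dim):
                    # Gradient of (pred_i - target_i)^2 w.r.t. W[i][j]
                    W[i][j] -= lr * 2.0 * err * event[j]
    return W


rng = random.Random(0)
dim, feat_dim = 4, 3
# Frozen "pretrained" weights; random here, purely for illustration.
W_pre = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(feat_dim)]
pairs = make_aligned_pairs(50, dim, rng)

W_init = [[0.0] * dim for _ in range(feat_dim)]
loss_before = feature_loss(W_init, W_pre, pairs)
W_graft = train_grafted_front_end(W_pre, pairs, dim)
loss_after = feature_loss(W_graft, W_pre, pairs)
print("feature loss before:", loss_before)
print("feature loss after:", loss_after)
```

In this sketch the pretrained weights are never updated and no class labels appear anywhere; only the aligned frame pairs drive the training, which is the property the abstract emphasizes.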

Comments:	5 pages, 4 figures; accepted at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Singapore, 2022
Subjects:	Audio and Speech Processing (eess.AS); Neural and Evolutionary Computing (cs.NE); Sound (cs.SD)
Cite as:	arXiv:2202.03204 [eess.AS]
	(or arXiv:2202.03204v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2202.03204

Submission history

From: Shu Wang
[v1] Mon, 7 Feb 2022 14:14:14 UTC (1,833 KB)
