A Multisensory Learning Architecture for Rotation-invariant Object Recognition

Kirtay, Murat; Schillaci, Guido; Hafner, Verena V.

Computer Science > Robotics

arXiv:2009.06292 (cs)

[Submitted on 14 Sep 2020]

Title:A Multisensory Learning Architecture for Rotation-invariant Object Recognition

Authors:Murat Kirtay, Guido Schillaci, Verena V. Hafner

View PDF

Abstract:This study presents a multisensory machine learning architecture for object recognition by employing a novel dataset that was constructed with the iCub robot, which is equipped with three cameras and a depth sensor. The proposed architecture combines convolutional neural networks to form representations (i.e., features) for grayscaled color images and a multi-layer perceptron algorithm to process depth data. To this end, we aimed to learn joint representations of different modalities (e.g., color and depth) and employ them for recognizing objects. We evaluate the performance of the proposed architecture by benchmarking the results obtained with the models trained separately with the input of different sensors and a state-of-the-art data fusion technique, namely decision level fusion. The results show that our architecture improves the recognition accuracy compared with the models that use inputs from a single modality and decision level multimodal fusion method.

Comments:	The manuscript consists of 8 pages with 6 figures and two results tables. Additionally, we provide a dedicated website to reach the dataset that we employed for this study: this http URL
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2009.06292 [cs.RO]
	(or arXiv:2009.06292v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2009.06292

Submission history

From: Murat Kirtay [view email]
[v1] Mon, 14 Sep 2020 09:39:48 UTC (2,051 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.RO

< prev | next >

new | recent | 2020-09

Change to browse by:

cs
cs.CV
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Guido Schillaci
Verena V. Hafner

export BibTeX citation

Computer Science > Robotics

Title:A Multisensory Learning Architecture for Rotation-invariant Object Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:A Multisensory Learning Architecture for Rotation-invariant Object Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators