Jordi Luque and Xavier Giró-I-Nieto are qualified to endorse.

Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and Videos

Benet Oriol Sabat: Is registered as an author of this paper.
Not currently an endorser.
Jordi Luque: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.CR, cs.CV, cs.CY, cs.IR, cs.LG, cs.MM, cs.NI, cs.SD, cs.SI, cs.SY, eess.AS, eess.SP, eess.SY.
Xavier Giró-I-Nieto: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.CV, cs.HC, cs.IR, cs.MM.

Ferran Diego is not registered as an owner of this paper.