Showing 1–4 of 4 results for author: Werk, M

  1. arXiv:2506.18902  [pdf, ps, other]

    cs.AI cs.CL cs.IR

    jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval

    Authors: Michael Günther, Saba Sturua, Mohammad Kalim Akram, Isabelle Mohr, Andrei Ungureanu, Bo Wang, Sedigheh Eslami, Scott Martens, Maximilian Werk, Nan Wang, Han Xiao

    Abstract: We introduce jina-embeddings-v4, a 3.8 billion parameter multimodal embedding model that unifies text and image representations through a novel architecture supporting both single-vector and multi-vector embeddings in the late interaction style. The model incorporates task-specific Low-Rank Adaptation (LoRA) adapters to optimize performance across diverse retrieval scenarios, including query-docum…

    Submitted 7 July, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

    Comments: 22 pages, 1-10 main, 14-22 experimental results, benchmark tables

    MSC Class: 68T50; ACM Class: I.2.7

  2. arXiv:2405.20204  [pdf, other]

    cs.CL cs.AI cs.CV cs.IR

    Jina CLIP: Your CLIP Model Is Also Your Text Retriever

    Authors: Andreas Koukounas, Georgios Mastrapas, Michael Günther, Bo Wang, Scott Martens, Isabelle Mohr, Saba Sturua, Mohammad Kalim Akram, Joan Fontanals Martínez, Saahil Ognawala, Susana Guzman, Maximilian Werk, Nan Wang, Han Xiao

    Abstract: Contrastive Language-Image Pretraining (CLIP) is widely used to train models to align images and texts in a common embedding space by mapping them to fixed-sized vectors. These models are key to multimodal information retrieval and related tasks. However, CLIP models generally underperform in text-only tasks compared to specialized text models. This creates inefficiencies for information retrieval…

    Submitted 26 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 4 pages, MFM-EAI@ICML2024

    MSC Class: 68T50; ACM Class: I.2.7

  3. arXiv:2402.17016  [pdf, other]

    cs.CL cs.AI cs.IR

    Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings

    Authors: Isabelle Mohr, Markus Krimmel, Saba Sturua, Mohammad Kalim Akram, Andreas Koukounas, Michael Günther, Georgios Mastrapas, Vinit Ravishankar, Joan Fontanals Martínez, Feng Wang, Qi Liu, Ziniu Yu, Jie Fu, Saahil Ognawala, Susana Guzman, Bo Wang, Maximilian Werk, Nan Wang, Han Xiao

    Abstract: We introduce a novel suite of state-of-the-art bilingual text embedding models that are designed to support English and another target language. These models are capable of processing lengthy text inputs with up to 8192 tokens, making them highly versatile for a range of natural language processing tasks such as text retrieval, clustering, and semantic textual similarity (STS) calculations. By f…

    Submitted 26 February, 2024; originally announced February 2024.

    MSC Class: 68T50; ACM Class: I.2.7

  4. arXiv:2310.19923  [pdf, other]

    cs.CL cs.AI cs.LG

    Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

    Authors: Michael Günther, Jackmin Ong, Isabelle Mohr, Alaeddine Abdessalem, Tanguy Abel, Mohammad Kalim Akram, Susana Guzman, Georgios Mastrapas, Saba Sturua, Bo Wang, Maximilian Werk, Nan Wang, Han Xiao

    Abstract: Text embedding models have emerged as powerful tools for transforming sentences into fixed-sized feature vectors that encapsulate semantic information. While these models are essential for tasks like information retrieval, semantic clustering, and text re-ranking, most existing open-source models, especially those built on architectures like BERT, struggle to represent lengthy documents and often…

    Submitted 4 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 14 pages

    MSC Class: 68T50; ACM Class: I.2.7