Skip to main content

Showing 1–13 of 13 results for author: Sabet, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03303  [pdf, other

    cs.CV

    Learning Visual Prompts for Guiding the Attention of Vision Transformers

    Authors: Razieh Rezaei, Masoud Jalili Sabet, Jindong Gu, Daniel Rueckert, Philip Torr, Ashkan Khakzar

    Abstract: Visual prompting infuses visual information into the input image to adapt models toward specific predictions and tasks. Recently, manually crafted markers such as red circles are shown to guide the model to attend to a target region on the image. However, these markers only work on models trained with data containing those markers. Moreover, finding these prompts requires guesswork or prior knowle… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Short version (4-pages) accepted as a spotlight paper at T4V workshop, CVPR 2024

  2. Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages

    Authors: Ayyoob Imani, Peiqin Lin, Amir Hossein Kargaran, Silvia Severini, Masoud Jalili Sabet, Nora Kassner, Chunlan Ma, Helmut Schmid, André F. T. Martins, François Yvon, Hinrich Schütze

    Abstract: The NLP community has mainly focused on scaling Large Language Models (LLMs) vertically, i.e., making them better for about 100 languages. We instead scale LLMs horizontally: we create, through continued pretraining, Glot500-m, an LLM that covers 511 predominantly low-resource languages. An important part of this effort is to collect and clean Glot500-c, a corpus that covers these 511 languages an… ▽ More

    Submitted 26 May, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  3. arXiv:2211.05335  [pdf

    cs.CV cs.AI cs.GR cs.LG cs.RO

    Scalable Modular Synthetic Data Generation for Advancing Aerial Autonomy

    Authors: Mehrnaz Sabet, Praveen Palanisamy, Sakshi Mishra

    Abstract: One major barrier to advancing aerial autonomy has been collecting large-scale aerial datasets for training machine learning models. Due to costly and time-consuming real-world data collection through deploying drones, there has been an increasing shift towards using synthetic data for training models in drone applications. However, to increase widespread generalization and transferring models to… ▽ More

    Submitted 25 May, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 25 pages, 14 figures

  4. arXiv:2210.09840  [pdf, other

    cs.CL

    Graph-Based Multilingual Label Propagation for Low-Resource Part-of-Speech Tagging

    Authors: Ayyoob Imani, Silvia Severini, Masoud Jalili Sabet, François Yvon, Hinrich Schütze

    Abstract: Part-of-Speech (POS) tagging is an important component of the NLP pipeline, but many low-resource languages lack labeled data for training. An established method for training a POS tagger in such a scenario is to create a labeled training set by transferring from high-resource languages. In this paper, we propose a novel method for transferring labels from multiple high-resource source to low-reso… ▽ More

    Submitted 31 October, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  5. arXiv:2205.15713  [pdf, other

    cs.CL

    Don't Forget Cheap Training Signals Before Building Unsupervised Bilingual Word Embeddings

    Authors: Silvia Severini, Viktor Hangya, Masoud Jalili Sabet, Alexander Fraser, Hinrich Schütze

    Abstract: Bilingual Word Embeddings (BWEs) are one of the cornerstones of cross-lingual transfer of NLP models. They can be built using only monolingual corpora without supervision leading to numerous works focusing on unsupervised BWEs. However, most of the current approaches to build unsupervised BWEs do not compare their results with methods based on easy-to-access cross-lingual signals. In this paper, w… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: BUCC@LREC 2022

  6. arXiv:2203.10010  [pdf, other

    cs.CL

    CaMEL: Case Marker Extraction without Labels

    Authors: Leonie Weissweiler, Valentin Hofmann, Masoud Jalili Sabet, Hinrich Schütze

    Abstract: We introduce CaMEL (Case Marker Extraction without Labels), a novel and challenging task in computational morphology that is especially relevant for low-resource languages. We propose a first model for CaMEL that uses a massively multilingual corpus to extract case markers in 83 languages based only on a noun phrase chunker and an alignment system. To evaluate CaMEL, we automatically construct a s… ▽ More

    Submitted 28 March, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  7. arXiv:2203.08654  [pdf, other

    cs.CL

    Graph Neural Networks for Multiparallel Word Alignment

    Authors: Ayyoob Imani, Lütfi Kerem Şenel, Masoud Jalili Sabet, François Yvon, Hinrich Schütze

    Abstract: After a period of decrease, interest in word alignments is increasing again for their usefulness in domains such as typological research, cross-lingual annotation projection, and machine translation. Generally, alignment algorithms only use bitext and do not make use of the fact that many parallel corpora are multiparallel. Here, we compute high-quality word alignments between multiple language pa… ▽ More

    Submitted 10 August, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Report number: ACL 2022 Findings

  8. arXiv:2109.06283  [pdf, other

    cs.CL

    Graph Algorithms for Multiparallel Word Alignment

    Authors: Ayyoob Imani, Masoud Jalili Sabet, Lütfi Kerem Şenel, Philipp Dufter, François Yvon, Hinrich Schütze

    Abstract: With the advent of end-to-end deep learning approaches in machine translation, interest in word alignments initially decreased; however, they have again become a focus of research more recently. Alignments are useful for typological research, transferring formatting like markup to translated texts, and can be used in the decoding of machine translation systems. At the same time, massively multilin… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  9. arXiv:2107.06632  [pdf, other

    cs.CL

    ParCourE: A Parallel Corpus Explorer for a Massively Multilingual Corpus

    Authors: Ayyoob Imani, Masoud Jalili Sabet, Philipp Dufter, Michael Cysouw, Hinrich Schütze

    Abstract: With more than 7000 languages worldwide, multilingual natural language processing (NLP) is essential both from an academic and commercial perspective. Researching typological properties of languages is fundamental for progress in multilingual NLP. Examples include assessing language similarity for effective transfer learning, injecting inductive biases into machine learning models or creating reso… ▽ More

    Submitted 15 July, 2021; v1 submitted 14 July, 2021; originally announced July 2021.

    Comments: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing

  10. arXiv:2012.11657  [pdf, other

    cs.CL

    Subword Sampling for Low Resource Word Alignment

    Authors: Ehsaneddin Asgari, Masoud Jalili Sabet, Philipp Dufter, Christopher Ringlstetter, Hinrich Schütze

    Abstract: Annotation projection is an important area in NLP that can greatly contribute to creating language resources for low-resource languages. Word alignment plays a key role in this setting. However, most of the existing word alignment methods are designed for a high resource setting in machine translation where millions of parallel sentences are available. This amount reduces to a few thousands of sen… ▽ More

    Submitted 15 June, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

  11. arXiv:2004.08728  [pdf, other

    cs.CL

    SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

    Authors: Masoud Jalili Sabet, Philipp Dufter, François Yvon, Hinrich Schütze

    Abstract: Word alignments are useful for tasks like statistical and neural machine translation (NMT) and cross-lingual annotation projection. Statistical word aligners perform well, as do methods that extract alignments jointly with translations in NMT. However, most approaches require parallel training data, and quality decreases as less training data is available. We propose word alignment methods that re… ▽ More

    Submitted 16 April, 2021; v1 submitted 18 April, 2020; originally announced April 2020.

    Comments: EMNLP (Findings) 2020

  12. arXiv:1811.00066  [pdf, other

    cs.CL

    Aligning Very Small Parallel Corpora Using Cross-Lingual Word Embeddings and a Monogamy Objective

    Authors: Nina Poerner, Masoud Jalili Sabet, Benjamin Roth, Hinrich Schütze

    Abstract: Count-based word alignment methods, such as the IBM models or fast-align, struggle on very small parallel corpora. We therefore present an alternative approach based on cross-lingual word embeddings (CLWEs), which are trained on purely monolingual data. Our main contribution is an unsupervised objective to adapt CLWEs to parallel corpora. In experiments on between 25 and 500 sentences, our method… ▽ More

    Submitted 31 October, 2018; originally announced November 2018.

  13. arXiv:1606.00615  [pdf, other

    cs.IR

    Low-dimensional Query Projection based on Divergence Minimization Feedback Model for Ad-hoc Retrieval

    Authors: Javid Dadashkarimi, Masoud Jalili Sabet, Heshaam Faili, Azadeh Shakery

    Abstract: Low-dimensional word vectors have long been used in a wide range of applications in natural language processing. In this paper we shed light on estimating query vectors in ad-hoc retrieval where a limited information is available in the original query. Pseudo-relevance feedback (PRF) is a well-known technique for updating query language models and expanding the queries with a number of relevant te… ▽ More

    Submitted 22 December, 2016; v1 submitted 2 June, 2016; originally announced June 2016.