Skip to main content

Showing 1–12 of 12 results for author: Farajian, A

.
  1. arXiv:2506.04079  [pdf, ps, other

    cs.CL cs.AI cs.LG

    EuroLLM-9B: Technical Report

    Authors: Pedro Henrique Martins, João Alves, Patrick Fernandes, Nuno M. Guerreiro, Ricardo Rei, Amin Farajian, Mateusz Klimaszewski, Duarte M. Alves, José Pombal, Manuel Faysse, Pierre Colombo, François Yvon, Barry Haddow, José G. C. de Souza, Alexandra Birch, André F. T. Martins

    Abstract: This report presents EuroLLM-9B, a large language model trained from scratch to support the needs of European citizens by covering all 24 official European Union languages and 11 additional languages. EuroLLM addresses the issue of European languages being underrepresented and underserved in existing open large language models. We provide a comprehensive overview of EuroLLM-9B's development, inclu… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 56 pages

  2. arXiv:2410.11624  [pdf, other

    cs.CL

    Findings of the WMT 2024 Shared Task on Chat Translation

    Authors: Wafaa Mohammed, Sweta Agrawal, M. Amin Farajian, Vera Cabarrão, Bryan Eikema, Ana C. Farinha, José G. C. de Souza

    Abstract: This paper presents the findings from the third edition of the Chat Translation Shared Task. As with previous editions, the task involved translating bilingual customer support conversations, specifically focusing on the impact of conversation context in translation quality and evaluation. We also include two new language pairs: English-Korean and English-Dutch, in addition to the set of language… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures, 13 tables

  3. arXiv:2409.16235  [pdf, other

    cs.CL

    EuroLLM: Multilingual Language Models for Europe

    Authors: Pedro Henrique Martins, Patrick Fernandes, João Alves, Nuno M. Guerreiro, Ricardo Rei, Duarte M. Alves, José Pombal, Amin Farajian, Manuel Faysse, Mateusz Klimaszewski, Pierre Colombo, Barry Haddow, José G. C. de Souza, Alexandra Birch, André F. T. Martins

    Abstract: The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding and generating text in all official European Union languages, as well as several additional relevant languages. We outline the progress made to date,… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  4. arXiv:2403.08314  [pdf, other

    cs.CL

    Is Context Helpful for Chat Translation Evaluation?

    Authors: Sweta Agrawal, Amin Farajian, Patrick Fernandes, Ricardo Rei, André F. T. Martins

    Abstract: Despite the recent success of automatic metrics for assessing translation quality, their application in evaluating the quality of machine-translated chats has been limited. Unlike more structured texts like news, chat conversations are often unstructured, short, and heavily reliant on contextual information. This poses questions about the reliability of existing sentence-level metrics in this doma… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2402.17733  [pdf, other

    cs.CL

    Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

    Authors: Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

    Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  6. arXiv:1907.10352  [pdf, other

    cs.CL

    Unbabel's Participation in the WMT19 Translation Quality Estimation Shared Task

    Authors: Fabio Kepler, Jonay Trénous, Marcos Treviso, Miguel Vera, António Góis, M. Amin Farajian, António V. Lopes, André F. T. Martins

    Abstract: We present the contribution of the Unbabel team to the WMT 2019 Shared Task on Quality Estimation. We participated on the word, sentence, and document-level tracks, encompassing 3 language pairs: English-German, English-Russian, and English-French. Our submissions build upon the recent OpenKiwi framework: we combine linear, neural, and predictor-estimator systems with new transfer learning approac… ▽ More

    Submitted 11 September, 2019; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: In Proceedings of the Fourth Conference on Machine Translation (WMT) 2019: https://www.aclweb.org/anthology/W19-5406/

  7. arXiv:1905.13068  [pdf, other

    cs.CL

    Unbabel's Submission to the WMT2019 APE Shared Task: BERT-based Encoder-Decoder for Automatic Post-Editing

    Authors: António V. Lopes, M. Amin Farajian, Gonçalo M. Correia, Jonay Trenous, André F. T. Martins

    Abstract: This paper describes Unbabel's submission to the WMT2019 APE Shared Task for the English-German language pair. Following the recent rise of large, powerful, pre-trained models, we adapt the BERT pretrained model to perform Automatic Post-Editing in an encoder-decoder framework. Analogously to dual-encoder architectures we develop a BERT-based encoder-decoder (BED) model in which a single pretraine… ▽ More

    Submitted 29 June, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: Updated sections 2.2 and 4

  8. LOCV calculation of the equation of state and properties of rapidly rotating neutron stars

    Authors: A. H. Farajian, M. Bigdeli, S. Belbasi

    Abstract: In this paper, we have investigated the structural properties of rotating neutron stars using the numerical RNS code and the equation of states which have been calculated within the lowest order constrained variational approach. In order to calculate the equation of state of nuclear matter, we have used UV$_{14}$ $+$TNI and AV$_{18}$ potentials. Here, we have computed the maximum mass of the neutr… ▽ More

    Submitted 11 May, 2018; v1 submitted 12 April, 2018; originally announced April 2018.

    Comments: 7 pages, 7 figures

    Journal ref: Chinese Physics C, Vol.42, No.6 (2018)065102

  9. arXiv:1007.2110  [pdf

    cond-mat.mtrl-sci

    Hydrogen Compounds of Group-IV Nanosheets

    Authors: L. C. Lew Yan Voon, E. Sandberg, R. S. Aga, A. A. Farajian

    Abstract: The structural and electronic properties of the hydrides of silicene and germanene have been studied using ab initio calculations. The trend for the M-H (M=C, Si, Ge) bond lengths, and corresponding bond energies, is consistent with the atomic size trend, and comparable to those of MH_4 hydrides. Band structures were also obtained for the buckled configuration, which is the stable form for both si… ▽ More

    Submitted 13 July, 2010; originally announced July 2010.

    Comments: 9 pages, 7 figures

  10. arXiv:cond-mat/0207149   

    cond-mat.str-el cond-mat.mes-hall

    Vacuum polarization in nanotubes

    Authors: K. Sasaki, A. A. Farajian, H. Mizuseki, Y. Kawazoe

    Abstract: This paper has been withdrawn by the author due to a crucial error.

    Submitted 24 June, 2003; v1 submitted 5 July, 2002; originally announced July 2002.

    Comments: This paper has been withdrawn

    Report number: TU-661

  11. arXiv:cond-mat/0205637  [pdf, ps, other

    cond-mat.str-el cond-mat.mtrl-sci

    Effective Screening of Localized Charged Perturbations in Metallic Nanotubes: Roles of Massive Bands

    Authors: K. Sasaki, A. A. Farajian, H. Mizuseki, Y. Kawazoe

    Abstract: The massive-band effects on screening behavior of metallic carbon nanotubes are theoretically investigated using two different methods; continuous and lattice quantum theories. Both approaches show screening of a localized external perturbation with an effective screening length of the order of the nanotube diameter. Calculating the nonlinear deformation of the local density of states near the c… ▽ More

    Submitted 16 April, 2003; v1 submitted 30 May, 2002; originally announced May 2002.

    Comments: 6 pages, 4 figures

    Report number: TU-657

  12. arXiv:cond-mat/0204609  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Nonlinear charging, and transport times in doped nanotubes junctions

    Authors: Keivan Esfarjani, Amir A. Farajian, Siu Tat Chui, Yoshiyuki Kawazoe

    Abstract: The nonlinear capacitance in doped nanotube junctions is calculated self consistently. It decreases as a function of the applied bias when the latter becomes larger than the pseudogap of the nanotube. For this device, one can deduce a relaxation time of about 0.1 femtosecond. Because of its negative differential resistance (NDR), a switching time of less than a fs can also be deduced.

    Submitted 10 September, 2003; v1 submitted 29 April, 2002; originally announced April 2002.

    Comments: Letter with 3 .ps figures