-
KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025
Authors:
Sai Koneru,
Maike Züfle,
Thai-Binh Nguyen,
Seymanur Akti,
Jan Niehues,
Alexander Waibel
Abstract:
The scope of the International Workshop on Spoken Language Translation (IWSLT) has recently broadened beyond traditional Speech Translation (ST) to encompass a wider array of tasks, including Speech Question Answering and Summarization. This shift is partly driven by the growing capabilities of modern systems, particularly with the success of Large Language Models (LLMs). In this paper, we present the Karlsruhe Institute of Technology's submissions for the Offline ST and Instruction Following (IF) tracks, where we leverage LLMs to enhance performance across all tasks. For the Offline ST track, we propose a pipeline that employs multiple automatic speech recognition systems, whose outputs are fused using an LLM with document-level context. This is followed by a two-step translation process, incorporating an additional refinement step to improve translation quality. For the IF track, we develop an end-to-end model that integrates a speech encoder with an LLM to perform a wide range of instruction-following tasks. We complement it with a final document-level refinement stage to further enhance output quality by using contextual information.
Submitted 19 May, 2025;
originally announced May 2025.
-
Quality-Aware Decoding: Unifying Quality Estimation and Decoding
Authors:
Sai Koneru,
Matthias Huck,
Miriam Exel,
Jan Niehues
Abstract:
Quality Estimation (QE) models for Neural Machine Translation (NMT) predict the quality of the hypothesis without having access to the reference. An emerging research direction in NMT involves the use of QE models, which have demonstrated high correlations with human judgment and can enhance translations through Quality-Aware Decoding. Although several approaches have been proposed based on sampling multiple candidate translations and picking the best candidate, none have integrated these models directly into the decoding process. In this paper, we address this by proposing a novel token-level QE model capable of reliably scoring partial translations. We build a uni-directional QE model for this, as decoder models are inherently trained and efficient on partial sequences. We then present a decoding strategy that integrates the QE model for Quality-Aware decoding and demonstrate that the translation quality improves when compared to the N-best list re-ranking with state-of-the-art QE models (up to $1.39$ XCOMET-XXL $\uparrow$). Finally, we show that our approach provides significant benefits in document translation tasks, where the quality of N-best lists is typically suboptimal. Code can be found at https://ai4lt.iar.kit.edu/english/projects_kontextmt.php
Submitted 1 June, 2025; v1 submitted 12 February, 2025;
originally announced February 2025.
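The N-best list re-ranking baseline that this abstract compares against can be sketched as follows. This is a minimal illustration, not the authors' implementation: `rerank_nbest`, `toy_qe`, and the interpolation weight `alpha` are hypothetical names introduced here, and the toy scorer merely stands in for a real reference-free QE model.

```python
from typing import Callable, List, Tuple


def rerank_nbest(
    source: str,
    candidates: List[Tuple[str, float]],
    qe_score: Callable[[str, str], float],
    alpha: float = 0.5,
) -> str:
    """Return the candidate with the best combined score.

    candidates: (translation, model log-probability) pairs.
    qe_score:   hypothetical reference-free quality estimator, higher is better.
    alpha:      assumed weight on the QE score versus the model score.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    best, _ = max(
        candidates,
        key=lambda c: alpha * qe_score(source, c[0]) + (1.0 - alpha) * c[1],
    )
    return best


def toy_qe(source: str, hypothesis: str) -> float:
    # Toy stand-in for a QE model: penalise a word-count mismatch.
    return -float(abs(len(source.split()) - len(hypothesis.split())))


nbest = [("Das ist", -0.2), ("Das ist ein Test", -0.9)]
print(rerank_nbest("This is a test", nbest, toy_qe))
```

Re-ranking of this kind can only choose among finished hypotheses. The abstract's proposal differs in that the QE model scores partial translations, so it can steer the search while decoding is still in progress.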
-
Have LLMs Reopened the Pandora's Box of AI-Generated Fake News?
Authors:
Xinyu Wang,
Wenbo Zhang,
Sai Koneru,
Hangzhi Guo,
Bonam Mingole,
S. Shyam Sundar,
Sarah Rajtmajer,
Amulya Yadav
Abstract:
With the rise of AI-generated content spewed at scale from large language models (LLMs), genuine concerns about the spread of fake news have intensified. The perceived ability of LLMs to produce convincing fake news at scale poses new challenges for both human and automated fake news detection systems. To address this gap, this paper presents the findings from a university-level competition that aimed to explore how LLMs can be used by humans to create fake news, and to assess the ability of human annotators and AI models to detect it. A total of 110 participants used LLMs to create 252 unique fake news stories, and 84 annotators participated in the detection tasks. Our findings indicate that LLMs are ~68% more effective at detecting real news than humans. However, for fake news detection, the performance of LLMs and humans remains comparable (~60% accuracy). Additionally, we examine the impact of visual elements (e.g., pictures) in news on the accuracy of detecting fake news stories. Finally, we also examine various strategies used by fake news creators to enhance the credibility of their AI-generated content. This work highlights the increasing complexity of detecting AI-generated fake news, particularly in collaborative human-AI settings.
Submitted 29 March, 2025; v1 submitted 24 October, 2024;
originally announced October 2024.
-
Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking Across Diverse Vocabularies
Authors:
Sai Koneru,
Matthias Huck,
Miriam Exel,
Jan Niehues
Abstract:
Recent advancements in NLP have resulted in models with specialized strengths, such as processing multimodal inputs or excelling in specific domains. However, real-world tasks, like multimodal translation, often require a combination of these strengths, such as handling both translation and image processing. While individual translation and vision models are powerful, they typically lack the ability to perform both tasks in a single system. Combining these models poses challenges, particularly due to differences in their vocabularies, which limit the effectiveness of traditional ensemble methods to post-generation techniques like N-best list re-ranking. In this work, we propose a novel zero-shot ensembling strategy that allows for the integration of different models during the decoding phase without the need for additional training. Our approach re-ranks beams during decoding by combining scores at the word level, using heuristics to predict when a word is completed. We demonstrate the effectiveness of this method in machine translation scenarios, showing that it enables the generation of translations that are both speech- and image-aware while also improving overall translation quality. We will release the code upon paper acceptance.
Submitted 4 November, 2024; v1 submitted 21 August, 2024;
originally announced August 2024.
-
Assessing the Effectiveness of GPT-4o in Climate Change Evidence Synthesis and Systematic Assessments: Preliminary Insights
Authors:
Elphin Tom Joe,
Sai Dileep Koneru,
Christine J Kirchhoff
Abstract:
In this research short, we examine the potential of using GPT-4o, a state-of-the-art large language model (LLM), to undertake evidence synthesis and systematic assessment tasks. Traditional workflows for such tasks involve large groups of domain experts who manually review and synthesize vast amounts of literature. The exponential growth of scientific literature and recent advances in LLMs provide an opportunity to complement these traditional workflows with new-age tools. We assess the efficacy of GPT-4o on these tasks using a sample from the dataset created by the Global Adaptation Mapping Initiative (GAMI), where we check the accuracy of climate change adaptation-related feature extraction from the scientific literature across three levels of expertise. Our results indicate that while GPT-4o can achieve high accuracy in low-expertise tasks like geographic location identification, its performance in intermediate and high-expertise tasks, such as stakeholder identification and assessment of depth of the adaptation response, is less reliable. The findings motivate the need for designing assessment workflows that utilize the strengths of models like GPT-4o while also providing refinements to improve their performance on these tasks.
Submitted 2 July, 2024;
originally announced July 2024.
-
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024
Authors:
Sai Koneru,
Thai-Binh Nguyen,
Ngoc-Quan Pham,
Danni Liu,
Zhaolin Li,
Alexander Waibel,
Jan Niehues
Abstract:
Large Language Models (LLMs) are currently under exploration for various tasks, including Automatic Speech Recognition (ASR), Machine Translation (MT), and even End-to-End Speech Translation (ST). In this paper, we present KIT's offline submission in the constrained + LLM track by incorporating recently proposed techniques that can be added to any cascaded speech translation system. Specifically, we integrate Mistral-7B (mistralai/Mistral-7B-Instruct-v0.1) into our system to enhance it in two ways. Firstly, we refine the ASR outputs by utilizing the N-best lists generated by our system and fine-tuning the LLM to predict the transcript accurately. Secondly, we refine the MT outputs at the document level by fine-tuning the LLM, leveraging both ASR and MT predictions to improve translation quality. We find that integrating the LLM into the ASR and MT systems results in an absolute improvement of $0.3\%$ in Word Error Rate and $0.65\%$ in COMET on the tst2019 test set. In challenging test sets with overlapping speakers and background noise, we find that integrating the LLM is not beneficial due to poor ASR performance. Here, we use ASR with chunked long-form decoding to improve context usage that may be unavailable when transcribing with Voice Activity Detection segmentation alone.
Submitted 24 June, 2024;
originally announced June 2024.
-
The Failed Migration of Academic Twitter
Authors:
Xinyu Wang,
Sai Koneru,
Sarah Rajtmajer
Abstract:
Following changes in Twitter's ownership and subsequent changes to content moderation policies, many in academia looked to move their discourse elsewhere and migration to Mastodon was pursued by some. Our study looks at the dynamics of this migration. Utilizing publicly available user account data, we track the posting activity of academics on Mastodon over a one-year period. We also gathered follower-followee relationships to map internal networks, finding that the subset of academics who engaged in migration were well-connected. However, this strong internal connectivity was insufficient to prevent users from returning to Twitter/X. Our analyses reveal significant challenges sustaining user engagement on Mastodon due to its decentralized structure as well as competition from other platforms such as Bluesky and Threads. The movement lost momentum after an initial surge of enthusiasm, during which the main network was fully established, as most users did not maintain their activity levels, and those who did faced lower levels of engagement. Our findings highlight the challenges involved in transitioning professional communities to decentralized platforms, emphasizing the need for focus on community building for long-term user engagement.
Submitted 23 October, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content
Authors:
Xinyu Wang,
Sai Koneru,
Pranav Narayanan Venkit,
Brett Frischmann,
Sarah Rajtmajer
Abstract:
As social media has become a predominant mode of communication globally, the rise of abusive content threatens to undermine civil discourse. Recognizing the critical nature of this issue, a significant body of research has been dedicated to developing language models that can detect various types of online abuse, e.g., hate speech, cyberbullying. However, there exists a notable disconnect between platform policies, which often consider the author's intention as a criterion for content moderation, and the current capabilities of detection models, which typically lack efforts to capture intent. This paper examines the role of intent in content moderation systems. We review state-of-the-art detection models and benchmark training datasets for online abuse to assess their awareness and ability to capture intent. We propose strategic changes to the design and development of automated detection and moderation systems to improve alignment with ethical and policy conceptualizations of abuse.
Submitted 17 May, 2024;
originally announced May 2024.
-
Reproducibility, Replicability, and Transparency in Research: What 430 Professors Think in Universities across the USA and India
Authors:
Tatiana Chakravorti,
Sai Dileep Koneru,
Sarah Rajtmajer
Abstract:
In the past decade, open science and science of science communities have initiated innovative efforts to address concerns about the reproducibility and replicability of published scientific research. In some respects, these efforts have been successful, yet there are still many pockets of researchers with little to no familiarity with these concerns, subsequent responses, or best practices for engaging in reproducible, replicable, and reliable scholarship. In this work, we survey 430 professors from universities across the USA and India to understand perspectives on scientific processes and identify key points for intervention. Our findings reveal both national and disciplinary gaps in attention to reproducibility and replicability, aggravated by incentive misalignment and resource constraints. We suggest that solutions addressing scientific integrity should be culturally centered, where definitions of culture should include both regional and domain-specific elements.
Submitted 13 February, 2024;
originally announced February 2024.
-
Perspectives from India: Opportunities and Challenges for AI Replication Prediction to Improve Confidence in Published Research
Authors:
Tatiana Chakravorti,
Chuhao Wu,
Sai Koneru,
Sarah Rajtmajer
Abstract:
Over the past decade, a crisis of confidence in scientific literature has gained attention, particularly in the West. In response, we have seen changes in policy and practice amongst individual researchers and institutions. Greater attention is given to the transparency of workflows and the appropriate use of statistical methods. Advances in scholarly big data and machine learning have led to the development of AI-driven tools for the evaluation of published findings. In this study, we conduct 19 semi-structured interviews with Indian researchers to understand their perspectives on challenges and opportunities for AI technologies to improve confidence in published research. Our findings highlight the importance of social and cultural context for the design and deployment of AI tools for research assessment. Our work suggests that such technologies must work alongside rather than replace human research assessment mechanisms. They must be explainable and situated within well-functioning human-centered peer review processes.
Submitted 15 September, 2024; v1 submitted 29 October, 2023;
originally announced October 2023.
-
Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing
Authors:
Sai Koneru,
Miriam Exel,
Matthias Huck,
Jan Niehues
Abstract:
Large Language Models (LLMs) have demonstrated considerable success in various Natural Language Processing tasks, but they have yet to attain state-of-the-art performance in Neural Machine Translation (NMT). Nevertheless, their strong performance in tasks demanding a broad understanding and contextual processing shows their potential for translation. To exploit these abilities, we investigate using LLMs for MT and explore recent parameter-efficient fine-tuning techniques. Surprisingly, our initial experiments find that fine-tuning for translation purposes even led to performance degradation. To overcome this, we propose an alternative approach: adapting LLMs as Automatic Post-Editors (APE) rather than direct translators. Building on LLMs' exceptional ability to process and generate lengthy sequences, we also propose extending our approach to document-level translation. We show that leveraging Low-Rank-Adapter fine-tuning for APE can yield significant improvements across both sentence and document-level metrics while generalizing to out-of-domain data. Most notably, we achieve a state-of-the-art accuracy rate of 89% on the ContraPro test set, which specifically assesses the model's ability to resolve pronoun ambiguities when translating from English to German. Lastly, we investigate a practical scenario involving manual post-editing for document-level translation, where reference context is made available. Here, we demonstrate that leveraging human corrections can significantly reduce the number of edits required for subsequent translations (Interactive Demo for integrating manual feedback can be found here: https://huggingface.co/spaces/skoneru/contextual_refinement_ende).
Submitted 18 March, 2024; v1 submitted 23 October, 2023;
originally announced October 2023.
-
Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences
Authors:
Sai Koneru,
Jian Wu,
Sarah Rajtmajer
Abstract:
Hypothesis formulation and testing are central to empirical research. A strong hypothesis is a best guess based on existing evidence and informed by a comprehensive view of relevant literature. However, with the exponential increase in the number of scientific articles published annually, manual aggregation and synthesis of evidence related to a given hypothesis is a challenge. Our work explores the ability of current large language models (LLMs) to discern evidence in support or refutation of specific hypotheses based on the text of scientific abstracts. We share a novel dataset for the task of scientific hypothesis evidencing using community-driven annotations of studies in the social sciences. We compare the performance of LLMs to several state-of-the-art benchmarks and highlight opportunities for future research in this area. The dataset is available at https://github.com/Sai90000/ScientificHypothesisEvidencing.git
Submitted 25 March, 2024; v1 submitted 7 September, 2023;
originally announced September 2023.
-
End-to-End Evaluation for Low-Latency Simultaneous Speech Translation
Authors:
Christian Huber,
Tu Anh Dinh,
Carlos Mullov,
Ngoc Quan Pham,
Thai Binh Nguyen,
Fabian Retkowski,
Stefan Constantin,
Enes Yavuz Ugan,
Danni Liu,
Zhaolin Li,
Sai Koneru,
Jan Niehues,
Alexander Waibel
Abstract:
The challenge of low-latency speech translation has recently drawn significant interest in the research community, as shown by several publications and shared tasks. Therefore, it is essential to evaluate these different approaches in realistic scenarios. However, currently only specific aspects of the systems are evaluated and often it is not possible to compare different approaches.
In this work, we propose the first framework to perform and evaluate the various aspects of low-latency speech translation under realistic conditions. The evaluation is carried out in an end-to-end fashion. This includes the segmentation of the audio as well as the run-time of the different components.
Secondly, we compare different approaches to low-latency speech translation using this framework. We evaluate models with the option to revise the output as well as methods with fixed output. Furthermore, we directly compare state-of-the-art cascaded as well as end-to-end systems. Finally, the framework allows automatic evaluation of translation quality and latency, and also provides a web interface to show the low-latency model outputs to the user.
Submitted 17 July, 2024; v1 submitted 7 August, 2023;
originally announced August 2023.
-
KIT's Multilingual Speech Translation System for IWSLT 2023
Authors:
Danni Liu,
Thai Binh Nguyen,
Sai Koneru,
Enes Yavuz Ugan,
Ngoc-Quan Pham,
Tuan-Nam Nguyen,
Tu Anh Dinh,
Carlos Mullov,
Alexander Waibel,
Jan Niehues
Abstract:
Many existing speech translation benchmarks focus on native-English speech in high-quality recording conditions, which often do not match the conditions in real-life use-cases. In this paper, we describe our speech translation system for the multilingual track of IWSLT 2023, which evaluates translation quality on scientific conference talks. The test condition features accented input speech and terminology-dense content. The task requires translation into 10 languages of varying amounts of resources. In the absence of training data from the target domain, we use a retrieval-based approach (kNN-MT) for effective adaptation (+0.8 BLEU for speech translation). We also use adapters to easily integrate incremental training data from data augmentation, and show that this matches the performance of re-training. We observe that cascaded systems are more easily adaptable towards specific target domains, due to their separate modules. Our cascaded speech system substantially outperforms its end-to-end counterpart on scientific talk translation, although their performance remains similar on TED talks.
Submitted 12 July, 2023; v1 submitted 8 June, 2023;
originally announced June 2023.
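The retrieval-based adaptation mentioned in this abstract (kNN-MT) is commonly described as interpolating the translation model's next-token distribution with a distribution built from retrieved nearest neighbours. The abstract gives no implementation details, so the sketch below follows that common formulation only; the function names, the temperature, and the interpolation weight `lam` are assumptions made for illustration.

```python
import math
from collections import defaultdict
from typing import Dict, List, Tuple


def knn_distribution(
    neighbors: List[Tuple[str, float]], temperature: float = 10.0
) -> Dict[str, float]:
    """Turn retrieved (target token, distance) pairs into a distribution.

    Closer neighbours get more mass via a softmax over negative distances.
    """
    if not neighbors:
        return {}
    weights: Dict[str, float] = defaultdict(float)
    for token, distance in neighbors:
        weights[token] += math.exp(-distance / temperature)
    total = sum(weights.values())
    return {token: w / total for token, w in weights.items()}


def interpolate(
    p_mt: Dict[str, float], p_knn: Dict[str, float], lam: float = 0.5
) -> Dict[str, float]:
    """Mix the model distribution with the retrieval distribution."""
    vocab = set(p_mt) | set(p_knn)
    return {
        token: lam * p_knn.get(token, 0.0) + (1.0 - lam) * p_mt.get(token, 0.0)
        for token in vocab
    }


# Toy example: the retrieved in-domain neighbours favour "Haus".
p_knn = knn_distribution([("Haus", 0.0), ("Haus", 0.0), ("Heim", 0.0)])
p_mt = {"Haus": 0.5, "Gebäude": 0.5}
p_final = interpolate(p_mt, p_knn, lam=0.5)
print(max(p_final, key=p_final.get))
```

Because the datastore of neighbours can be built from in-domain text without updating model weights, this kind of interpolation suits the setting the abstract describes, where no target-domain training data is available.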
-
Microstructural engineering by heat treatments of multi-principal element alloys via spinodal mediated phase transformation pathways
Authors:
Shalini Roy Koneru,
Kamalnath Kadirvel,
Hamish Fraser,
Yunzhi Wang
Abstract:
Nanoscale multi-phase microstructures observed in multi-principal element alloys (MPEAs) such as $\rm AlMo_{0.5}NbTa_{0.5}TiZr$, $\rm Al_{0.5}NbTa_{0.8}Ti_{1.5}V_{0.2}Zr$, $\rm TiZrNbTa$, $\rm AlCoCrFeNi$ and $\rm Fe_{15}Co_{15}Ni_{20}Mn_{20}Cu_{30}$ that exhibit promising mechanical or functional properties may have evolved through spinodal-mediated phase transformation pathways (PTPs). The microstructures in such MPEA systems could be further engineered for targeted applications by appropriately designing the alloy composition and heat-treatment schedule. In this study, we systematically investigate how different heat treatment schedules such as single-step isothermal aging, two-step isothermal aging and continuous cooling alter the interplay among the various factors associated with alloy composition, such as volume fraction of individual phases, lattice misfit and modulus mismatch between the co-existing phases. We have determined the degree to which these factors significantly influence the spinodal-mediated PTPs and the corresponding microstructures by use of high-throughput phase-field simulations. In particular, we demonstrate that the microstructural topology (i.e., which phase forms the continuous matrix and which phase forms discrete precipitates) in the same MPEA having an asymmetric miscibility gap could be inverted simply by a continuous cooling heat treatment. Further, we reveal a rich variety of novel hierarchical microstructures that could be designed using two-step isothermal aging heat treatments in MPEA systems with symmetric or asymmetric miscibility gaps. These simulation results may shed light on novel microstructure design and engineering for the above-mentioned MPEA systems.
Submitted 10 April, 2023; v1 submitted 27 March, 2023;
originally announced March 2023.
-
The evolution of scientific literature as metastable knowledge states
Authors:
Sai Dileep Koneru,
David Rench McCauley,
Michael C. Smith,
David Guarrera,
Jenn Robinson,
Sarah Rajtmajer
Abstract:
The problem of identifying common concepts in the sciences and deciding when new ideas have emerged is an open one. Metascience researchers have sought to formalize principles underlying stages in the life-cycle of scientific research, determine how knowledge is transferred between scientists and stakeholders, and understand how new ideas are generated and take hold. Here, we model the state of scientific knowledge immediately preceding new directions of research as a metastable state and the creation of new concepts as combinatorial innovation. We find that, through the combined use of natural language clustering and citation graph analysis, we can predict the evolution of ideas over time and thus connect a single scientific article to past and future concepts in a way that goes beyond traditional citation and reference connections.
Submitted 11 September, 2022; v1 submitted 25 February, 2022;
originally announced February 2022.
-
Cost-Effective Training in Low-Resource Neural Machine Translation
Authors:
Sai Koneru,
Danni Liu,
Jan Niehues
Abstract:
While Active Learning (AL) techniques are explored in Neural Machine Translation (NMT), only a few works focus on tackling low annotation budgets where a limited number of sentences can be translated. Such situations are especially challenging and can occur for endangered languages with few human annotators or when cost constraints prevent labelling large amounts of data. Although AL is shown to be helpful with large budgets, it is not enough to build high-quality translation systems in these low-resource conditions. In this work, we propose a cost-effective training procedure to increase the performance of NMT models utilizing a small number of annotated sentences and dictionary entries. Our method leverages monolingual data with self-supervised objectives and a small-scale, inexpensive dictionary for additional supervision to initialize the NMT model before applying AL. We show that improving the model using a combination of these knowledge sources is essential to exploit AL strategies and increase gains in low-resource conditions. We also present a novel AL strategy inspired by domain adaptation for NMT and show that it is effective for low budgets. We propose a new hybrid data-driven approach, which samples sentences that are diverse from the labelled data and also most similar to unlabelled data. Finally, we show that initializing the NMT model and further using our AL strategy can achieve gains of up to $13$ BLEU compared to conventional AL methods.
Submitted 14 January, 2022;
originally announced January 2022.
-
Exploration of Spinodal Decomposition in Multi-Principal Element Alloys (MPEAs) using CALPHAD Modeling
Authors:
Kamalnath Kadirvel,
Shalini Roy Koneru,
Yunzhi Wang
Abstract:
Researchers attributed the orderly arranged nanoscale phases observed in many multi-principal element alloys (MPEAs) to spinodal/spinodal-mediated phase transformation pathways. However, spinodal decomposition is not well understood in multicomponent systems. Although the theoretical background is available, CALPHAD databases were not used to explore the miscibility gap in MPEAs. In this work, we develop a CALPHAD framework utilizing the Hessian of free energy to study the stability of solid solutions in MPEAs. In particular, we utilize the geometry of the higher-dimensional Gibbs simplex in conjunction with the Hessian to calculate concentration modulations in the early stages of spinodal decomposition. We apply this framework to a diverse set of multi-phase MPEAs that have been studied in the literature including TiZrNbTa (BCC/BCC), Fe15Co15Ni20Mn20Cu30 (FCC/FCC), Al0.5NbTa0.8Ti1.5V0.2Zr (BCC/B2), AlCo0.4Cr0.6FeNi (BCC/B2) and Al0.5Cr0.9FeNi2.5V0.2 (FCC/L12). We show that the MPEA systems are unstable only to certain concentration modulations which could be further explored to design microstructurally engineered alloys.
Submitted 29 December, 2021; v1 submitted 23 December, 2021;
originally announced December 2021.
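The Hessian-based stability test referred to in this abstract rests on a standard second-order argument, which can be written compactly. The notation below is chosen here for illustration and is not taken from the paper: it assumes an $n$-component solution described by $n-1$ independent concentrations $c_1, \dots, c_{n-1}$, a free energy density $G$, and a small fluctuation $\delta\mathbf{c}$ that conserves the overall composition, with gradient and elastic energy contributions neglected.

```latex
\delta G \;\approx\; \tfrac{1}{2}\,\delta\mathbf{c}^{\mathsf{T}}\,\mathbf{H}\,\delta\mathbf{c},
\qquad
H_{ij} \;=\; \frac{\partial^{2} G}{\partial c_i\,\partial c_j},
\quad i, j = 1, \dots, n-1,
\qquad
\text{unstable if } \lambda_{\min}(\mathbf{H}) < 0 .
```

Under these assumptions, the homogeneous solid solution is unstable to any fluctuation with $\delta\mathbf{c}^{\mathsf{T}}\mathbf{H}\,\delta\mathbf{c} < 0$, and the eigenvector belonging to the most negative eigenvalue indicates the direction in composition space along which the free energy decreases fastest. This is consistent with the abstract's observation that the alloys are unstable only to certain concentration modulations.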
-
Unsupervised Machine Translation On Dravidian Languages
Authors:
Sai Koneru,
Danni Liu,
Jan Niehues
Abstract:
Unsupervised neural machine translation (UNMT) is especially beneficial for low-resource languages such as those from the Dravidian family. However, UNMT systems tend to fail in realistic scenarios involving actual low-resource languages. Recent works propose to utilize auxiliary parallel data and have achieved state-of-the-art results. In this work, we focus on unsupervised translation between English and Kannada, a low-resource Dravidian language. We additionally utilize a limited amount of auxiliary data between English and other related Dravidian languages. We show that unifying the writing systems is essential in unsupervised translation between the Dravidian languages. We explore several model architectures that use the auxiliary data in order to maximize knowledge sharing and enable UNMT for distant language pairs. Our experiments demonstrate that it is crucial to include auxiliary languages that are similar to our focal language, Kannada. Furthermore, we propose a metric to measure language similarity and show that it serves as a good indicator for selecting the auxiliary languages.
Submitted 29 March, 2021;
originally announced March 2021.