Skip to main content

Showing 1–19 of 19 results for author: Koneru, S

.
  1. arXiv:2505.13036  [pdf, ps, other

    cs.CL cs.AI

    KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025

    Authors: Sai Koneru, Maike Züfle, Thai-Binh Nguyen, Seymanur Akti, Jan Niehues, Alexander Waibel

    Abstract: The scope of the International Workshop on Spoken Language Translation (IWSLT) has recently broadened beyond traditional Speech Translation (ST) to encompass a wider array of tasks, including Speech Question Answering and Summarization. This shift is partly driven by the growing capabilities of modern systems, particularly with the success of Large Language Models (LLMs). In this paper, we present… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2502.08561  [pdf, ps, other

    cs.CL

    Quality-Aware Decoding: Unifying Quality Estimation and Decoding

    Authors: Sai Koneru, Matthias Huck, Miriam Exel, Jan Niehues

    Abstract: Quality Estimation (QE) models for Neural Machine Translation (NMT) predict the quality of the hypothesis without having access to the reference. An emerging research direction in NMT involves the use of QE models, which have demonstrated high correlations with human judgment and can enhance translations through Quality-Aware Decoding. Although several approaches have been proposed based on sampli… ▽ More

    Submitted 1 June, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

    Comments: IWSLT 2025

  3. arXiv:2410.19250  [pdf, other

    cs.CL

    Have LLMs Reopened the Pandora's Box of AI-Generated Fake News?

    Authors: Xinyu Wang, Wenbo Zhang, Sai Koneru, Hangzhi Guo, Bonam Mingole, S. Shyam Sundar, Sarah Rajtmajer, Amulya Yadav

    Abstract: With the rise of AI-generated content spewed at scale from large language models (LLMs), genuine concerns about the spread of fake news have intensified. The perceived ability of LLMs to produce convincing fake news at scale poses new challenges for both human and automated fake news detection systems. To address this gap, this paper presents the findings from a university-level competition that a… ▽ More

    Submitted 29 March, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

  4. arXiv:2408.11327  [pdf, other

    cs.CL cs.AI

    Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking Across Diverse Vocabularies

    Authors: Sai Koneru, Matthias Huck, Miriam Exel, Jan Niehues

    Abstract: Recent advancements in NLP have resulted in models with specialized strengths, such as processing multimodal inputs or excelling in specific domains. However, real-world tasks, like multimodal translation, often require a combination of these strengths, such as handling both translation and image processing. While individual translation and vision models are powerful, they typically lack the abili… ▽ More

    Submitted 4 November, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: WMT 2024

  5. arXiv:2407.12826  [pdf, other

    cs.CL cs.AI

    Assessing the Effectiveness of GPT-4o in Climate Change Evidence Synthesis and Systematic Assessments: Preliminary Insights

    Authors: Elphin Tom Joe, Sai Dileep Koneru, Christine J Kirchhoff

    Abstract: In this research short, we examine the potential of using GPT-4o, a state-of-the-art large language model (LLM) to undertake evidence synthesis and systematic assessment tasks. Traditional workflows for such tasks involve large groups of domain experts who manually review and synthesize vast amounts of literature. The exponential growth of scientific literature and recent advances in LLMs provide… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  6. arXiv:2406.16777  [pdf, other

    cs.CL cs.AI

    Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024

    Authors: Sai Koneru, Thai-Binh Nguyen, Ngoc-Quan Pham, Danni Liu, Zhaolin Li, Alexander Waibel, Jan Niehues

    Abstract: Large Language Models (LLMs) are currently under exploration for various tasks, including Automatic Speech Recognition (ASR), Machine Translation (MT), and even End-to-End Speech Translation (ST). In this paper, we present KIT's offline submission in the constrained + LLM track by incorporating recently proposed techniques that can be added to any cascaded speech translation. Specifically, we inte… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  7. arXiv:2406.04005  [pdf, other

    cs.SI

    The Failed Migration of Academic Twitter

    Authors: Xinyu Wang, Sai Koneru, Sarah Rajtmajer

    Abstract: Following changes in Twitter's ownership and subsequent changes to content moderation policies, many in academia looked to move their discourse elsewhere and migration to Mastodon was pursued by some. Our study looks at the dynamics of this migration. Utilizing publicly available user account data, we track the posting activity of academics on Mastodon over a one year period. We also gathered foll… ▽ More

    Submitted 23 October, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  8. arXiv:2405.11030  [pdf, other

    cs.CL

    The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content

    Authors: Xinyu Wang, Sai Koneru, Pranav Narayanan Venkit, Brett Frischmann, Sarah Rajtmajer

    Abstract: As social media has become a predominant mode of communication globally, the rise of abusive content threatens to undermine civil discourse. Recognizing the critical nature of this issue, a significant body of research has been dedicated to developing language models that can detect various types of online abuse, e.g., hate speech, cyberbullying. However, there exists a notable disconnect between… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  9. arXiv:2402.08796  [pdf, other

    cs.HC

    Reproducibility, Replicability, and Transparency in Research: What 430 Professors Think in Universities across the USA and India

    Authors: Tatiana Chakravorti, Sai Dileep Koneru, Sarah Rajtmajer

    Abstract: In the past decade, open science and science of science communities have initiated innovative efforts to address concerns about the reproducibility and replicability of published scientific research. In some respects, these efforts have been successful, yet there are still many pockets of researchers with little to no familiarity with these concerns, subsequent responses, or best practices for eng… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  10. arXiv:2310.19158  [pdf, other

    cs.HC

    Perspectives from India: Opportunities and Challenges for AI Replication Prediction to Improve Confidence in Published Research

    Authors: Tatiana Chakravorti, Chuhao Wu, Sai Koneru, Sarah Rajtmajer

    Abstract: Over the past decade, a crisis of confidence in scientific literature has gained attention, particularly in the West. In response, we have seen changes in policy and practice amongst individual researchers and institutions. Greater attention is given to the transparency of workflows and the appropriate use of statistical methods. Advances in scholarly big data and machine learning have led to the… ▽ More

    Submitted 15 September, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

  11. arXiv:2310.14855  [pdf, other

    cs.CL cs.AI

    Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing

    Authors: Sai Koneru, Miriam Exel, Matthias Huck, Jan Niehues

    Abstract: Large Language Models (LLM's) have demonstrated considerable success in various Natural Language Processing tasks, but they have yet to attain state-of-the-art performance in Neural Machine Translation (NMT). Nevertheless, their significant performance in tasks demanding a broad understanding and contextual processing shows their potential for translation. To exploit these abilities, we investigat… ▽ More

    Submitted 18 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: NAACL 2024

  12. arXiv:2309.06578  [pdf, other

    cs.CL cs.AI

    Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences

    Authors: Sai Koneru, Jian Wu, Sarah Rajtmajer

    Abstract: Hypothesis formulation and testing are central to empirical research. A strong hypothesis is a best guess based on existing evidence and informed by a comprehensive view of relevant literature. However, with exponential increase in the number of scientific articles published annually, manual aggregation and synthesis of evidence related to a given hypothesis is a challenge. Our work explores the a… ▽ More

    Submitted 25 March, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

  13. arXiv:2308.03415  [pdf, other

    cs.CL cs.AI

    End-to-End Evaluation for Low-Latency Simultaneous Speech Translation

    Authors: Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Yavuz Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel

    Abstract: The challenge of low-latency speech translation has recently draw significant interest in the research community as shown by several publications and shared tasks. Therefore, it is essential to evaluate these different approaches in realistic scenarios. However, currently only specific aspects of the systems are evaluated and often it is not possible to compare different approaches. In this work… ▽ More

    Submitted 17 July, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Demo paper at EMNLP 2023

  14. arXiv:2306.05320  [pdf, other

    cs.CL cs.SD

    KIT's Multilingual Speech Translation System for IWSLT 2023

    Authors: Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

    Abstract: Many existing speech translation benchmarks focus on native-English speech in high-quality recording conditions, which often do not match the conditions in real-life use-cases. In this paper, we describe our speech translation system for the multilingual track of IWSLT 2023, which evaluates translation quality on scientific conference talks. The test condition features accented input speech and te… ▽ More

    Submitted 12 July, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: IWSLT 2023

  15. arXiv:2303.15370  [pdf

    cond-mat.mtrl-sci

    Microstructural engineering by heat treatments of multi-principal element alloys via spinodal mediated phase transformation pathways

    Authors: Shalini Roy Koneru, Kamalnath Kadirvel, Hamish Fraser, Yunzhi Wang

    Abstract: Nanoscale multi-phase microstructures observed in multi-principal element alloys (MPEAs) such as $\rm AlMo_{0.5}NbTa_{0.5}TiZr$, $\rm Al_{0.5}NbTa_{0.8}Ti_{1.5}V_{0.2}Zr$, $\rm TiZrNbTa$, $\rm AlCoCrFeNi$ and $\rm Fe_{15}Co_{15}Ni_{20}Mn_{20}Cu_{30}$ that exhibit promising mechanical or functional properties may have evolved through spinodal-mediated phase transformation pathways (PTPs). The micro… ▽ More

    Submitted 10 April, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: Preprint submitted to Acta Materialia, 31 pages, 11 figures

  16. arXiv:2202.12913  [pdf, other

    cs.DL

    The evolution of scientific literature as metastable knowledge states

    Authors: Sai Dileep Koneru, David Rench McCauley, Michael C. Smith, David Guarrera, Jenn Robinson, Sarah Rajtmajer

    Abstract: The problem of identifying common concepts in the sciences and deciding when new ideas have emerged is an open one. Metascience researchers have sought to formalize principles underlying stages in the life-cycle of scientific research, determine how knowledge is transferred between scientists and stakeholders, and understand how new ideas are generated and take hold. Here, we model the state of sc… ▽ More

    Submitted 11 September, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

  17. arXiv:2201.05700  [pdf, other

    cs.CL cs.AI

    Cost-Effective Training in Low-Resource Neural Machine Translation

    Authors: Sai Koneru, Danni Liu, Jan Niehues

    Abstract: While Active Learning (AL) techniques are explored in Neural Machine Translation (NMT), only a few works focus on tackling low annotation budgets where a limited number of sentences can get translated. Such situations are especially challenging and can occur for endangered languages with few human annotators or having cost constraints to label large amounts of data. Although AL is shown to be help… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

  18. arXiv:2112.12882  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Exploration of Spinodal Decomposition in Multi-Principal Element Alloys (MPEAs) using CALPHAD Modeling

    Authors: Kamalnath Kadirvel, Shalini Roy Koneru, Yunzhi Wang

    Abstract: Researchers attributed the orderly arranged nanoscale phases observed in many multi-principal element alloys (MPEAs) to spinodal/spinodal-mediated phase transformation pathways. However, spinodal decomposition is not well understood in multicomponent systems. Although the theoretical background is available, CALPHAD databases were not used to explore the miscibility gap in MPEAs. In this work, we… ▽ More

    Submitted 29 December, 2021; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: Submitted to Scripta Materialia, 5 figures, 2 tables

    Journal ref: Scripta Materialia 2022, 214, 114657

  19. arXiv:2103.15877  [pdf, other

    cs.CL cs.AI

    Unsupervised Machine Translation On Dravidian Languages

    Authors: Sai Koneru, Danni Liu, Jan Niehues

    Abstract: Unsupervised neural machine translation (UNMT) is beneficial especially for low resource languages such as those from the Dravidian family. However, UNMT systems tend to fail in realistic scenarios involving actual low resource languages. Recent works propose to utilize auxiliary parallel data and have achieved state-of-the-art results. In this work, we focus on unsupervised translation between En… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.