Showing 1–15 of 15 results for author: Yuksel, K A

  1. arXiv:2502.12755  [pdf, other]

    cs.CL cs.AI cs.HC

    Efficient Machine Translation Corpus Generation: Integrating Human-in-the-Loop Post-Editing with Large Language Models

    Authors: Kamer Ali Yuksel, Ahmet Gunduz, Abdul Baseet Anees, Hassan Sawaf

    Abstract: This paper introduces an advanced methodology for machine translation (MT) corpus generation, integrating semi-automated, human-in-the-loop post-editing with large language models (LLMs) to enhance efficiency and translation quality. Building upon previous work that utilized real-time training of a custom MT quality estimation metric, this system incorporates novel LLM features such as Enhanced Tr…

    Submitted 18 February, 2025; originally announced February 2025.

  2. arXiv:2502.12745  [pdf, other]

    cs.CL cs.AI cs.LG

    MediaMind: Revolutionizing Media Monitoring using Agentification

    Authors: Ahmet Gunduz, Kamer Ali Yuksel, Hassan Sawaf

    Abstract: In an era of rapid technological advancements, agentification of software tools has emerged as a critical innovation, enabling systems to function autonomously and adaptively. This paper introduces MediaMind as a case study to demonstrate the agentification process, highlighting how existing software can be transformed into intelligent agents capable of independent decision-making and dynamic inte…

    Submitted 18 February, 2025; originally announced February 2025.

  3. arXiv:2502.04315  [pdf, other]

    cs.CL cs.AI cs.LG

    ChameleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters

    Authors: Kamer Ali Yuksel, Hassan Sawaf

    Abstract: Recent advances in large language models (LLMs) have shown remarkable performance across diverse tasks. However, these models are typically deployed with fixed weights, which limits their ability to adapt dynamically to the variability inherent in real-world data during inference. This paper introduces ChameleonLLM, a novel framework that enables inference-time adaptation of LLMs by leveraging bat…

    Submitted 11 February, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

  4. arXiv:2502.00029  [pdf, other]

    q-fin.PM cs.AI cs.CL cs.NE q-fin.RM

    AlphaSharpe: LLM-Driven Discovery of Robust Risk-Adjusted Metrics

    Authors: Kamer Ali Yuksel, Hassan Sawaf

    Abstract: Financial metrics like the Sharpe ratio are pivotal in evaluating investment performance by balancing risk and return. However, traditional metrics often struggle with robustness and generalization, particularly in dynamic and volatile market conditions. This paper introduces AlphaSharpe, a novel framework leveraging large language models (LLMs) to iteratively evolve and optimize financial metrics…

    Submitted 4 February, 2025; v1 submitted 23 January, 2025; originally announced February 2025.

  5. arXiv:2412.17149  [pdf, other]

    cs.CL cs.AI cs.ET cs.MA cs.NE

    A Multi-AI Agent System for Autonomous Optimization of Agentic AI Solutions via Iterative Refinement and LLM-Driven Feedback Loops

    Authors: Kamer Ali Yuksel, Hassan Sawaf

    Abstract: Agentic AI systems use specialized agents to handle tasks within complex workflows, enabling automation and efficiency. However, optimizing these systems often requires labor-intensive, manual adjustments to refine roles, tasks, and interactions. This paper introduces a framework for autonomously optimizing Agentic AI solutions across industries, such as NLP-driven enterprise applications. The sys…

    Submitted 22 December, 2024; originally announced December 2024.

  6. arXiv:2409.12476  [pdf, other]

    cs.CL cs.SD eess.AS

    AutoMode-ASR: Learning to Select ASR Systems for Better Quality and Cost

    Authors: Ahmet Gündüz, Yunsu Kim, Kamer Ali Yuksel, Mohamed Al-Badrashiny, Thiago Castro Ferreira, Hassan Sawaf

    Abstract: We present AutoMode-ASR, a novel framework that effectively integrates multiple ASR systems to enhance the overall transcription quality while optimizing cost. The idea is to train a decision model to select the optimal ASR system for each segment based solely on the audio input before running the systems. We achieve this by ensembling binary classifiers determining the preference between two syst…

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: SPECOM 2024 Conference

  7. arXiv:2402.16380  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG

    An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation

    Authors: Ahmet Gunduz, Kamer Ali Yuksel, Kareem Darwish, Golara Javadi, Fabio Minazzi, Nicola Sobieski, Sebastien Bratieres

    Abstract: Data availability is crucial for advancing artificial intelligence applications, including voice-based technologies. As content creation, particularly in social media, experiences increasing demand, translation and text-to-speech (TTS) technologies have become essential tools. Notably, the performance of these TTS technologies is highly dependent on the quality of the training data, emphasizing th…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 9 Pages, 6 Figures, 4 Tables, LREC-COLING 2024

  8. arXiv:2401.11268  [pdf, other]

    cs.CL cs.SD eess.AS

    Word-Level ASR Quality Estimation for Efficient Corpus Sampling and Post-Editing through Analyzing Attentions of a Reference-Free Metric

    Authors: Golara Javadi, Kamer Ali Yuksel, Yunsu Kim, Thiago Castro Ferreira, Mohamed Al-Badrashiny

    Abstract: In the realm of automatic speech recognition (ASR), the quest for models that not only perform with high accuracy but also offer transparency in their decision-making processes is crucial. The potential of quality estimation (QE) metrics is introduced and evaluated as a novel tool to enhance explainable artificial intelligence (XAI) in ASR systems. Through experiments and analyses, the capabilitie…

    Submitted 2 February, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Journal ref: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), Seoul, Korea

  9. arXiv:2307.07811  [pdf, other]

    cs.NE q-fin.PM

    Generative Meta-Learning Robust Quality-Diversity Portfolio

    Authors: Kamer Ali Yuksel

    Abstract: This paper proposes a novel meta-learning approach to optimize a robust portfolio ensemble. The method uses a deep generative model to generate diverse and high-quality sub-portfolios combined to form the ensemble portfolio. The generative model consists of a convolutional layer, a stateful LSTM module, and a dense network. During training, the model takes a randomly sampled batch of Gaussian nois…

    Submitted 15 July, 2023; originally announced July 2023.

  10. arXiv:2306.13114  [pdf, other]

    cs.CL eess.AS

    A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision

    Authors: Kamer Ali Yuksel, Thiago Ferreira, Ahmet Gunduz, Mohamed Al-Badrashiny, Golara Javadi

    Abstract: The common standard for quality evaluation of automatic speech recognition (ASR) systems is reference-based metrics such as the Word Error Rate (WER), computed using manual ground-truth transcriptions that are time-consuming and expensive to obtain. This work proposes a multi-language referenceless quality metric, which allows comparing the performance of different ASR models on a speech dataset w…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2306.12577

  11. arXiv:2306.12577  [pdf, other]

    cs.CL cs.SD eess.AS

    NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning

    Authors: Kamer Ali Yuksel, Thiago Ferreira, Golara Javadi, Mohamed El-Badrashiny, Ahmet Gunduz

    Abstract: This paper introduces NoRefER, a novel referenceless quality metric for automatic speech recognition (ASR) systems. Traditional reference-based metrics for evaluating ASR systems require costly ground-truth transcripts. NoRefER overcomes this limitation by fine-tuning a multilingual language model for pair-wise ranking ASR hypotheses using contrastive learning with Siamese network architecture. Th…

    Submitted 21 June, 2023; originally announced June 2023.

  12. arXiv:2306.11838  [pdf, other]

    cs.CL

    Efficient Machine Translation Corpus Generation

    Authors: Kamer Ali Yuksel, Ahmet Gunduz, Shreyas Sharma, Hassan Sawaf

    Abstract: This paper proposes an efficient and semi-automated method for human-in-the-loop post-editing for machine translation (MT) corpus generation. The method is based on online training of a custom MT quality estimation metric on-the-fly as linguists perform post-edits. The online estimator is used to prioritize worse hypotheses for post-editing, and auto-close best hypotheses without post-editing. Thi…

    Submitted 20 June, 2023; originally announced June 2023.

  13. arXiv:2306.11823  [pdf, other]

    cs.CL

    EvolveMT: an Ensemble MT Engine Improving Itself with Usage Only

    Authors: Kamer Ali Yuksel, Ahmet Gunduz, Mohamed Al-Badrashiny, Shreyas Sharma, Hassan Sawaf

    Abstract: This paper presents EvolveMT for efficiently combining multiple machine translation (MT) engines. The proposed system selects the output from a single engine for each segment by utilizing online learning techniques to predict the most suitable system for every translation request. A neural quality estimation metric supervises the method without requiring reference translations. The online learning…

    Submitted 20 June, 2023; originally announced June 2023.

  14. arXiv:1911.06913  [pdf, other]

    stat.AP cs.LG eess.IV

    Granular Motor State Monitoring of Free Living Parkinson's Disease Patients via Deep Learning

    Authors: Kamer A. Yuksel, Jann Goschenhofer, Hridya V. Varma, Urban Fietzek, Franz M. J. Pfister

    Abstract: Parkinson's disease (PD) is the second most common neurodegenerative disease worldwide and affects around 1% of the (60+ years old) elderly population in industrial nations. More than 80% of PD patients suffer from motor symptoms, which could be well addressed if a personalized medication schedule and dosage could be administered to them. However, such personalized medication schedule requires a c…

    Submitted 11 December, 2019; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 -- Extended Abstract

  15. arXiv:1904.10829  [pdf, other]

    cs.LG stat.AP stat.ML

    Wearable-based Parkinson's Disease Severity Monitoring using Deep Learning

    Authors: Jann Goschenhofer, Franz MJ Pfister, Kamer Ali Yuksel, Bernd Bischl, Urban Fietzek, Janek Thomas

    Abstract: One major challenge in the medication of Parkinson's disease is that the severity of the disease, reflected in the patients' motor state, cannot be measured using accessible biomarkers. Therefore, we develop and examine a variety of statistical models to detect the motor state of such patients based on sensor data from a wearable device. We find that deep learning models consistently outperform a…

    Submitted 24 April, 2019; originally announced April 2019.