Skip to main content

Showing 1–7 of 7 results for author: Tur, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.04957  [pdf, other

    cs.LG cs.AI cs.CL

    SafeArena: Evaluating the Safety of Autonomous Web Agents

    Authors: Ada Defne Tur, Nicholas Meade, Xing Han Lù, Alejandra Zambrano, Arkil Patel, Esin Durmus, Spandana Gella, Karolina Stańczak, Siva Reddy

    Abstract: LLM-based agents are becoming increasingly proficient at solving web-based tasks. With this capability comes a greater risk of misuse for malicious purposes, such as posting misinformation in an online forum or selling illicit substances on a website. To evaluate these risks, we propose SafeArena, the first benchmark to focus on the deliberate misuse of web agents. SafeArena comprises 250 safe and… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  2. arXiv:2502.05670  [pdf, other

    cs.CL cs.AI

    Language Models Largely Exhibit Human-like Constituent Ordering Preferences

    Authors: Ada Defne Tur, Gaurav Kamath, Siva Reddy

    Abstract: Though English sentences are typically inflexible vis-à-vis word order, constituents often show far more variability in ordering. One prominent theory presents the notion that constituent ordering is directly correlated with constituent weight: a measure of the constituent's length or complexity. Such theories are interesting in the context of natural language processing (NLP), because while recen… ▽ More

    Submitted 14 February, 2025; v1 submitted 8 February, 2025; originally announced February 2025.

    Comments: NAACL 2025 Main Conference

  3. arXiv:2409.00217  [pdf, other

    cs.CL cs.SD eess.AS

    ProGRes: Prompted Generative Rescoring on ASR n-Best

    Authors: Ada Defne Tur, Adel Moumen, Mirco Ravanelli

    Abstract: Large Language Models (LLMs) have shown their ability to improve the performance of speech recognizers by effectively rescoring the n-best hypotheses generated during the beam search process. However, the best way to exploit recent generative instruction-tuned LLMs for hypothesis rescoring is still unclear. This paper proposes a novel method that uses instruction-tuned LLMs to dynamically expand t… ▽ More

    Submitted 8 September, 2024; v1 submitted 30 August, 2024; originally announced September 2024.

    Comments: IEEE Spoken Language Technology Workshop

  4. arXiv:2303.01586  [pdf, other

    cs.HC cs.AI cs.RO

    Alexa Arena: A User-Centric Interactive Platform for Embodied AI

    Authors: Qiaozi Gao, Govind Thattai, Suhaila Shakiah, Xiaofeng Gao, Shreyas Pansare, Vasu Sharma, Gaurav Sukhatme, Hangjie Shi, Bofei Yang, Desheng Zheng, Lucy Hu, Karthika Arumugam, Shui Hu, Matthew Wen, Dinakar Guthy, Cadence Chung, Rohan Khanna, Osman Ipek, Leslie Ball, Kate Bland, Heather Rocker, Yadunandana Rao, Michael Johnston, Reza Ghanadan, Arindam Mandal , et al. (2 additional authors not shown)

    Abstract: We introduce Alexa Arena, a user-centric simulation platform for Embodied AI (EAI) research. Alexa Arena provides a variety of multi-room layouts and interactable objects, for the creation of human-robot interaction (HRI) missions. With user-friendly graphics and control mechanisms, Alexa Arena supports the development of gamified robotic tasks readily accessible to general human users, thus openi… ▽ More

    Submitted 7 June, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  5. arXiv:2103.14580  [pdf, other

    cs.CL

    Correcting Automated and Manual Speech Transcription Errors using Warped Language Models

    Authors: Mahdi Namazifar, John Malik, Li Erran Li, Gokhan Tur, Dilek Hakkani Tür

    Abstract: Masked language models have revolutionized natural language processing systems in the past few years. A recently introduced generalization of masked language models called warped language models are trained to be more robust to the types of errors that appear in automatic or manual transcriptions of spoken language by exposing the language model to the same types of errors during training. In this… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: Submitted to INTERSPEECH

  6. arXiv:2011.01900  [pdf, other

    cs.CL cs.AI

    Warped Language Models for Noise Robust Language Understanding

    Authors: Mahdi Namazifar, Gokhan Tur, Dilek Hakkani Tür

    Abstract: Masked Language Models (MLM) are self-supervised neural networks trained to fill in the blanks in a given sentence with masked tokens. Despite the tremendous success of MLMs for various text based tasks, they are not robust for spoken language understanding, especially for spontaneous conversational speech recognition noise. In this work we introduce Warped Language Models (WLM) in which input sen… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: To appear at IEEE SLT 2021

  7. arXiv:2002.12616  [pdf, ps, other

    cs.CY

    Mobile Phone Usage Data for Credit Scoring

    Authors: Henri Ots, Innar Liiv, Diana Tur

    Abstract: The aim of this study is to demostrate that mobile phone usage data can be used to make predictions and find the best classification method for credit scoring even if the dataset is small (2,503 customers). We use different classification algorithms to split customers into paying and non-paying ones using mobile data, and then compare the predicted results with actual results. There are several re… ▽ More

    Submitted 28 February, 2020; originally announced February 2020.

    Comments: 14 pages, submitted to DB&IS 2020