Showing 1–9 of 9 results for author: Akyürek, A F

Searching in archive cs.
  1. arXiv:2505.13257  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    WikiPersonas: What Can We Learn From Personalized Alignment to Famous People?

    Authors: Zilu Tang, Afra Feyza Akyürek, Ekin Akyürek, Derry Wijaya

    Abstract: Preference alignment has become a standard pipeline in finetuning models to follow generic human preferences. Majority of work seeks to optimize model to produce responses that would be preferable on average, simplifying the diverse and often contradicting space of human preferences. While research has increasingly focused on personalized alignment: adapting models to individu…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 9 pages, preprint

  2. arXiv:2401.08574  [pdf, other]

    cs.CL

    Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability

    Authors: Afra Feyza Akyürek, Ekin Akyürek, Leshem Choshen, Derry Wijaya, Jacob Andreas

    Abstract: While language models (LMs) can sometimes generate factually correct text and estimate truth values of individual claims, these generally do not reflect a globally coherent, manipulable model of the world. As a consequence, current LMs also generate incorrect or nonsensical content, and are difficult to edit and bring up to date. We present a method called Deductive Closure Training (DCT) that use…

    Submitted 26 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: ACL Findings

  3. arXiv:2311.16087  [pdf, other]

    cs.CL

    DUnE: Dataset for Unified Editing

    Authors: Afra Feyza Akyürek, Eric Pan, Garry Kuwanto, Derry Wijaya

    Abstract: Even the most advanced language models remain susceptible to errors necessitating to modify these models without initiating a comprehensive retraining process. Model editing refers to the modification of a model's knowledge or representations in a manner that produces the desired outcomes. Prior research primarily centered around editing factual data e.g. "Messi plays for Inter Miami" confining th…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023

  4. arXiv:2305.08844  [pdf, other]

    cs.CL

    RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs

    Authors: Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon

    Abstract: Despite their unprecedented success, even the largest language models make mistakes. Similar to how humans learn and improve using feedback, previous work proposed providing language models with natural language feedback to guide them in repairing their outputs. Because human-generated critiques are expensive to obtain, researchers have devised learned critique generators in lieu of human critics…

    Submitted 11 July, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  5. arXiv:2205.11605  [pdf, other]

    cs.CL cs.CY

    On Measuring Social Biases in Prompt-Based Multi-Task Learning

    Authors: Afra Feyza Akyürek, Sejin Paik, Muhammed Yusuf Kocyigit, Seda Akbiyik, Şerife Leman Runyun, Derry Wijaya

    Abstract: Large language models trained on a mixture of NLP tasks that are converted into a text-to-text format using prompts, can generalize into novel forms of language and handle novel tasks. A large body of work within prompt engineering attempts to understand the effects of input forms and prompts in achieving superior performance. We consider an alternative measure and inquire whether the way in which…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Findings of NAACL 2022

  6. arXiv:2205.11601  [pdf, other]

    cs.CL cs.CY

    Challenges in Measuring Bias via Open-Ended Language Generation

    Authors: Afra Feyza Akyürek, Muhammed Yusuf Kocyigit, Sejin Paik, Derry Wijaya

    Abstract: Researchers have devised numerous ways to quantify social biases vested in pretrained language models. As some language models are capable of generating coherent completions given a set of textual prompts, several prompting datasets have been proposed to measure biases between social groups -- posing language generation as a way of identifying biases. In this opinion paper, we analyze how specific…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 4th Workshop on Gender Bias in Natural Language Processing. NAACL, 2022

  7. arXiv:2110.07059  [pdf, other]

    cs.CV cs.LG

    Subspace Regularizers for Few-Shot Class Incremental Learning

    Authors: Afra Feyza Akyürek, Ekin Akyürek, Derry Tanti Wijaya, Jacob Andreas

    Abstract: Few-shot class incremental learning -- the problem of updating a trained classifier to discriminate among an expanded set of classes with limited labeled data -- is a key challenge for machine learning systems deployed in non-stationary environments. Existing approaches to the problem rely on complex model architectures and training procedures that are difficult to tune and re-use. In this paper,…

    Submitted 20 February, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: ICLR 2022. Code is available through https://github.com/feyzaakyurek/subspace-reg

  8. arXiv:2103.13272  [pdf, other]

    cs.CL

    Low-Resource Machine Translation Training Curriculum Fit for Low-Resource Languages

    Authors: Garry Kuwanto, Afra Feyza Akyürek, Isidora Chara Tourni, Siyang Li, Alexander Gregory Jones, Derry Wijaya

    Abstract: We conduct an empirical study of neural machine translation (NMT) for truly low-resource languages, and propose a training curriculum fit for cases when both parallel training data and compute resource are lacking, reflecting the reality of most of the world's languages and the researchers working on these languages. Previously, unsupervised NMT, which employs back-translation (BT) and auto-encodi…

    Submitted 29 November, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

  9. arXiv:2010.03706  [pdf, other]

    cs.CL cs.LG

    Learning to Recombine and Resample Data for Compositional Generalization

    Authors: Ekin Akyürek, Afra Feyza Akyürek, Jacob Andreas

    Abstract: Flexible neural sequence models outperform grammar- and automaton-based counterparts on a variety of tasks. However, neural models perform poorly in settings requiring compositional generalization beyond the training data -- particularly to rare or unseen subsequences. Past work has found symbolic scaffolding (e.g. grammars or automata) essential in these settings. We describe R&R, a learned data…

    Submitted 7 June, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: ICLR 2021