Skip to main content

Showing 1–5 of 5 results for author: Kara, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.23189  [pdf, ps, other

    cs.CV

    Trident: Detecting Face Forgeries with Adversarial Triplet Learning

    Authors: Mustafa Hakan Kara, Aysegul Dundar, Uğur Güdükbay

    Abstract: As face forgeries generated by deep neural networks become increasingly sophisticated, detecting face manipulations in digital media has posed a significant challenge, underscoring the importance of maintaining digital media integrity and combating visual disinformation. Current detection models, predominantly based on supervised training with domain-specific data, often falter against forgeries g… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

    Comments: 11 pages, 3 figures, and 7 tables

  2. arXiv:2506.15329  [pdf, ps, other

    cs.LG cs.AI cs.CL math.OC

    When and How Unlabeled Data Provably Improve In-Context Learning

    Authors: Yingcong Li, Xiangyu Chang, Muti Kara, Xiaofeng Liu, Amit Roy-Chowdhury, Samet Oymak

    Abstract: Recent research shows that in-context learning (ICL) can be effective even when demonstrations have missing or incorrect labels. To shed light on this capability, we examine a canonical setting where the demonstrations are drawn according to a binary Gaussian mixture model (GMM) and a certain fraction of the demonstrations have missing labels. We provide a comprehensive theoretical study to show t… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  3. arXiv:2504.15929  [pdf, ps, other

    cs.CV cs.AI

    Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models

    Authors: Saban Ozturk, Melih B. Yilmaz, Muti Kara, M. Talat Yavuz, Aykut Koç, Tolga Çukur

    Abstract: Diagnostic imaging relies on interpreting both images and radiology reports, but the growing data volumes place significant pressure on medical experts, yielding increased errors and workflow backlogs. Medical vision-language models (med-VLMs) have emerged as a powerful framework to efficiently process multimodal imaging data, particularly in chest X-ray (CXR) evaluations, albeit their performance… ▽ More

    Submitted 23 April, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

    Comments: 18 pages, 7 figures, 6 tables

  4. arXiv:2503.02102  [pdf, other

    cs.CL cs.AI

    Provable Benefits of Task-Specific Prompts for In-context Learning

    Authors: Xiangyu Chang, Yingcong Li, Muti Kara, Samet Oymak, Amit K. Roy-Chowdhury

    Abstract: The in-context learning capabilities of modern language models have motivated a deeper mathematical understanding of sequence models. A line of recent work has shown that linear attention models can emulate projected gradient descent iterations to implicitly learn the task vector from the data provided in the context window. In this work, we consider a novel setting where the global task distribut… ▽ More

    Submitted 5 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025

  5. arXiv:2304.12326  [pdf, other

    cs.DL physics.soc-ph

    Proposal for a distributed, community-driven academic publishing system

    Authors: Matteo Barbone, Mustafa Gündoğan, Dhiren M. Kara, Benjamin Pingault, Alejandro Rodriguez-Pardo Montblanch, Lucio Stefan, Anthony K. C. Tan

    Abstract: We propose an academic publishing system where research papers are stored in a network of data centres owned by university libraries and research institutions, and are interfaced with the academic community through a website. In our system, the editor is replaced by an initial adjusted community-wide evaluation, the standard peer-review is accompanied by a post-publication open-ended and community… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.