Showing 1–11 of 11 results for author: Seyfioglu, M S

  1. arXiv:2502.08916

    cs.CV cs.AI cs.CL cs.MA

    PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology

    Authors: Fatemeh Ghezloo, Mehmet Saygin Seyfioglu, Rustin Soraki, Wisdom O. Ikezogwo, Beibin Li, Tejoram Vivekanandan, Joann G. Elmore, Ranjay Krishna, Linda Shapiro

    Abstract: Diagnosing diseases through histopathology whole slide images (WSIs) is fundamental in modern pathology but is challenged by the gigapixel scale and complexity of WSIs. Trained histopathologists overcome this challenge by navigating the WSI, looking for relevant patches, taking notes, and compiling them to produce a final holistic diagnostic. Traditional AI approaches, such as multiple instance le…

    Submitted 12 February, 2025; originally announced February 2025.

  2. arXiv:2501.04184

    cs.CV

    MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives

    Authors: Wisdom O. Ikezogwo, Kevin Zhang, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Linda Shapiro, Ranjay Krishna

    Abstract: We propose MedicalNarratives, a dataset curated from medical pedagogical videos similar in nature to data collected in Think-Aloud studies and inspired by Localized Narratives, which collects grounded image-text data by curating instructors' speech and mouse cursor movements synchronized in time. MedicalNarratives enables pretraining of both semantic and dense objectives, alleviating the need to t…

    Submitted 12 January, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

  3. arXiv:2401.13795

    cs.CV

    Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

    Authors: Mehmet Saygin Seyfioglu, Karim Bouyarmane, Suren Kumar, Amir Tavanaei, Ismail B. Tutar

    Abstract: As online shopping is growing, the ability for buyers to virtually visualize products in their settings (a phenomenon we define as "Virtual Try-All") has become crucial. Recent diffusion models inherently contain a world model, rendering them suitable for this task within an inpainting context. However, traditional image-conditioned diffusion models often fail to capture the fine-grained details of…

    Submitted 24 January, 2024; originally announced January 2024.

  4. arXiv:2312.04746

    cs.CV cs.AI cs.CL

    Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos

    Authors: Mehmet Saygin Seyfioglu, Wisdom O. Ikezogwo, Fatemeh Ghezloo, Ranjay Krishna, Linda Shapiro

    Abstract: Diagnosis in histopathology requires a global analysis of whole slide images (WSIs), requiring pathologists to compound evidence from different WSI patches. The gigapixel scale of WSIs poses a challenge for histopathology multi-modal models. Training multi-modal models for histopathology requires instruction tuning datasets, which currently contain information for individual image patches, without a…

    Submitted 13 January, 2025; v1 submitted 7 December, 2023; originally announced December 2023.

  5. arXiv:2306.11207

    cs.CV cs.CL cs.LG

    Quilt-1M: One Million Image-Text Pairs for Histopathology

    Authors: Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Dylan Stefan Chan Geva, Fatwir Sheikh Mohammed, Pavan Kumar Anand, Ranjay Krishna, Linda Shapiro

    Abstract: Recent accelerations in multi-modal applications have been made possible with the plethora of image and text data available online. However, the scarcity of analogous data in the medical field, specifically in histopathology, has slowed comparable progress. To enable similar representation learning for histopathology, we turn to YouTube, an untapped resource of videos, offering $1,087$ hours of va…

    Submitted 13 January, 2025; v1 submitted 19 June, 2023; originally announced June 2023.

  6. arXiv:2305.01257

    cs.CV cs.AI

    DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling

    Authors: Mehmet Saygin Seyfioglu, Karim Bouyarmane, Suren Kumar, Amir Tavanaei, Ismail B. Tutar

    Abstract: We introduce DreamPaint, a framework to intelligently inpaint any e-commerce product on any user-provided context image. The context image can be, for example, the user's own image for virtual try-on of clothes from the e-commerce catalog on themselves, the user's room image for virtual try-on of a piece of furniture from the e-commerce catalog in their room, etc. As opposed to previous augmented-…

    Submitted 2 May, 2023; originally announced May 2023.

  7. arXiv:2209.01534

    cs.CV cs.LG

    Multi-modal Masked Autoencoders Learn Compositional Histopathological Representations

    Authors: Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Linda Shapiro

    Abstract: Self-supervised learning (SSL) enables learning useful inductive biases through utilizing pretext tasks that require no labels. The unlabeled nature of SSL makes it especially important for whole slide histopathological images (WSIs), where patch-level human annotation is difficult. Masked Autoencoders (MAE) is a recent SSL method suitable for digital pathology as it does not require negative samp…

    Submitted 14 November, 2022; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 10 pages

  8. arXiv:2207.04574

    cs.CV cs.LG

    Brain-Aware Replacements for Supervised Contrastive Learning in Detection of Alzheimer's Disease

    Authors: Mehmet Saygın Seyfioğlu, Zixuan Liu, Pranav Kamath, Sadjyot Gangolli, Sheng Wang, Thomas Grabowski, Linda Shapiro

    Abstract: We propose a novel framework for Alzheimer's disease (AD) detection using brain MRIs. The framework starts with a data augmentation method called Brain-Aware Replacements (BAR), which leverages a standard brain parcellation to replace medically-relevant 3D brain regions in an anchor MRI from a randomly picked MRI to create synthetic samples. Ground truth "hard" labels are also linearly mixed depen…

    Submitted 20 July, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

    Journal ref: MICCAI 2022

  9. arXiv:2109.12178

    cs.CV cs.AI cs.LG

    MLIM: Vision-and-Language Model Pre-training with Masked Language and Image Modeling

    Authors: Tarik Arici, Mehmet Saygin Seyfioglu, Tal Neiman, Yi Xu, Son Train, Trishul Chilimbi, Belinda Zeng, Ismail Tutar

    Abstract: Vision-and-Language Pre-training (VLP) improves model performance for downstream tasks that require image and text inputs. Current VLP approaches differ on (i) model architecture (especially image embedders), (ii) loss functions, and (iii) masking policies. Image embedders are either deep models like ResNet or linear projections that directly feed image-pixels into the transformer. Typically, in a…

    Submitted 24 September, 2021; originally announced September 2021.

  10. arXiv:2107.13869

    cs.NI eess.SY

    Autonomous UAV Base Stations for Next Generation Wireless Networks: A Deep Learning Approach

    Authors: Ali Murat Demirtas, Mehmet Saygin Seyfioglu, Irem Bor-Yaliniz, Bulent Tavli, Halim Yanikomeroglu

    Abstract: To address the ever-growing connectivity demands of wireless communications, the adoption of ingenious solutions, such as Unmanned Aerial Vehicles (UAVs) as mobile Base Stations (BSs), is imperative. In general, the location of a UAV Base Station (UAV-BS) is determined by optimization algorithms, which have high computational complexity and place heavy demands on UAV resources. In this paper,…

    Submitted 11 April, 2022; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: 7 pages, 6 figures

  11. arXiv:1904.05054

    cs.CL cs.CR cs.LG

    Detecting Cybersecurity Events from Noisy Short Text

    Authors: Semih Yagcioglu, Mehmet Saygin Seyfioglu, Begum Citamak, Batuhan Bardak, Seren Guldamlasioglu, Azmi Yuksel, Emin Islam Tatli

    Abstract: It is very critical to analyze messages shared over social networks for cyber threat intelligence and cyber-crime prevention. In this study, we propose a method that leverages both domain-specific word embeddings and task-specific features to detect cyber security events from tweets. Our model employs a convolutional neural network (CNN) and a long short-term memory (LSTM) recurrent neural network…

    Submitted 2 June, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted February 2019 to North American Chapter of the Association for Computational Linguistics (NAACL) 2019