Showing 1–6 of 6 results for author: Yang, O

  1. arXiv:2409.00084  [pdf]

    cs.CL cs.AI

    Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models

    Authors: Seyed Amir Ahmad Safavi-Naini, Shuhaib Ali, Omer Shahab, Zahra Shahhoseini, Thomas Savage, Sara Rafiee, Jamil S Samaan, Reem Al Shabeeb, Farah Ladak, Jamie O Yang, Juan Echavarria, Sumbal Babar, Aasma Shaukat, Samuel Margolis, Nicholas P Tatonetti, Girish Nadkarni, Bara El Kurdi, Ali Soroush

    Abstract: Background and Aims: This study evaluates the medical reasoning performance of large language models (LLMs) and vision language models (VLMs) in gastroenterology. Methods: We used 300 gastroenterology board exam-style multiple-choice questions, 138 of which contain images to systematically assess the impact of model configurations and parameters and prompt engineering strategies utilizing GPT-3.…

    Submitted 4 September, 2024; v1 submitted 25 August, 2024; originally announced September 2024.

    Comments: Manuscript Pages: 34, Figures: 7, Tables: 2, Supplementary File Pages: 35, Data Transparency Statement: Code is available at: https://github.com/Sdamirsa/LLM-VLM-in-Gastroenterology . Study data from American College of Gastroenterology (ACG) are restricted and available upon request with ACG permission. Correction: updated abstract considering Llama3.1 results

    MSC Class: 92C50; 68T50 ACM Class: J.3

  2. arXiv:2309.09546  [pdf, other]

    eess.AS cs.CL cs.SD

    Training dynamic models using early exits for automatic speech recognition on resource-constrained devices

    Authors: George August Wright, Umberto Cappellazzo, Salah Zaiem, Desh Raj, Lucas Ondel Yang, Daniele Falavigna, Mohamed Nabih Ali, Alessio Brutti

    Abstract: The ability to dynamically adjust the computational load of neural models during inference is crucial for on-device processing scenarios characterised by limited and time-varying computational resources. A promising solution is presented by early-exit architectures, in which additional exit branches are appended to intermediate layers of the encoder. In self-attention models for automatic speech r…

    Submitted 22 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted at the ICASSP Workshop Self-supervision in Audio, Speech and Beyond 2024

  3. arXiv:2309.05032  [pdf, other]

    cs.CV

    Unified Contrastive Fusion Transformer for Multimodal Human Action Recognition

    Authors: Kyoung Ok Yang, Junho Koh, Jun Won Choi

    Abstract: Various types of sensors have been considered to develop human action recognition (HAR) models. Robust HAR performance can be achieved by fusing multimodal data acquired by different sensors. In this paper, we introduce a new multimodal fusion architecture, referred to as Unified Contrastive Fusion Transformer (UCFFormer) designed to integrate data with diverse distributions to enhance HAR perform…

    Submitted 10 September, 2023; originally announced September 2023.

  4. arXiv:2212.13014  [pdf, other]

    cs.LG cs.CY

    Bias Mitigation Framework for Intersectional Subgroups in Neural Networks

    Authors: Narine Kokhlikyan, Bilal Alsallakh, Fulton Wang, Vivek Miglani, Oliver Aobo Yang, David Adkins

    Abstract: We propose a fairness-aware learning framework that mitigates intersectional subgroup bias associated with protected attributes. Prior research has primarily focused on mitigating one kind of bias by incorporating complex fairness-driven constraints into optimization objectives or designing additional layers that focus on specific protected attributes. We introduce a simple and generic bias mitiga…

    Submitted 25 December, 2022; originally announced December 2022.

  5. arXiv:2203.06690  [pdf, other]

    cs.LG cs.AI

    Algebraic Learning: Towards Interpretable Information Modeling

    Authors: Tong Owen Yang

    Abstract: Along with the proliferation of digital data collected using sensor technologies and a boost of computing power, Deep Learning (DL) based approaches have drawn enormous attention in the past decade due to their impressive performance in extracting complex relations from raw data and representing valuable information. Meanwhile, though, rooted in its notorious black-box nature, the appreciation of…

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: 122 pages, 14 figures

  6. Robust and Efficient Multilevel-ILU Preconditioning of Hybrid Newton-GMRES for Incompressible Navier-Stokes Equations

    Authors: Qiao Chen, Xiangmin Jiao, Oliver Yang

    Abstract: We introduce a robust and efficient preconditioner for a hybrid Newton-GMRES method for solving the nonlinear systems arising from incompressible Navier-Stokes equations. When the Reynolds number is relatively high, these systems often involve millions of degrees of freedom (DOFs), and the nonlinear systems are difficult to converge, partially due to the strong asymmetry of the system and the sadd…

    Submitted 9 August, 2021; v1 submitted 14 November, 2020; originally announced November 2020.

    Comments: Submitted to International Journal for Numerical Methods in Fluids

    Journal ref: International Journal for Numerical Methods in Fluids (2021)