Skip to main content

Showing 1–11 of 11 results for author: Chiang, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.11323  [pdf, other

    stat.ML cs.LG

    Convergence Rates of Constrained Expected Improvement

    Authors: Haowei Wang, Jingyi Wang, Zhongxiang Dai, Nai-Yuan Chiang, Szu Hui Ng, Cosmin G. Petra

    Abstract: Constrained Bayesian optimization (CBO) methods have seen significant success in black-box optimization with constraints, and one of the most commonly used CBO methods is the constrained expected improvement (CEI) algorithm. CEI is a natural extension of the expected improvement (EI) when constraints are incorporated. However, the theoretical convergence rate of CEI has not been established. In th… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  2. arXiv:2504.03964  [pdf, other

    cs.CL cs.AI cs.LG

    Clinical ModernBERT: An efficient and long context encoder for biomedical text

    Authors: Simon A. Lee, Anthony Wu, Jeffrey N. Chiang

    Abstract: We introduce Clinical ModernBERT, a transformer based encoder pretrained on large scale biomedical literature, clinical notes, and medical ontologies, incorporating PubMed abstracts, MIMIC IV clinical data, and medical codes with their textual descriptions. Building on ModernBERT the current state of the art natural language text encoder featuring architectural upgrades such as rotary positional e… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: Manuscript writeup corresponding to the Clinical ModernBERT pre-trained encoder (https://huggingface.co/Simonlee711/Clinical_ModernBERT)

  3. arXiv:2501.09262  [pdf, other

    stat.ML cs.LG math.OC

    On the convergence rate of noisy Bayesian Optimization with Expected Improvement

    Authors: Jingyi Wang, Haowei Wang, Nai-Yuan Chiang, Cosmin G. Petra

    Abstract: Expected improvement (EI) is one of the most widely used acquisition functions in Bayesian optimization (BO). Despite its proven success in applications for decades, important open questions remain on the theoretical convergence behaviors and rates for EI. In this paper, we contribute to the convergence theory of EI in three novel and critical areas. First, we consider objective functions that fit… ▽ More

    Submitted 12 February, 2025; v1 submitted 15 January, 2025; originally announced January 2025.

  4. arXiv:2412.18789  [pdf, ps, other

    cs.LG stat.ML

    On Improved Regret Bounds In Bayesian Optimization with Gaussian Noise

    Authors: Jingyi Wang, Haowei Wang, Cosmin G. Petra, Nai-Yuan Chiang

    Abstract: Bayesian optimization (BO) with Gaussian process (GP) surrogate models is a powerful black-box optimization method. Acquisition functions are a critical part of a BO algorithm as they determine how the new samples are selected. Some of the most widely used acquisition functions include upper confidence bound (UCB) and Thompson sampling (TS). The convergence analysis of BO algorithms has focused on… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

  5. arXiv:2411.01322  [pdf, other

    cs.LG stat.ML

    FEET: A Framework for Evaluating Embedding Techniques

    Authors: Simon A. Lee, John Lee, Jeffrey N. Chiang

    Abstract: In this study, we introduce FEET, a standardized protocol designed to guide the development and benchmarking of foundation models. While numerous benchmark datasets exist for evaluating these models, we propose a structured evaluation protocol across three distinct scenarios to gain a comprehensive understanding of their practical performance. We define three primary use cases: frozen embeddings,… ▽ More

    Submitted 2 November, 2024; originally announced November 2024.

    Comments: Findings paper presented at Machine Learning for Health (ML4H) symposium 2024, December 15-16, 2024, Vancouver, Canada, 11 pages

  6. arXiv:2409.15586  [pdf, other

    eess.SP cs.AI

    TFT-multi: simultaneous forecasting of vital sign trajectories in the ICU

    Authors: Rosemary Y. He, Jeffrey N. Chiang

    Abstract: Trajectory forecasting in healthcare data has been an important area of research in precision care and clinical integration for computational methods. In recent years, generative AI models have demonstrated promising results in capturing short and long range dependencies in time series data. While these models have also been applied in healthcare, most of them only predict one value at a time, whi… ▽ More

    Submitted 6 December, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

  7. arXiv:2405.20419  [pdf, other

    cs.LG cs.AI cs.CL

    Enhancing Antibiotic Stewardship using a Natural Language Approach for Better Feature Representation

    Authors: Simon A. Lee, Trevor Brokowski, Jeffrey N. Chiang

    Abstract: The rapid emergence of antibiotic-resistant bacteria is recognized as a global healthcare crisis, undermining the efficacy of life-saving antibiotics. This crisis is driven by the improper and overuse of antibiotics, which escalates bacterial resistance. In response, this study explores the use of clinical decision support systems, enhanced through the integration of electronic health records (EHR… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  8. arXiv:2403.07384  [pdf, other

    cs.CL cs.AI cs.LG

    SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models

    Authors: Yu Yang, Siddhartha Mishra, Jeffrey N Chiang, Baharan Mirzasoleiman

    Abstract: Despite the effectiveness of data selection for large language models (LLMs) during pretraining and instruction fine-tuning phases, improving data efficiency in supervised fine-tuning (SFT) for specialized domains poses significant challenges due to the complexity of fine-tuning data. To bridge this gap, we introduce an effective and scalable data selection method for SFT, SmallToLarge (S2L), whic… ▽ More

    Submitted 5 December, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  9. arXiv:2402.00160  [pdf, other

    cs.CL

    Emergency Department Decision Support using Clinical Pseudo-notes

    Authors: Simon A. Lee, Sujay Jain, Alex Chen, Kyoka Ono, Jennifer Fang, Akos Rudas, Jeffrey N. Chiang

    Abstract: In this work, we introduce the Multiple Embedding Model for EHR (MEME), an approach that serializes multimodal EHR tabular data into text using pseudo-notes, mimicking clinical text generation. This conversion not only preserves better representations of categorical data and learns contexts but also enables the effective employment of pretrained foundation models for rich feature representation. T… ▽ More

    Submitted 29 April, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  10. arXiv:2210.06664  [pdf

    eess.IV cs.AI cs.CV

    Are Macula or Optic Nerve Head Structures better at Diagnosing Glaucoma? An Answer using AI and Wide-Field Optical Coherence Tomography

    Authors: Charis Y. N. Chiang, Fabian Braeu, Thanadet Chuangsuwanich, Royston K. Y. Tan, Jacqueline Chua, Leopold Schmetterer, Alexandre Thiery, Martin Buist, Michaël J. A. Girard

    Abstract: Purpose: (1) To develop a deep learning algorithm to automatically segment structures of the optic nerve head (ONH) and macula in 3D wide-field optical coherence tomography (OCT) scans; (2) To assess whether 3D macula or ONH structures (or the combination of both) provide the best diagnostic power for glaucoma. Methods: A cross-sectional comparative study was performed which included wide-field sw… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 23 pages, 5 figures

  11. arXiv:2110.03636  [pdf, other

    math.OC cs.DC

    A Hybrid Direct-Iterative Method for Solving KKT Linear Systems

    Authors: Shaked Regev, Nai-Yuan Chiang, Eric Darve, Cosmin G. Petra, Michael A. Saunders, Kasia Świrydowicz, Slaven Peleš

    Abstract: We propose a solution strategy for linear systems arising in interior method optimization, which is suitable for implementation on hardware accelerators such as graphical processing units (GPUs). The current gold standard for solving these systems is the LDL^T factorization. However, LDL^T requires pivoting during factorization, which substantially increases communication cost and degrades perform… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 22 pages, 9 figures, 7 tables

    MSC Class: 15; 65; 68 ACM Class: G.1