Skip to main content

Showing 1–6 of 6 results for author: Codella, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.05885  [pdf, ps, other

    cs.DB cs.IR

    Cost-Effective, Low Latency Vector Search with Azure Cosmos DB

    Authors: Nitish Upreti, Krishnan Sundaram, Hari Sudan Sundar, Samer Boshra, Balachandar Perumalswamy, Shivam Atri, Martin Chisholm, Revti Raman Singh, Greg Yang, Subramanyam Pattipaka, Tamara Hass, Nitesh Dudhey, James Codella, Mark Hildebrand, Magdalen Manohar, Jack Moffitt, Haiyang Xu, Naren Datha, Suryansh Gupta, Ravishankar Krishnaswamy, Prashant Gupta, Abhishek Sahu, Ritika Mor, Santosh Kulkarni, Hemeswari Varada , et al. (11 additional authors not shown)

    Abstract: Vector indexing enables semantic search over diverse corpora and has become an important interface to databases for both users and AI agents. Efficient vector search requires deep optimizations in database systems. This has motivated a new class of specialized vector databases that optimize for vector search quality and cost. Instead, we argue that a scalable, high-performance, and cost-efficient… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    ACM Class: H.3.3

  2. arXiv:2104.04377  [pdf, other

    cs.LG

    Blending Knowledge in Deep Recurrent Networks for Adverse Event Prediction at Hospital Discharge

    Authors: Prithwish Chakraborty, James Codella, Piyush Madan, Ying Li, Hu Huang, Yoonyoung Park, Chao Yan, Ziqi Zhang, Cheng Gao, Steve Nyemba, Xu Min, Sanjib Basak, Mohamed Ghalwash, Zach Shahn, Parthasararathy Suryanarayanan, Italo Buleje, Shannon Harrer, Sarah Miller, Amol Rajmane, Colin Walsh, Jonathan Wanderer, Gigi Yuen Reed, Kenney Ng, Daby Sow, Bradley A. Malin

    Abstract: Deep learning architectures have an extremely high-capacity for modeling complex data in a wide variety of domains. However, these architectures have been limited in their ability to support complex prediction problems using insurance claims data, such as readmission at 30 days, mainly due to data sparsity issue. Consequently, classical machine learning methods, especially those that embed domain… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Presented at the AMIA 2021 Virtual Informatics Summit

  3. arXiv:2009.02188  [pdf

    cs.LG cs.AI stat.ML

    Phenotypical Ontology Driven Framework for Multi-Task Learning

    Authors: Mohamed Ghalwash, Zijun Yao, Prithwish Chakraborty, James Codella, Daby Sow

    Abstract: Despite the large number of patients in Electronic Health Records (EHRs), the subset of usable data for modeling outcomes of specific phenotypes are often imbalanced and of modest size. This can be attributed to the uneven coverage of medical concepts in EHRs. In this paper, we propose OMTL, an Ontology-driven Multi-Task Learning framework, that is designed to overcome such data limitations. The k… ▽ More

    Submitted 4 September, 2020; originally announced September 2020.

    Comments: To be appear on ACM CHIL 2021

  4. arXiv:2007.12780  [pdf, other

    cs.LG cs.AI cs.CY

    A Canonical Architecture For Predictive Analytics on Longitudinal Patient Records

    Authors: Parthasarathy Suryanarayanan, Bhavani Iyer, Prithwish Chakraborty, Bibo Hao, Italo Buleje, Piyush Madan, James Codella, Antonio Foncubierta, Divya Pathak, Sarah Miller, Amol Rajmane, Shannon Harrer, Gigi Yuan-Reed, Daby Sow

    Abstract: Many institutions within the healthcare ecosystem are making significant investments in AI technologies to optimize their business operations at lower cost with improved patient outcomes. Despite the hype with AI, the full realization of this potential is seriously hindered by several systemic problems, including data privacy, security, bias, fairness, and explainability. In this paper, we propose… ▽ More

    Submitted 5 January, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: Presented at DSHealth 2020 KDD Workshop on Applied Data Science for Healthcare

  5. arXiv:2005.06434  [pdf

    cs.LG cs.IR stat.ML

    ODVICE: An Ontology-Driven Visual Analytic Tool for Interactive Cohort Extraction

    Authors: Mohamed Ghalwash, Zijun Yao, Prithwish Chakrabotry, James Codella, Daby Sow

    Abstract: Increased availability of electronic health records (EHR) has enabled researchers to study various medical questions. Cohort selection for the hypothesis under investigation is one of the main consideration for EHR analysis. For uncommon diseases, cohorts extracted from EHRs contain very limited number of records - hampering the robustness of any analysis. Data augmentation methods have been succe… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  6. arXiv:1912.07200  [pdf, other

    cs.CV cs.LG

    A Broader Study of Cross-Domain Few-Shot Learning

    Authors: Yunhui Guo, Noel C. Codella, Leonid Karlinsky, James V. Codella, John R. Smith, Kate Saenko, Tajana Rosing, Rogerio Feris

    Abstract: Recent progress on few-shot learning largely relies on annotated data for meta-learning: base classes sampled from the same domain as the novel classes. However, in many applications, collecting data for meta-learning is infeasible or impossible. This leads to the cross-domain few-shot learning problem, where there is a large shift between base and novel class domains. While investigations of the… ▽ More

    Submitted 17 July, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: ECCV 2020. Website: https://www.learning-with-limited-labels.com/