Skip to main content

Showing 1–4 of 4 results for author: Fu, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.18955  [pdf, other

    cs.CL

    BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning

    Authors: Yujuan Velvin Fu, Giridhar Kaushik Ramachandran, Namu Park, Kevin Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen

    Abstract: Large language models (LLMs) such as ChatGPT are fine-tuned on large and diverse instruction-following corpora, and can generalize to new tasks. However, those instruction-tuned LLMs often perform poorly in specialized medical natural language understanding (NLU) tasks that require domain knowledge, granular text comprehension, and structured data extraction. To bridge the gap, we: (1) propose a u… ▽ More

    Submitted 9 March, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: 3 figures an 5 tables; Accepted by AMIA 2025 Informatics Summit

  2. arXiv:2211.02753  [pdf, other

    cs.DB cs.LG

    The Tensor Data Platform: Towards an AI-centric Database System

    Authors: Apurva Gandhi, Yuki Asada, Victor Fu, Advitya Gemawat, Lihao Zhang, Rathijit Sen, Carlo Curino, Jesús Camacho-Rodríguez, Matteo Interlandi

    Abstract: Database engines have historically absorbed many of the innovations in data processing, adding features to process graph data, XML, object oriented, and text among many others. In this paper, we make the case that it is time to do the same for AI -- but with a twist! While existing approaches have tried to achieve this by integrating databases with external ML tools, in this paper we claim that ac… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at The Conference on Innovative Data Systems Research (CIDR) 2023

  3. Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem

    Authors: Yuki Asada, Victor Fu, Apurva Gandhi, Advitya Gemawat, Lihao Zhang, Dong He, Vivek Gupta, Ehi Nosakhare, Dalitso Banda, Rathijit Sen, Matteo Interlandi

    Abstract: We demonstrate Tensor Query Processor (TQP): a query processor that automatically compiles relational operators into tensor programs. By leveraging tensor runtimes such as PyTorch, TQP is able to: (1) integrate with ML tools (e.g., Pandas for data ingestion, Tensorboard for visualization); (2) target different hardware (e.g., CPU, GPU) and software (e.g., browser) backends; and (3) end-to-end acce… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Journal ref: PVLDB, 15(12): 3598 - 3601, 2022

  4. arXiv:2005.08068  [pdf, other

    cs.LG cs.AI stat.ML

    Model-Augmented Actor-Critic: Backpropagating through Paths

    Authors: Ignasi Clavera, Violet Fu, Pieter Abbeel

    Abstract: Current model-based reinforcement learning approaches use the model simply as a learned black-box simulator to augment the data for policy optimization or value function learning. In this paper, we show how to make more effective use of the model by exploiting its differentiability. We construct a policy optimization algorithm that uses the pathwise derivative of the learned model and policy acros… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: Accepted paper at ICLR 2020