
Showing 1–29 of 29 results for author: Bitterman, D

Searching in archive cs.
  1. arXiv:2506.14909

    eess.IV cs.AI cs.CV

    Foundation Artificial Intelligence Models for Health Recognition Using Face Photographs (FAHR-Face)

    Authors: Fridolin Haugg, Grace Lee, John He, Leonard Nürnberg, Dennis Bontempi, Danielle S. Bitterman, Paul Catalano, Vasco Prudente, Dmitrii Glubokov, Andrew Warrington, Suraj Pai, Dirk De Ruysscher, Christian Guthier, Benjamin H. Kann, Vadim N. Gladyshev, Hugo JWL Aerts, Raymond H. Mak

    Abstract: Background: Facial appearance offers a noninvasive window into health. We built FAHR-Face, a foundation model trained on >40 million facial images and fine-tuned it for two distinct tasks: biological age estimation (FAHR-FaceAge) and survival risk prediction (FAHR-FaceSurvival). Methods: FAHR-FaceAge underwent a two-stage, age-balanced fine-tuning on 749,935 public images; FAHR-FaceSurvival was…

    Submitted 17 June, 2025; originally announced June 2025.

  2. arXiv:2506.07458

    cs.CL cs.AI cs.LG

    KScope: A Framework for Characterizing the Knowledge Status of Language Models

    Authors: Yuxin Xiao, Shan Chen, Jack Gallifant, Danielle Bitterman, Thomas Hartvigsen, Marzyeh Ghassemi

    Abstract: Characterizing a large language model's (LLM's) knowledge of a given question is challenging. As a result, prior work has primarily examined LLM behavior under knowledge conflicts, where the model's internal parametric memory contradicts information in the external context. However, this does not fully reflect how well the model knows the answer to the question. In this paper, we first introduce a…

    Submitted 9 June, 2025; originally announced June 2025.

  3. arXiv:2505.22888

    cs.CL

    When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy

    Authors: Jirui Qi, Shan Chen, Zidi Xiong, Raquel Fernández, Danielle S. Bitterman, Arianna Bisazza

    Abstract: Recent Large Reasoning Models (LRMs) with thinking traces have shown strong performance on English reasoning tasks. However, their ability to think in other languages is less studied. This capability is as important as answer accuracy for real world applications because users may find the reasoning trace useful for oversight only when it is expressed in their own language. We comprehensively evalu…

    Submitted 28 May, 2025; originally announced May 2025.

  4. arXiv:2505.14963

    cs.CL

    MedBrowseComp: Benchmarking Medical Deep Research and Computer Use

    Authors: Shan Chen, Pedro Moreira, Yuxin Xiao, Sam Schmidgall, Jeremy Warner, Hugo Aerts, Thomas Hartvigsen, Jack Gallifant, Danielle S. Bitterman

    Abstract: Large language models (LLMs) are increasingly envisioned as decision-support tools in clinical practice, yet safe clinical reasoning demands integrating heterogeneous knowledge bases -- trials, primary studies, regulatory documents, and cost data -- under strict accuracy constraints. Existing evaluations often rely on synthetic prompts, reduce the task to single-hop factoid queries, or conflate re…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: You can visit our project page at: https://moreirap12.github.io/mbc-browse-app/

  5. arXiv:2504.02874

    cs.CL

    TheBlueScrubs-v1, a comprehensive curated medical dataset derived from the internet

    Authors: Luis Felipe, Carlos Garcia, Issam El Naqa, Monique Shotande, Aakash Tripathi, Vivek Rudrapatna, Ghulam Rasool, Danielle Bitterman, Gilmer Valdes

    Abstract: The need for robust and diverse data sets to train clinical large language models (cLLMs) is critical given that currently available public repositories often prove too limited in size or scope for comprehensive medical use. While resources like PubMed provide foundational medical literature, they capture only a narrow range of formal publications and omit the broader medical discourse on the inte…

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: 22 pages, 8 figures, 10 tables

  6. arXiv:2502.11367

    cs.LG cs.AI cs.CL

    Sparse Autoencoder Features for Classifications and Transferability

    Authors: Jack Gallifant, Shan Chen, Kuleen Sasse, Hugo Aerts, Thomas Hartvigsen, Danielle S. Bitterman

    Abstract: Sparse Autoencoders (SAEs) provide potentials for uncovering structured, human-interpretable representations in Large Language Models (LLMs), making them a crucial tool for transparent and controllable AI systems. We systematically analyze SAE for interpretable feature extraction from LLMs in safety-critical classification tasks. Our framework evaluates (1) model-layer selection and scaling proper…

    Submitted 16 February, 2025; originally announced February 2025.

  7. arXiv:2502.07794

    cs.CY cs.AI

    Regulatory Science Innovation for Generative AI and Large Language Models in Health and Medicine: A Global Call for Action

    Authors: Jasmine Chiat Ling Ong, Yilin Ning, Mingxuan Liu, Yian Ma, Zhao Liang, Kuldev Singh, Robert T Chang, Silke Vogel, John CW Lim, Iris Siu Kwan Tan, Oscar Freyer, Stephen Gilbert, Danielle S Bitterman, Xiaoxuan Liu, Alastair K Denniston, Nan Liu

    Abstract: The integration of generative AI (GenAI) and large language models (LLMs) in healthcare presents both unprecedented opportunities and challenges, necessitating innovative regulatory approaches. GenAI and LLMs offer broad applications, from automating clinical workflows to personalizing diagnostics. However, the non-deterministic outputs, broad functionalities and complex integration of GenAI and L…

    Submitted 27 January, 2025; originally announced February 2025.

  8. arXiv:2412.14304

    cs.CL cs.AI

    Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs

    Authors: David Restrepo, Chenwei Wu, Zhengxu Tang, Zitao Shuai, Thao Nguyen Minh Phan, Jun-En Ding, Cong-Tinh Dao, Jack Gallifant, Robyn Gayle Dychiao, Jose Carlo Artiaga, André Hiroshi Bando, Carolina Pelegrini Barbosa Gracitelli, Vincenz Ferrer, Leo Anthony Celi, Danielle Bitterman, Michael G Morley, Luis Filipe Nakayama

    Abstract: Current ophthalmology clinical workflows are plagued by over-referrals, long waits, and complex and heterogeneous medical records. Large language models (LLMs) present a promising solution to automate various procedures such as triaging, preliminary tests like visual acuity assessment, and report summaries. However, LLMs have demonstrated significantly varied performance across different languages…

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Accepted at the AAAI 2025 Artificial Intelligence for Social Impact Track (AAAI-AISI 2025)

  9. arXiv:2412.01955

    cs.CL cs.AI

    The use of large language models to enhance cancer clinical trial educational materials

    Authors: Mingye Gao, Aman Varshney, Shan Chen, Vikram Goddla, Jack Gallifant, Patrick Doyle, Claire Novack, Maeve Dillon-Martin, Teresia Perkins, Xinrong Correia, Erik Duhaime, Howard Isenstein, Elad Sharon, Lisa Soleymani Lehmann, David Kozono, Brian Anthony, Dmitriy Dligach, Danielle S. Bitterman

    Abstract: Cancer clinical trials often face challenges in recruitment and engagement due to a lack of participant-facing informational and educational resources. This study investigated the potential of Large Language Models (LLMs), specifically GPT4, in generating patient-friendly educational content from clinical trial informed consent forms. Using data from ClinicalTrials.gov, we employed zero-shot learn…

    Submitted 3 December, 2024; v1 submitted 2 December, 2024; originally announced December 2024.

  10. arXiv:2411.06469

    cs.CL

    ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

    Authors: Canyu Chen, Jian Yu, Shan Chen, Che Liu, Zhongwei Wan, Danielle Bitterman, Fei Wang, Kai Shu

    Abstract: Large Language Models (LLMs) hold great promise to revolutionize current clinical systems for their superior capacities on medical text processing tasks and medical licensing exams. Meanwhile, traditional ML models such as SVM and XGBoost have still been mainly adopted in clinical prediction tasks. An emerging question is Can LLMs beat traditional ML models in clinical prediction? Thus, we build a…

    Submitted 10 November, 2024; originally announced November 2024.

    Comments: The first two authors contributed equally. 10 pages for main paper, 66 pages including appendix. Project website: https://clinicalbench.github.io

  11. arXiv:2411.04962

    cs.AI cs.CL

    Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability

    Authors: Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A Miller, Danielle Bitterman, Guanhua Chen, Anoop Mayampurath, Matthew Churpek, Majid Afshar

    Abstract: Large language models (LLMs) are being explored for diagnostic decision support, yet their ability to estimate pre-test probabilities, vital for clinical decision-making, remains limited. This study evaluates two LLMs, Mistral-7B and Llama3-70B, using structured electronic health record data on three diagnosis tasks. We examined three current methods of extracting LLM probability estimations and r…

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Accepted to GenAI4Health Workshop at NeurIPS 2024

  12. arXiv:2410.13146

    cs.CL cs.CV

    debiaSAE: Benchmarking and Mitigating Vision-Language Model Bias

    Authors: Kuleen Sasse, Shan Chen, Jackson Pond, Danielle Bitterman, John Osborne

    Abstract: As Vision Language Models (VLMs) gain widespread use, their fairness remains under-explored. In this paper, we analyze demographic biases across five models and six datasets. We find that portrait datasets like UTKFace and CelebA are the best tools for bias detection, finding gaps in performance and fairness for both LLaVa and CLIP models. Scene-based datasets like PATA and VLStereoSet fail to be…

    Submitted 29 March, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Under Review at COLM 2025

  13. arXiv:2410.12722

    cs.CL

    WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation

    Authors: João Matos, Shan Chen, Siena Placino, Yingya Li, Juan Carlos Climent Pardo, Daphna Idan, Takeshi Tohyama, David Restrepo, Luis F. Nakayama, Jose M. M. Pascual-Leone, Guergana Savova, Hugo Aerts, Leo A. Celi, A. Ian Wong, Danielle S. Bitterman, Jack Gallifant

    Abstract: Multimodal/vision language models (VLMs) are increasingly being deployed in healthcare settings worldwide, necessitating robust benchmarks to ensure their safety, efficacy, and fairness. Multiple-choice question and answer (QA) datasets derived from national medical examinations have long served as valuable evaluation tools, but existing datasets are largely text-only and available in a limited su…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: submitted for review, total of 14 pages

  14. arXiv:2409.20385

    cs.CL

    Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation

    Authors: Shan Chen, Mingye Gao, Kuleen Sasse, Thomas Hartvigsen, Brian Anthony, Lizhou Fan, Hugo Aerts, Jack Gallifant, Danielle Bitterman

    Abstract: Background: Large language models (LLMs) are trained to follow directions, but this introduces a vulnerability to blindly comply with user requests even if they generate wrong information. In medicine, this could accelerate the generation of misinformation that impacts human well-being. Objectives/Methods: We analyzed compliance to requests to generate misleading content about medications in set…

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: Submitted for Review

  15. arXiv:2409.18968

    cs.CY cs.AI cs.LG

    Safety challenges of AI in medicine in the era of large language models

    Authors: Xiaoye Wang, Nicole Xi Zhang, Hongyu He, Trang Nguyen, Kun-Hsing Yu, Hao Deng, Cynthia Brandt, Danielle S. Bitterman, Ling Pan, Ching-Yu Cheng, James Zou, Dianbo Liu

    Abstract: Recent advancements in artificial intelligence (AI), particularly in large language models (LLMs), have unlocked significant potential to enhance the quality and efficiency of medical care. By introducing a novel way to interact with AI and data through natural language, LLMs offer new opportunities for medical practitioners, patients, and researchers. However, as AI and LLMs become more powerful…

    Submitted 30 January, 2025; v1 submitted 11 September, 2024; originally announced September 2024.

  16. arXiv:2409.18924

    cs.CL cs.AI

    AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow

    Authors: Huizi Yu, Jiayan Zhou, Lingyao Li, Shan Chen, Jack Gallifant, Anye Shi, Xiang Li, Wenyue Hua, Mingyu Jin, Guang Chen, Yang Zhou, Zhao Li, Trisha Gupte, Ming-Li Chen, Zahra Azizi, Yongfeng Zhang, Themistocles L. Assimes, Xin Ma, Danielle S. Bitterman, Lin Lu, Lizhou Fan

    Abstract: Simulated patient systems play a crucial role in modern medical education and research, providing safe, integrative learning environments and enabling clinical decision-making simulations. Large Language Models (LLM) could advance simulated patient systems by replicating medical conditions and patient-doctor interactions with high fidelity and low cost. However, ensuring the effectiveness and trus…

    Submitted 1 October, 2024; v1 submitted 27 September, 2024; originally announced September 2024.

    Comments: 42 pages, 6 figures, 7 tables

  17. arXiv:2408.11854

    cs.CL cs.AI cs.LG

    When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?

    Authors: Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A Miller, Danielle Bitterman, Matthew Churpek, Majid Afshar

    Abstract: The introduction of Large Language Models (LLMs) has advanced data representation and analysis, bringing significant progress in their use for medical questions and answering. Despite these advancements, integrating tabular data, especially numerical data pivotal in clinical contexts, into LLM paradigms has not been thoroughly explored. In this study, we examine the effectiveness of vector represe…

    Submitted 19 September, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: Accepted to Findings of EMNLP 2024

  18. arXiv:2406.13152

    cs.CL

    Analyzing Diversity in Healthcare LLM Research: A Scientometric Perspective

    Authors: David Restrepo, Chenwei Wu, Constanza Vásquez-Venegas, João Matos, Jack Gallifant, Leo Anthony Celi, Danielle S. Bitterman, Luis Filipe Nakayama

    Abstract: The deployment of large language models (LLMs) in healthcare has demonstrated substantial potential for enhancing clinical decision-making, administrative efficiency, and patient outcomes. However, the underrepresentation of diverse groups in the development and application of these models can perpetuate biases, leading to inequitable healthcare delivery. This paper presents a comprehensive scient…

    Submitted 2 September, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  19. arXiv:2406.12449

    cs.AI

    Retrieval-Augmented Generation for Generative Artificial Intelligence in Medicine

    Authors: Rui Yang, Yilin Ning, Emilia Keppo, Mingxuan Liu, Chuan Hong, Danielle S Bitterman, Jasmine Chiat Ling Ong, Daniel Shu Wei Ting, Nan Liu

    Abstract: Generative artificial intelligence (AI) has brought revolutionary innovations in various fields, including medicine. However, it also exhibits limitations. In response, retrieval-augmented generation (RAG) provides a potential solution, enabling models to generate more accurate contents by leveraging the retrieval of external knowledge. With the rapid advancement of generative AI, RAG can pave the…

    Submitted 18 June, 2024; originally announced June 2024.

  20. arXiv:2406.12066

    cs.CL

    Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks

    Authors: Jack Gallifant, Shan Chen, Pedro Moreira, Nikolaj Munch, Mingye Gao, Jackson Pond, Leo Anthony Celi, Hugo Aerts, Thomas Hartvigsen, Danielle Bitterman

    Abstract: Medical knowledge is context-dependent and requires consistent reasoning across various natural language expressions of semantically equivalent phrases. This is particularly crucial for drug names, where patients often use brand names like Advil or Tylenol instead of their generic equivalents. To study this, we create a new robustness dataset, RABBITS, to evaluate performance differences on medica…

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: submitted for review, total 15 pages

  21. arXiv:2405.05506

    cs.CL

    Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias

    Authors: Shan Chen, Jack Gallifant, Mingye Gao, Pedro Moreira, Nikolaj Munch, Ajay Muthukkumar, Arvind Rajan, Jaya Kolluri, Amelia Fiske, Janna Hastings, Hugo Aerts, Brian Anthony, Leo Anthony Celi, William G. La Cava, Danielle S. Bitterman

    Abstract: Large language models (LLMs) are increasingly essential in processing natural languages, yet their application is frequently compromised by biases and inaccuracies originating in their training data. In this study, we introduce Cross-Care, the first benchmark framework dedicated to assessing biases and real world knowledge in LLMs, specifically focusing on the representation of disease prevalence…

    Submitted 24 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: Submitted for review, data visualization tool available at: www.crosscare.net

  22. arXiv:2405.05049

    cs.CL

    Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources

    Authors: Lasse Hyldig Hansen, Nikolaj Andersen, Jack Gallifant, Liam G. McCoy, James K Stone, Nura Izath, Marcela Aguirre-Jerez, Danielle S Bitterman, Judy Gichoya, Leo Anthony Celi

    Abstract: Background Advancements in Large Language Models (LLMs) hold transformative potential in healthcare, however, recent work has raised concern about the tendency of these models to produce outputs that display racial or gender biases. Although training data is a likely source of such biases, exploration of disease and demographic associations in text data at scale has been limited. Methods We cond…

    Submitted 8 May, 2024; originally announced May 2024.

  23. arXiv:2403.19511

    cs.CL

    Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data

    Authors: Shan Chen, Jack Gallifant, Marco Guevara, Yanjun Gao, Majid Afshar, Timothy Miller, Dmitriy Dligach, Danielle S. Bitterman

    Abstract: Generative models have been showing potential for producing data in mass. This study explores the enhancement of clinical natural language processing performance by utilizing synthetic data generated from advanced language models. Promising results show feasible applications in such a high-stakes domain.

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: submitted to review

  24. arXiv:2310.17703

    cs.CL

    The impact of responding to patient messages with large language model assistance

    Authors: Shan Chen, Marco Guevara, Shalini Moningi, Frank Hoebers, Hesham Elhalawani, Benjamin H. Kann, Fallon E. Chipidza, Jonathan Leeman, Hugo J. W. L. Aerts, Timothy Miller, Guergana K. Savova, Raymond H. Mak, Maryam Lustberg, Majid Afshar, Danielle S. Bitterman

    Abstract: Documentation burden is a major contributor to clinician burnout, which is rising nationally and is an urgent threat to our ability to care for patients. Artificial intelligence (AI) chatbots, such as ChatGPT, could reduce clinician burden by assisting with documentation. Although many hospitals are actively integrating such systems into electronic medical record systems, AI chatbots utility and i…

    Submitted 29 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 4 figures and tables in main, submitted for review

  25. arXiv:2310.12300

    cs.CL

    Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly

    Authors: Sheng Lu, Shan Chen, Yingya Li, Danielle Bitterman, Guergana Savova, Iryna Gurevych

    Abstract: In-context learning (ICL) is a new learning paradigm that has gained popularity along with the development of large language models. In this work, we adapt a recently proposed hardness metric, pointwise $\mathcal{V}$-usable information (PVI), to an in-context version (in-context PVI). Compared to the original PVI, in-context PVI is more efficient in that it requires only a few exemplars and does n…

    Submitted 8 December, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  26. arXiv:2309.12339

    cs.CY cs.AI cs.CL

    Considerations for health care institutions training large language models on electronic health records

    Authors: Weipeng Zhou, Danielle Bitterman, Majid Afshar, Timothy A. Miller

    Abstract: Large language models (LLMs) like ChatGPT have excited scientists across fields; in medicine, one source of excitement is the potential applications of LLMs trained on electronic health record (EHR) data. But there are tough questions we must first answer if health care institutions are interested in having LLMs trained on their own data; should they train an LLM from scratch or fine-tune it from…

    Submitted 23 August, 2023; originally announced September 2023.

  27. Large Language Models to Identify Social Determinants of Health in Electronic Health Records

    Authors: Marco Guevara, Shan Chen, Spencer Thomas, Tafadzwa L. Chaunzwa, Idalid Franco, Benjamin Kann, Shalini Moningi, Jack Qian, Madeleine Goldstein, Susan Harper, Hugo JWL Aerts, Guergana K. Savova, Raymond H. Mak, Danielle S. Bitterman

    Abstract: Social determinants of health (SDoH) have an important impact on patient outcomes but are incompletely collected from the electronic health records (EHR). This study researched the ability of large language models to extract SDoH from free text in EHRs, where they are most commonly documented, and explored the role of synthetic clinical text for improving the extraction of these scarcely documente…

    Submitted 5 March, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: Peer-reviewed version published at NPJ Digital Medicine: https://www.nature.com/articles/s41746-023-00970-0

    Journal ref: NPJ Digit Med. 2024 Jan 11;7(1):6

  28. Evaluation of ChatGPT Family of Models for Biomedical Reasoning and Classification

    Authors: Shan Chen, Yingya Li, Sheng Lu, Hoang Van, Hugo JWL Aerts, Guergana K. Savova, Danielle S. Bitterman

    Abstract: Recent advances in large language models (LLMs) have shown impressive ability in biomedical question-answering, but have not been adequately investigated for more specific biomedical applications. This study investigates the performance of LLMs such as the ChatGPT family of models (GPT-3.5s, GPT-4) in biomedical tasks beyond question-answering. Because no patient data can be passed to the OpenAI A…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 28 pages, 2 tables and 4 figures. Submitting for review

  29. Natural language processing to automatically extract the presence and severity of esophagitis in notes of patients undergoing radiotherapy

    Authors: Shan Chen, Marco Guevara, Nicolas Ramirez, Arpi Murray, Jeremy L. Warner, Hugo JWL Aerts, Timothy A. Miller, Guergana K. Savova, Raymond H. Mak, Danielle S. Bitterman

    Abstract: Radiotherapy (RT) toxicities can impair survival and quality-of-life, yet remain under-studied. Real-world evidence holds potential to improve our understanding of toxicities, but toxicity information is often only in clinical notes. We developed natural language processing (NLP) models to identify the presence and severity of esophagitis from notes of patients treated with thoracic RT. We fine-tu…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 17 pages, 6 tables, 1 figure, submitting to JCO-CCI for review