Skip to main content

Showing 1–11 of 11 results for author: Smit, A

.
  1. arXiv:2407.13780  [pdf, other

    q-bio.BM cs.CL cs.LG

    Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets

    Authors: Ulrich A. Mbou Sob, Qiulin Li, Miguel Arbesú, Oliver Bent, Andries P. Smit, Arnu Pretorius

    Abstract: A specific challenge with deep learning approaches for molecule generation is generating both syntactically valid and chemically plausible molecular string representations. To address this, we propose a novel generative latent-variable transformer model for small molecules that leverages a recently proposed molecular string representation called SAFE. We introduce a modification to SAFE to reduce… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 12 pages, 6 figures, Proceedings of the ICML 2024 Workshop on Accessible and Effi- cient Foundation Models for Biological Discovery, Vienna, Austria. 2024

  2. arXiv:2405.13492  [pdf, other

    astro-ph.IM astro-ph.CO

    Euclid. II. The VIS Instrument

    Authors: Euclid Collaboration, M. S. Cropper, A. Al-Bahlawan, J. Amiaux, S. Awan, R. Azzollini, K. Benson, M. Berthe, J. Boucher, E. Bozzo, C. Brockley-Blatt, G. P. Candini, C. Cara, R. A. Chaudery, R. E. Cole, P. Danto, J. Denniston, A. M. Di Giorgio, B. Dryer, J. -P. Dubois, J. Endicott, M. Farina, E. Galli, L. Genolet, J. P. D. Gow , et al. (410 additional authors not shown)

    Abstract: This paper presents the specification, design, and development of the Visible Camera (VIS) on the ESA Euclid mission. VIS is a large optical-band imager with a field of view of 0.54 deg^2 sampled at 0.1" with an array of 609 Megapixels and spatial resolution of 0.18". It will be used to survey approximately 14,000 deg^2 of extragalactic sky to measure the distortion of galaxies in the redshift ran… ▽ More

    Submitted 2 January, 2025; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Paper submitted as part of the A&A special issue `Euclid on Sky', which contains Euclid key reference papers and first results from the Euclid Early Release Observations (A&A reference aa50996-24). This revision is the version accepted by A&A on 19 September 2024. Other than changes to the author list, changes are limited to editorial and journal style

    Journal ref: A&A 697, A2 (2025)

  3. arXiv:2405.13491  [pdf, other

    astro-ph.CO astro-ph.GA astro-ph.IM

    Euclid. I. Overview of the Euclid mission

    Authors: Euclid Collaboration, Y. Mellier, Abdurro'uf, J. A. Acevedo Barroso, A. Achúcarro, J. Adamek, R. Adam, G. E. Addison, N. Aghanim, M. Aguena, V. Ajani, Y. Akrami, A. Al-Bahlawan, A. Alavi, I. S. Albuquerque, G. Alestas, G. Alguero, A. Allaoui, S. W. Allen, V. Allevato, A. V. Alonso-Tetilla, B. Altieri, A. Alvarez-Candal, S. Alvi, A. Amara , et al. (1115 additional authors not shown)

    Abstract: The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14… ▽ More

    Submitted 24 September, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in the A&A special issue`Euclid on Sky'

    Journal ref: A&A 697, A1 (2025)

  4. arXiv:2311.17371  [pdf, other

    cs.CL cs.AI

    Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs

    Authors: Andries Smit, Paul Duckworth, Nathan Grinsztajn, Thomas D. Barrett, Arnu Pretorius

    Abstract: Recent advancements in large language models (LLMs) underscore their potential for responding to inquiries in various domains. However, ensuring that generative agents provide accurate and reliable answers remains an ongoing challenge. In this context, multi-agent debate (MAD) has emerged as a promising strategy for enhancing the truthfulness of LLMs. We benchmark a range of debating and prompting… ▽ More

    Submitted 18 July, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 2 pages, 13 figures

  5. arXiv:2306.09884  [pdf, other

    cs.LG cs.AI

    Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

    Authors: Clément Bonnet, Daniel Luo, Donal Byrne, Shikha Surana, Sasha Abramowitz, Paul Duckworth, Vincent Coyette, Laurence I. Midgley, Elshadai Tegegn, Tristan Kalloniatis, Omayma Mahjoub, Matthew Macfarlane, Andries P. Smit, Nathan Grinsztajn, Raphael Boige, Cemlyn N. Waters, Mohamed A. Mimouni, Ulrich A. Mbou Sob, Ruan de Kock, Siddarth Singh, Daniel Furelos-Blanco, Victor Le, Arnu Pretorius, Alexandre Laterre

    Abstract: Open-source reinforcement learning (RL) environments have played a crucial role in driving progress in the development of AI algorithms. In modern RL research, there is a need for simulated environments that are performant, scalable, and modular to enable their utilization in a wider range of potential real-world applications. Therefore, we present Jumanji, a suite of diverse RL environments speci… ▽ More

    Submitted 15 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 9 pages + 21 pages of appendices and references. Published at ICLR 2024

  6. arXiv:2107.01460  [pdf, other

    cs.LG cs.MA

    Mava: a research library for distributed multi-agent reinforcement learning in JAX

    Authors: Ruan de Kock, Omayma Mahjoub, Sasha Abramowitz, Wiem Khlifi, Callum Rhys Tilbury, Claude Formanek, Andries Smit, Arnu Pretorius

    Abstract: Multi-agent reinforcement learning (MARL) research is inherently computationally expensive and it is often difficult to obtain a sufficient number of experiment samples to test hypotheses and make robust statistical claims. Furthermore, MARL algorithms are typically complex in their design and can be tricky to implement correctly. These aspects of MARL present a difficult challenge when it comes t… ▽ More

    Submitted 15 December, 2023; v1 submitted 3 July, 2021; originally announced July 2021.

  7. arXiv:2104.00793  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Effect of Radiology Report Labeler Quality on Deep Learning Models for Chest X-Ray Interpretation

    Authors: Saahil Jain, Akshay Smit, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Although deep learning models for chest X-ray interpretation are commonly trained on labels generated by automatic radiology report labelers, the impact of improvements in report labeling on the performance of chest X-ray classification models has not been systematically investigated. We first compare the CheXpert, CheXbert, and VisualCheXbert labelers on the task of extracting accurate chest X-ra… ▽ More

    Submitted 27 November, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: In Neural Information Processing Systems (NeurIPS) Workshop on Data-Centric AI (DCAI)

  8. arXiv:2103.14339  [pdf, other

    cs.CV cs.AI cs.LG

    MedSelect: Selective Labeling for Medical Image Classification Combining Meta-Learning with Deep Reinforcement Learning

    Authors: Akshay Smit, Damir Vrabac, Yujie He, Andrew Y. Ng, Andrew L. Beam, Pranav Rajpurkar

    Abstract: We propose a selective learning method using meta-learning and deep reinforcement learning for medical image interpretation in the setting of limited labeling resources. Our method, MedSelect, consists of a trainable deep learning selector that uses image embeddings obtained from contrastive pretraining for determining which images to label, and a non-parametric selector that uses cosine similarit… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

  9. arXiv:2102.11467  [pdf, other

    eess.IV cs.CV cs.LG

    VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels

    Authors: Saahil Jain, Akshay Smit, Steven QH Truong, Chanh DT Nguyen, Minh-Thanh Huynh, Mudit Jain, Victoria A. Young, Andrew Y. Ng, Matthew P. Lungren, Pranav Rajpurkar

    Abstract: Automatic extraction of medical conditions from free-text radiology reports is critical for supervising computer vision models to interpret medical images. In this work, we show that radiologists labeling reports significantly disagree with radiologists labeling corresponding chest X-ray images, which reduces the quality of report labels as proxies for image labels. We develop and evaluate methods… ▽ More

    Submitted 15 March, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: Accepted to ACM Conference on Health, Inference, and Learning (ACM-CHIL) 2021

  10. arXiv:2009.08123  [pdf, other

    cs.CV cs.AI cs.LG

    DLBCL-Morph: Morphological features computed using deep learning for an annotated digital DLBCL image set

    Authors: Damir Vrabac, Akshay Smit, Rebecca Rojansky, Yasodha Natkunam, Ranjana H. Advani, Andrew Y. Ng, Sebastian Fernandez-Pol, Pranav Rajpurkar

    Abstract: Diffuse Large B-Cell Lymphoma (DLBCL) is the most common non-Hodgkin lymphoma. Though histologically DLBCL shows varying morphologies, no morphologic features have been consistently demonstrated to correlate with prognosis. We present a morphologic analysis of histology sections from 209 DLBCL cases with associated clinical and cytogenetic data. Duplicate tissue core sections were arranged in tiss… ▽ More

    Submitted 24 September, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Corrections to folder structure figure

  11. arXiv:2004.09167  [pdf, other

    cs.CL cs.IR cs.LG

    CheXbert: Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT

    Authors: Akshay Smit, Saahil Jain, Pranav Rajpurkar, Anuj Pareek, Andrew Y. Ng, Matthew P. Lungren

    Abstract: The extraction of labels from radiology text reports enables large-scale training of medical imaging models. Existing approaches to report labeling typically rely either on sophisticated feature engineering based on medical domain knowledge or manual annotations by experts. In this work, we introduce a BERT-based approach to medical image report labeling that exploits both the scale of available r… ▽ More

    Submitted 18 October, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted to EMNLP 2020