Showing 1–15 of 15 results for author: Amar, J

  1. arXiv:2506.10910

    cs.CL

    Magistral

    Authors: Mistral-AI, Abhinav Rastogi, Albert Q. Jiang, Andy Lo, Gabrielle Berrada, Guillaume Lample, Jason Rute, Joep Barmentlo, Karmesh Yadav, Kartik Khandelwal, Khyathi Raghavi Chandu, Léonard Blier, Lucile Saulnier, Matthieu Dinot, Maxime Darrin, Neha Gupta, Roman Soletskyi, Sagar Vaze, Teven Le Scao, Yihan Wang, Adam Yang, Alexander H. Liu, Alexandre Sablayrolles, Amélie Héliou, et al. (76 additional authors not shown)

    Abstract: We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground-up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a s…

    Submitted 12 June, 2025; originally announced June 2025.

  2. arXiv:2506.04032

    cs.CL

    AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data

    Authors: Sina Rashidian, Nan Li, Jonathan Amar, Jong Ha Lee, Sam Pugh, Eric Yang, Geoff Masterson, Myoung Cha, Yugang Jia, Akhil Vaid

    Abstract: Background: We present a Patient Simulator that leverages real-world patient encounters, which cover a broad range of conditions and symptoms, to provide synthetic test subjects for development and testing of healthcare agentic models. The simulator provides a realistic approach to patient presentation and multi-turn conversation with a symptom-checking agent. Objectives: (1) To construct and instan…

    Submitted 4 June, 2025; originally announced June 2025.

  3. arXiv:2502.13135

    cs.LG cs.AI cs.CL

    Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions

    Authors: Taedong Yun, Eric Yang, Mustafa Safdari, Jong Ha Lee, Vaishnavi Vinod Kumar, S. Sara Mahdavi, Jonathan Amar, Derek Peyton, Reut Aharony, Andreas Michaelides, Logan Schneider, Isaac Galatzer-Levy, Yugang Jia, John Canny, Arthur Gretton, Maja Matarić

    Abstract: We present an end-to-end framework for generating synthetic users for evaluating interactive agents designed to encourage positive behavior changes, such as in health and lifestyle coaching. The synthetic users are grounded in health and lifestyle conditions, specifically sleep and diabetes management in this study, to ensure realistic interactions with the health coaching agent. Synthetic users a…

    Submitted 4 June, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: Accepted to the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

    ACM Class: I.2.7

  4. arXiv:2410.14041

    cs.LG cs.CL

    From Barriers to Tactics: A Behavioral Science-Informed Agentic Workflow for Personalized Nutrition Coaching

    Authors: Eric Yang, Tomas Garcia, Hannah Williams, Bhawesh Kumar, Martin Ramé, Eileen Rivera, Yiran Ma, Jonathan Amar, Caricia Catalani, Yugang Jia

    Abstract: Effective management of cardiometabolic conditions requires sustained positive nutrition habits, often hindered by complex and individualized barriers. Direct human management is simply not scalable, while previous attempts aimed at automating nutrition coaching lack the personalization needed to address these diverse challenges. This paper introduces a novel LLM-powered agentic workflow designed…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 22 pages

  5. arXiv:2407.18044

    cs.LG

    The Geometry of Queries: Query-Based Innovations in Retrieval-Augmented Generation

    Authors: Eric Yang, Jonathan Amar, Jong Ha Lee, Bhawesh Kumar, Yugang Jia

    Abstract: Digital health chatbots powered by Large Language Models (LLMs) have the potential to significantly improve personal health management for chronic conditions by providing accessible and on-demand health coaching and question-answering. However, these chatbots risk providing unverified and inaccurate information because LLMs generate responses based on patterns learned from diverse internet data. R…

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 22 pages

  6. arXiv:2405.06093

    cs.LG cs.CL

    Selective Fine-tuning on LLM-labeled Data May Reduce Reliance on Human Annotation: A Case Study Using Schedule-of-Event Table Detection

    Authors: Bhawesh Kumar, Jonathan Amar, Eric Yang, Nan Li, Yugang Jia

    Abstract: Large Language Models (LLMs) have demonstrated their efficacy across a broad spectrum of tasks in healthcare applications. However, LLMs often need to be fine-tuned on task-specific, expert-annotated data to achieve optimal performance, which can be expensive and time-consuming. In this study, we fine-tune PaLM-2 with parameter-efficient fine-tuning (PEFT) using noisy labels obtained from gemini-pr…

    Submitted 5 August, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: 23 pages. Published in MLHC 2024

  7. arXiv:2308.09726

    cs.LG cs.AI cs.CY cs.MA

    Equitable Restless Multi-Armed Bandits: A General Framework Inspired By Digital Health

    Authors: Jackson A. Killian, Manish Jain, Yugang Jia, Jonathan Amar, Erich Huang, Milind Tambe

    Abstract: Restless multi-armed bandits (RMABs) are a popular framework for algorithmic decision making in sequential settings with limited resources. RMABs are increasingly being used for sensitive decisions such as in public health, treatment scheduling, anti-poaching, and (the motivation for this work) digital health. For such high-stakes settings, decisions must both improve outcomes and prevent disp…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 16 pages, 8 figures, 2 tables

  8. arXiv:1810.10661

    cs.GT math.OC

    The Second-Price Knapsack Problem: Near-Optimal Real Time Bidding in Internet Advertisement

    Authors: Jonathan Amar, Nicholas Renegar

    Abstract: In many online advertisement (ad) exchanges, ad slots are each sold via a separate second-price auction. This paper considers the bidder's problem of maximizing the value of ads they purchase in these auctions, subject to budget constraints. This 'second-price knapsack' problem presents challenges when devising a bidding strategy because of the uncertain resource consumption: bidders win if they b…

    Submitted 12 March, 2020; v1 submitted 24 October, 2018; originally announced October 2018.

  9. arXiv:1711.06696

    cond-mat.stat-mech

    Extinction and Survival in Two-Species Annihilation

    Authors: J. G. Amar, E. Ben-Naim, S. M. Davis, P. L. Krapivsky

    Abstract: We study diffusion-controlled two-species annihilation with a finite number of particles. In this stochastic process, particles move diffusively, and when two particles of opposite type come into contact, the two annihilate. We focus on the behavior in three spatial dimensions and for initial conditions where particles are confined to a compact domain. Generally, one species outnumbers the other,…

    Submitted 17 November, 2017; originally announced November 2017.

    Comments: 8 pages, 7 figures

    Journal ref: Phys. Rev. E 97, 022112 (2018)

  10. arXiv:1607.01516

    math.ST q-bio.QM

    A new gene co-expression network analysis based on Core Structure Detection (CSD)

    Authors: A-C Brunet, J-M Azais, J-M Loubes, J Amar, R Burcelin

    Abstract: We propose a novel method to cluster gene networks. Based on a dissimilarity built using correlation structures, we consider networks that connect all the genes based on the strength of their dissimilarity. The large number of genes requires the use of a threshold to find sparse structures in the graph. In this work, using the notion of graph coreness, we identify clusters of genes which are cent…

    Submitted 6 July, 2016; originally announced July 2016.

  11. arXiv:1103.2553

    cond-mat.mtrl-sci cond-mat.soft

    Effects of cluster diffusion on the island density and size distribution in submonolayer island growth

    Authors: Yevgen A. Kryukov, Jacques G. Amar

    Abstract: The effects of cluster diffusion on the submonolayer island density and island-size distribution are studied for the case of irreversible growth of compact islands on a 2D substrate. In our model, we assume instantaneous coalescence of circular islands, while the cluster mobility is assumed to exhibit power-law decay as a function of island-size with exponent mu. Results are presented for mu = 1/2…

    Submitted 13 March, 2011; originally announced March 2011.

    Comments: 12 pages, submitted to Physical Review E

  12. Effects of shadowing in oblique-incidence metal(100) epitaxial growth

    Authors: Yunsic Shim, Jacques G. Amar

    Abstract: The effects of shadowing in oblique incidence metal (100) epitaxial growth are studied using a simplified model. We find that many of the features observed in Cu(100) growth, including the existence of a transition from anisotropic mounds to ripples perpendicular to the beam, can be explained purely by geometrical effects. We also show that the formation of (111) facets is crucial to the develop…

    Submitted 9 August, 2006; originally announced August 2006.

    Comments: 4 1/4 pages, 4 figures, submitted to Physical Review Letters

  13. arXiv:cond-mat/0406540

    cond-mat.mtrl-sci cond-mat.stat-mech

    Synchronous relaxation algorithm for parallel kinetic Monte Carlo

    Authors: Yunsic Shim, Jacques G. Amar

    Abstract: We investigate the applicability of the synchronous relaxation (SR) algorithm to parallel kinetic Monte Carlo simulations of simple models of thin-film growth. A variety of techniques for optimizing the parallel efficiency are also presented. We find that the parallel efficiency is determined by three main factors: the calculation overhead due to relaxation iterations to correct boundary even…

    Submitted 22 June, 2004; originally announced June 2004.

    Comments: 13 pages, 15 figures

  14. arXiv:cond-mat/0406379

    cond-mat.mtrl-sci

    Synchronous sublattice algorithm for parallel kinetic Monte Carlo

    Authors: Yunsic Shim, Jacques G. Amar

    Abstract: The standard kinetic Monte Carlo algorithm is an extremely efficient method to carry out serial simulations of dynamical processes such as thin-film growth. However, in some cases it is necessary to study systems over extended time and length scales, and therefore a parallel algorithm is desired. Here we describe an efficient, semi-rigorous synchronous sublattice algorithm for parallel kinetic M…

    Submitted 24 June, 2004; v1 submitted 16 June, 2004; originally announced June 2004.

    Comments: 13 pages, 14 figures, Fig. 1 replaced with clearer version, corrected references and equation citations, author emails added

  15. Asymptotic Capture-Number and Island-Size Distributions for One-Dimensional Irreversible Submonolayer Growth

    Authors: J. G. Amar, M. N. Popescu

    Abstract: Using a set of evolution equations [J. G. Amar et al., Phys. Rev. Lett. 86, 3092 (2001)] for the average gap-size between islands, we calculate analytically the asymptotic scaled capture-number distribution (CND) for one-dimensional irreversible submonolayer growth of point islands. The predicted asymptotic CND is in reasonably good agreement with kinetic Monte-Carlo (KMC) results and…

    Submitted 11 July, 2003; originally announced July 2003.

    Comments: 4 pages, 1 figure, submitted to Phys. Rev. B