Skip to main content

Showing 1–30 of 30 results for author: McMillan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.10823  [pdf, ps, other

    cs.CV eess.IV

    From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification

    Authors: Xue Li, Jameson Merkow, Noel C. F. Codella, Alberto Santamaria-Pang, Naiteek Sangani, Alexander Ersoy, Christopher Burt, John W. Garrett, Richard J. Bruce, Joshua D. Warner, Tyler Bradshaw, Ivan Tarapov, Matthew P. Lungren, Alan B. McMillan

    Abstract: Foundation models, pretrained on extensive datasets, have significantly advanced machine learning by providing robust and transferable embeddings applicable to various domains, including medical imaging diagnostics. This study evaluates the utility of embeddings derived from both general-purpose and medical domain-specific foundation models for training lightweight adapter models in multi-class ra… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 11 pages, 5 figures, 4 tables

  2. arXiv:2504.12249  [pdf

    eess.IV cs.CV cs.LG

    Comparative Evaluation of Radiomics and Deep Learning Models for Disease Detection in Chest Radiography

    Authors: Zhijin He, Alan B. McMillan

    Abstract: The application of artificial intelligence (AI) in medical imaging has revolutionized diagnostic practices, enabling advanced analysis and interpretation of radiological data. This study presents a comprehensive evaluation of radiomics-based and deep learning-based approaches for disease detection in chest radiography, focusing on COVID-19, lung opacity, and viral pneumonia. While deep learning mo… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

  3. arXiv:2504.07450  [pdf

    eess.IV cs.AI cs.CV

    Synthetic CT Generation from Time-of-Flight Non-Attenutaion-Corrected PET for Whole-Body PET Attenuation Correction

    Authors: Weijie Chen, James Wang, Alan McMillan

    Abstract: Positron Emission Tomography (PET) imaging requires accurate attenuation correction (AC) to account for photon loss due to tissue density variations. In PET/MR systems, computed tomography (CT), which offers a straightforward estimation of AC is not available. This study presents a deep learning approach to generate synthetic CT (sCT) images directly from Time-of-Flight (TOF) non-attenuation corre… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: 4 pages, 2 figures, ISBI 2025

    MSC Class: 68T05; 92C55 ACM Class: I.2.6; I.2.10

  4. arXiv:2503.11850  [pdf, ps, other

    cs.CR cs.DS cs.LG

    Local Pan-Privacy for Federated Analytics

    Authors: Vitaly Feldman, Audra McMillan, Guy N. Rothblum, Kunal Talwar

    Abstract: Pan-privacy was proposed by Dwork et al. as an approach to designing a private analytics system that retains its privacy properties in the face of intrusions that expose the system's internal state. Motivated by federated telemetry applications, we study local pan-privacy, where privacy should be retained under repeated unannounced intrusions on the local state. We consider the problem of monitori… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  5. arXiv:2502.00528  [pdf, other

    cs.CV cs.CL

    Vision-Language Modeling in PET/CT for Visual Grounding of Positive Findings

    Authors: Zachary Huemann, Samuel Church, Joshua D. Warner, Daniel Tran, Xin Tie, Alan B McMillan, Junjie Hu, Steve Y. Cho, Meghan Lubner, Tyler J. Bradshaw

    Abstract: Vision-language models can connect the text description of an object to its specific location in an image through visual grounding. This has potential applications in enhanced radiology reporting. However, these models require large annotated image-text datasets, which are lacking for PET/CT. We developed an automated pipeline to generate weak labels linking PET/CT report descriptions to their ima… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

  6. arXiv:2501.05309  [pdf, other

    cs.CR cs.DS cs.LG

    Private Selection with Heterogeneous Sensitivities

    Authors: Daniela Antonova, Allegra Laro, Audra McMillan, Lorenz Wolf

    Abstract: Differentially private (DP) selection involves choosing a high-scoring candidate from a finite candidate pool, where each score depends on a sensitive dataset. This problem arises naturally in a variety of contexts including model selection, hypothesis testing, and within many DP algorithms. Classical methods, such as Report Noisy Max (RNM), assume all candidates' scores are equally sensitive to c… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: 21 pages, 18 figures

  7. arXiv:2412.09445  [pdf

    eess.IV cs.CV

    Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis

    Authors: Raj Hansini Khoiwal, Alan B. McMillan

    Abstract: Developing artificial intelligence (AI) and machine learning (ML) models for medical imaging typically involves extensive training and testing on large datasets, consuming significant computational time, energy, and resources. There is a need for more efficient methods that can achieve comparable or superior diagnostic performance without the associated resource burden. We investigated the feasibi… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: 15 pages, 7 figures, 3 tables

  8. arXiv:2412.04142  [pdf, other

    physics.flu-dyn cs.AI

    Methodology for Online Estimation of Rheological Parameters in Polymer Melts Using Deep Learning and Microfluidics

    Authors: Juan Sandubete-López, José L. Risco-Martín, Alexander H. McMillan, Eva Besada-Portas

    Abstract: Microfluidic devices are increasingly used in biological and chemical experiments due to their cost-effectiveness for rheological estimation in fluids. However, these devices often face challenges in terms of accuracy, size, and cost. This study presents a methodology, integrating deep learning, modeling and simulation to enhance the design of microfluidic systems, used to develop an innovative ap… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: 12 pages, 6 figures, Winter Simulation Conference 2024

  9. arXiv:2411.05324  [pdf, other

    cs.LG cs.CV stat.ME

    SASWISE-UE: Segmentation and Synthesis with Interpretable Scalable Ensembles for Uncertainty Estimation

    Authors: Weijie Chen, Alan McMillan

    Abstract: This paper introduces an efficient sub-model ensemble framework aimed at enhancing the interpretability of medical deep learning models, thus increasing their clinical applicability. By generating uncertainty maps, this framework enables end-users to evaluate the reliability of model outputs. We developed a strategy to develop diverse models from a single well-trained checkpoint, facilitating the… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 16 pages, 12 figures, 5 tables

    MSC Class: 62P10; 68T01; 68T37; 92C50

  10. arXiv:2410.23437  [pdf, other

    cs.LG cs.CL cs.IR

    Mind the Gap: A Generalized Approach for Cross-Modal Embedding Alignment

    Authors: Arihan Yadav, Alan McMillan

    Abstract: Retrieval-Augmented Generation (RAG) systems enhance text generation by incorporating external knowledge but often struggle when retrieving context across different text modalities due to semantic gaps. We introduce a generalized projection-based method, inspired by adapter modules in transfer learning, that efficiently bridges these gaps between various text types, such as programming code and ps… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 18 pages, 3 figures

    ACM Class: H.3.3; I.2.7; I.2.6

  11. arXiv:2410.06542  [pdf, other

    eess.IV cs.CV

    MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging

    Authors: Noel C. F. Codella, Ying Jin, Shrey Jain, Yu Gu, Ho Hin Lee, Asma Ben Abacha, Alberto Santamaria-Pang, Will Guyman, Naiteek Sangani, Sheng Zhang, Hoifung Poon, Stephanie Hyland, Shruthi Bannur, Javier Alvarez-Valle, Xue Li, John Garrett, Alan McMillan, Gaurav Rajguru, Madhu Maddi, Nilesh Vijayrania, Rehaan Bhimai, Nick Mecklenburg, Rupal Jain, Daniel Holstein, Naveen Gaur , et al. (6 additional authors not shown)

    Abstract: In this work, we present MedImageInsight, an open-source medical imaging embedding model. MedImageInsight is trained on medical images with associated text and labels across a diverse collection of domains, including X-Ray, CT, MRI, dermoscopy, OCT, fundus photography, ultrasound, histopathology, and mammography. Rigorous evaluations demonstrate MedImageInsight's ability to achieve state-of-the-ar… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  12. arXiv:2406.19566  [pdf, other

    cs.LG cs.CR cs.DS math.ST stat.ML

    Instance-Optimal Private Density Estimation in the Wasserstein Distance

    Authors: Vitaly Feldman, Audra McMillan, Satchit Sivakumar, Kunal Talwar

    Abstract: Estimating the density of a distribution from samples is a fundamental problem in statistics. In many practical settings, the Wasserstein distance is an appropriate error metric for density estimation. For example, when estimating population densities in a geographic region, a small Wasserstein distance means that the estimate is able to capture roughly where the population mass is. In this work w… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  13. Anatomy and Physiology of Artificial Intelligence in PET Imaging

    Authors: Tyler J. Bradshaw, Alan B. McMillan

    Abstract: The influence of artificial intelligence (AI) within the field of nuclear medicine has been rapidly growing. Many researchers and clinicians are seeking to apply AI within PET, and clinicians will soon find themselves engaging with AI-based applications all along the chain of molecular imaging, from image reconstruction to enhanced reporting. This expanding presence of AI in PET imaging will resul… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Journal ref: PET Clin; 16(4):471-482 (2021)

  14. arXiv:2307.15835  [pdf, ps, other

    cs.CR cs.DS cs.LG stat.ML

    Mean Estimation with User-level Privacy under Data Heterogeneity

    Authors: Rachel Cummings, Vitaly Feldman, Audra McMillan, Kunal Talwar

    Abstract: A key challenge in many modern data analysis tasks is that user data are heterogeneous. Different users may possess vastly different numbers of data points. More importantly, it cannot be assumed that all users sample from the same underlying distribution. This is true, for example in language data, where different speech styles result in data heterogeneity. In this work we propose a simple model… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Conference version published at NeurIPS 2022

  15. arXiv:2307.15017  [pdf, other

    cs.CR cs.LG

    Samplable Anonymous Aggregation for Private Federated Data Analysis

    Authors: Kunal Talwar, Shan Wang, Audra McMillan, Vojta Jina, Vitaly Feldman, Pansy Bansal, Bailey Basile, Aine Cahill, Yi Sheng Chan, Mike Chatzidakis, Junye Chen, Oliver Chick, Mona Chitnis, Suman Ganta, Yusuf Goren, Filip Granqvist, Kristine Guo, Frederic Jacobs, Omid Javidbakht, Albert Liu, Richard Low, Dan Mascenik, Steve Myers, David Park, Wonhee Park , et al. (12 additional authors not shown)

    Abstract: We revisit the problem of designing scalable protocols for private statistics and private federated learning when each device holds its private data. Locally differentially private algorithms require little trust but are (provably) limited in their utility. Centrally differentially private algorithms can allow significantly better utility but require a trusted curator. This gap has led to signific… ▽ More

    Submitted 18 July, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: 34 pages

  16. arXiv:2307.11749  [pdf, other

    cs.LG cs.CR

    Differentially Private Heavy Hitter Detection using Federated Analytics

    Authors: Karan Chadha, Junye Chen, John Duchi, Vitaly Feldman, Hanieh Hashemi, Omid Javidbakht, Audra McMillan, Kunal Talwar

    Abstract: In this work, we study practical heuristics to improve the performance of prefix-tree based algorithms for differentially private heavy hitter detection. Our model assumes each user has multiple data points and the goal is to learn as many of the most frequent data points as possible across all users' data with aggregate and local differential privacy. We propose an adaptive hyperparameter tuning… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  17. arXiv:2211.10082  [pdf, other

    cs.CR

    Private Federated Statistics in an Interactive Setting

    Authors: Audra McMillan, Omid Javidbakht, Kunal Talwar, Elliot Briggs, Mike Chatzidakis, Junye Chen, John Duchi, Vitaly Feldman, Yusuf Goren, Michael Hesse, Vojta Jina, Anil Katti, Albert Liu, Cheney Lyford, Joey Meyer, Alex Palmer, David Park, Wonhee Park, Gianni Parsa, Paul Pelzl, Rehan Rishi, Congzheng Song, Shan Wang, Shundong Zhou

    Abstract: Privately learning statistics of events on devices can enable improved user experience. Differentially private algorithms for such problems can benefit significantly from interactivity. We argue that an aggregation protocol can enable an interactive private federated statistics system where user's devices maintain control of the privacy assurance. We describe the architecture of such a system, and… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

  18. arXiv:2210.15819  [pdf, other

    math.ST cs.CR cs.LG

    Instance-Optimal Differentially Private Estimation

    Authors: Audra McMillan, Adam Smith, Jon Ullman

    Abstract: In this work, we study local minimax convergence estimation rates subject to $ε$-differential privacy. Unlike worst-case rates, which may be conservative, algorithms that are locally minimax optimal must adapt to easy instances of the problem. We construct locally minimax differentially private estimators for one-parameter exponential families and estimating the tail rate of a distribution. In the… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

  19. arXiv:2208.04591  [pdf, other

    cs.CR cs.DS cs.LG stat.ML

    Stronger Privacy Amplification by Shuffling for Rényi and Approximate Differential Privacy

    Authors: Vitaly Feldman, Audra McMillan, Kunal Talwar

    Abstract: The shuffle model of differential privacy has gained significant interest as an intermediate trust model between the standard local and central models [EFMRTT19; CSUZZ19]. A key result in this model is that randomly shuffling locally randomized data amplifies differential privacy guarantees. Such amplification implies substantially stronger privacy guarantees for systems in which data is contribut… ▽ More

    Submitted 30 October, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Errata added. 14 pages, 4 figures

  20. arXiv:2106.10333  [pdf, other

    cs.CR cs.LG stat.ME stat.ML

    Non-parametric Differentially Private Confidence Intervals for the Median

    Authors: Joerg Drechsler, Ira Globus-Harris, Audra McMillan, Jayshree Sarathy, Adam Smith

    Abstract: Differential privacy is a restriction on data processing algorithms that provides strong confidentiality guarantees for individual records in the data. However, research on proper statistical inference, that is, research on properly quantifying the uncertainty of the (noisy) sample estimate regarding the true value in the population, is currently still limited. This paper proposes and evaluates se… ▽ More

    Submitted 3 July, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 44 pages, 15 figures

  21. arXiv:2012.12803  [pdf, other

    cs.LG cs.CR cs.DS stat.ML

    Hiding Among the Clones: A Simple and Nearly Optimal Analysis of Privacy Amplification by Shuffling

    Authors: Vitaly Feldman, Audra McMillan, Kunal Talwar

    Abstract: Recent work of Erlingsson, Feldman, Mironov, Raghunathan, Talwar, and Thakurta [EFMRTT19] demonstrates that random shuffling amplifies differential privacy guarantees of locally randomized data. Such amplification implies substantially stronger privacy guarantees for systems in which data is contributed anonymously [BEMMRLRKTS17] and has lead to significant interest in the shuffle model of privacy… ▽ More

    Submitted 7 September, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: Updated to include numerical experiments for Renyi differential privacy

  22. arXiv:2007.12674  [pdf, other

    stat.ME cs.CR cs.LG

    Controlling Privacy Loss in Sampling Schemes: an Analysis of Stratified and Cluster Sampling

    Authors: Mark Bun, Jörg Drechsler, Marco Gaboardi, Audra McMillan, Jayshree Sarathy

    Abstract: Sampling schemes are fundamental tools in statistics, survey design, and algorithm design. A fundamental result in differential privacy is that a differentially private mechanism run on a simple random sample of a population provides stronger privacy guarantees than the same algorithm run on the entire population. However, in practice, sampling designs are often more complex than the simple, data-… ▽ More

    Submitted 21 June, 2023; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: Appeared at FORC 2022

  23. arXiv:2007.05157  [pdf, other

    cs.LG cs.CR stat.ME stat.ML

    Differentially Private Simple Linear Regression

    Authors: Daniel Alabi, Audra McMillan, Jayshree Sarathy, Adam Smith, Salil Vadhan

    Abstract: Economics and social science research often require analyzing datasets of sensitive personal information at fine granularity, with models fit to small subsets of the data. Unfortunately, such fine-grained analysis can easily reveal sensitive individual information. We study algorithms for simple linear regression that satisfy differential privacy, a constraint which guarantees that an algorithm's… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: 20 pages, 18 figures

  24. arXiv:1908.00656  [pdf

    eess.IV cs.CV

    Robustifying deep networks for image segmentation

    Authors: Zheng Liu, Jinnian Zhang, Varun Jog, Po-Ling Loh, Alan B McMillan

    Abstract: Purpose: The purpose of this study is to investigate the robustness of a commonly-used convolutional neural network for image segmentation with respect to visually-subtle adversarial perturbations, and suggest new methods to make these networks more robust to such perturbations. Materials and Methods: In this retrospective study, the accuracy of brain tumor segmentation was studied in subjects wit… ▽ More

    Submitted 1 August, 2019; originally announced August 2019.

  25. arXiv:1905.11947  [pdf, ps, other

    cs.DS cs.CR cs.IT cs.LG stat.ML

    Private Identity Testing for High-Dimensional Distributions

    Authors: Clément L. Canonne, Gautam Kamath, Audra McMillan, Jonathan Ullman, Lydia Zakynthinou

    Abstract: In this work we present novel differentially private identity (goodness-of-fit) testers for natural and widely studied classes of multivariate product distributions: Gaussians in $\mathbb{R}^d$ with known covariance and product distributions over $\{\pm 1\}^{d}$. Our testers have improved sample complexity compared to those derived from previous techniques, and are the first testers whose sample c… ▽ More

    Submitted 3 March, 2022; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Discussing a mistake in the proof of one of the algorithms (Theorem 1.2, computationally inefficient tester), and pointing to follow-up work by Narayanan (2022) who improves upon our results and fixes this mistake

  26. arXiv:1811.11148  [pdf, ps, other

    cs.DS cs.CR cs.IT cs.LG stat.ML

    The Structure of Optimal Private Tests for Simple Hypotheses

    Authors: Clément L. Canonne, Gautam Kamath, Audra McMillan, Adam Smith, Jonathan Ullman

    Abstract: Hypothesis testing plays a central role in statistical inference, and is used in many settings where privacy concerns are paramount. This work answers a basic question about privately testing simple hypotheses: given two distributions $P$ and $Q$, and a privacy level $\varepsilon$, how many i.i.d. samples are needed to distinguish $P$ from $Q$ subject to $\varepsilon$-differential privacy, and wha… ▽ More

    Submitted 2 April, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: To appear in STOC 2019

  27. arXiv:1806.06427  [pdf, ps, other

    cs.CR cs.DS

    Property Testing for Differential Privacy

    Authors: Anna Gilbert, Audra McMillan

    Abstract: We consider the problem of property testing for differential privacy: with black-box access to a purportedly private algorithm, can we verify its privacy guarantees? In particular, we show that any privacy guarantee that can be efficiently verified is also efficiently breakable in the sense that there exist two databases between which we can efficiently distinguish. We give lower bounds on the que… ▽ More

    Submitted 13 February, 2019; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: Allerton, 2018

  28. arXiv:1711.10019  [pdf, ps, other

    cs.LG

    Online Learning via the Differential Privacy Lens

    Authors: Jacob Abernethy, Young Hun Jung, Chansoo Lee, Audra McMillan, Ambuj Tewari

    Abstract: In this paper, we use differential privacy as a lens to examine online learning in both full and partial information settings. The differential privacy framework is, at heart, less about privacy and more about algorithmic stability, and thus has found application in domains well beyond those where information security is central. Here we develop an algorithmic property called one-step differential… ▽ More

    Submitted 28 October, 2019; v1 submitted 27 November, 2017; originally announced November 2017.

  29. arXiv:1706.05916  [pdf, other

    cs.CR cs.DB

    Local Differential Privacy for Physical Sensor Data and Sparse Recovery

    Authors: Anna C. Gilbert, Audra McMillan

    Abstract: In this work we explore the utility of locally differentially private thermal sensor data. We design a locally differentially private recovery algorithm for the 1-dimensional, discrete heat source location problem and analyse its performance in terms of the Earth Mover Distance error. Our work indicates that it is possible to produce locally private sensor measurements that both keep the exact loc… ▽ More

    Submitted 23 March, 2018; v1 submitted 30 May, 2017; originally announced June 2017.

    Comments: appeared at CISS 2018

  30. arXiv:1604.01871  [pdf, ps, other

    math.ST cs.LG

    When is Nontrivial Estimation Possible for Graphons and Stochastic Block Models?

    Authors: Audra McMillan, Adam Smith

    Abstract: Block graphons (also called stochastic block models) are an important and widely-studied class of models for random networks. We provide a lower bound on the accuracy of estimators for block graphons with a large number of blocks. We show that, given only the number $k$ of blocks and an upper bound $ρ$ on the values (connection probabilities) of the graphon, every estimator incurs error at least o… ▽ More

    Submitted 7 April, 2016; originally announced April 2016.

    Comments: 11 pages