-
DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes
Authors:
Junlin Guo,
James R. Zimmer-Dauphinee,
Jordan M. Nieusma,
Siqi Lu,
Quan Liu,
Ruining Deng,
Can Cui,
Jialin Yue,
Yizhe Lin,
Tianyuan Yao,
Juming Xiong,
Junchao Zhu,
Chongyu Qu,
Yuechen Yang,
Mitchell Wilkes,
Xiao Wang,
Parker VanValkenburgh,
Steven A. Wernke,
Yuankai Huo
Abstract:
By mapping sites at large scales using remotely sensed data, archaeologists can generate unique insights into long-term demographic trends, inter-regional social networks, and past adaptations to climate change. Remote sensing surveys complement field-based approaches, and their reach can be especially great when combined with deep learning and computer vision techniques. However, conventional supervised deep learning methods face challenges in annotating fine-grained archaeological features at scale. While recent vision foundation models have shown remarkable success in learning from large-scale remote sensing data with minimal annotations, most off-the-shelf solutions are designed for RGB images rather than multi-spectral satellite imagery, such as the 8-band data used in our study. In this paper, we introduce DeepAndes, a transformer-based vision foundation model trained on three million multi-spectral satellite images, specifically tailored for Andean archaeology. DeepAndes incorporates a customized DINOv2 self-supervised learning algorithm optimized for 8-band multi-spectral imagery, marking the first foundation model designed explicitly for the Andes region. We evaluate its image understanding performance through imbalanced image classification, image instance retrieval, and pixel-level semantic segmentation tasks. Our experiments show that DeepAndes achieves superior F1 scores, mean average precision, and Dice scores in few-shot learning scenarios, significantly outperforming models trained from scratch or pre-trained on smaller datasets. This underscores the effectiveness of large-scale self-supervised pre-training in archaeological remote sensing. Code will be available at https://github.com/geopacha/DeepAndes.
Submitted 28 April, 2025;
originally announced April 2025.
-
How Good Are We? Evaluating Cell AI Foundation Models in Kidney Pathology with Human-in-the-Loop Enrichment
Authors:
Junlin Guo,
Siqi Lu,
Can Cui,
Ruining Deng,
Tianyuan Yao,
Zhewen Tao,
Yizhe Lin,
Marilyn Lionts,
Quan Liu,
Juming Xiong,
Yu Wang,
Shilin Zhao,
Catie Chang,
Mitchell Wilkes,
Mengmeng Yin,
Haichun Yang,
Yuankai Huo
Abstract:
Training AI foundation models has emerged as a promising large-scale learning approach for addressing real-world healthcare challenges, including digital pathology. While many of these models have been developed for tasks like disease diagnosis and tissue quantification using extensive and diverse training datasets, their readiness for deployment on some of the arguably simplest tasks, such as nuclei segmentation within a single organ (e.g., the kidney), remains uncertain. This paper seeks to answer this key question, "How good are we?", by thoroughly evaluating the performance of recent cell foundation models on a curated multi-center, multi-disease, and multi-species external testing dataset. Additionally, we tackle a more challenging question, "How can we improve?", by developing and assessing human-in-the-loop data enrichment strategies aimed at enhancing model performance while minimizing the reliance on pixel-level human annotation. To address the first question, we curated a multi-center, multi-disease, and multi-species dataset consisting of 2,542 kidney whole slide images (WSIs). Three state-of-the-art (SOTA) cell foundation models (Cellpose, StarDist, and CellViT) were selected for evaluation. To tackle the second question, we explored data enrichment algorithms by distilling predictions from the different foundation models with a human-in-the-loop framework, aiming to further enhance foundation model performance with minimal human effort. Our experimental results showed that all three foundation models improved over their baselines after fine-tuning with enriched data. Interestingly, the baseline model with the highest F1 score does not yield the best segmentation outcomes after fine-tuning. This study establishes a benchmark for the development and deployment of cell vision foundation models tailored for real-world data applications.
Submitted 31 October, 2024;
originally announced November 2024.
-
Assessment of Cell Nuclei AI Foundation Models in Kidney Pathology
Authors:
Junlin Guo,
Siqi Lu,
Can Cui,
Ruining Deng,
Tianyuan Yao,
Zhewen Tao,
Yizhe Lin,
Marilyn Lionts,
Quan Liu,
Juming Xiong,
Yu Wang,
Shilin Zhao,
Catie Chang,
Mitchell Wilkes,
Mengmeng Yin,
Haichun Yang,
Yuankai Huo
Abstract:
Cell nuclei instance segmentation is a crucial task in digital kidney pathology. Traditional automatic segmentation methods often lack generalizability when applied to unseen datasets. Recently, the success of foundation models (FMs) has provided a more generalizable solution, potentially enabling the segmentation of any cell type. In this study, we perform a large-scale evaluation of three widely used state-of-the-art (SOTA) cell nuclei foundation models (Cellpose, StarDist, and CellViT). Specifically, we created a highly diverse evaluation dataset consisting of 2,542 kidney whole slide images (WSIs) collected from both human and rodent sources, encompassing various tissue types, sizes, and staining methods. To our knowledge, this is the largest-scale evaluation of its kind to date. Our quantitative analysis of the prediction distribution reveals a persistent performance gap in kidney pathology. Among the evaluated models, CellViT demonstrated superior performance in segmenting nuclei in kidney pathology. However, none of the foundation models are perfect; a performance gap remains in general nuclei segmentation for kidney pathology.
Submitted 6 February, 2025; v1 submitted 9 August, 2024;
originally announced August 2024.
-
Semi-Supervised Contrastive Learning for Remote Sensing: Identifying Ancient Urbanization in the South Central Andes
Authors:
Jiachen Xu,
Junlin Guo,
James Zimmer-Dauphinee,
Quan Liu,
Yuxuan Shi,
Zuhayr Asad,
D. Mitchell Wilkes,
Parker VanValkenburgh,
Steven A. Wernke,
Yuankai Huo
Abstract:
Archaeology has long faced fundamental issues of sampling and scalar representation. Traditionally, the local-to-regional-scale views of settlement patterns are produced through systematic pedestrian surveys. Recently, systematic manual survey of satellite and aerial imagery has enabled continuous distributional views of archaeological phenomena at interregional scales. However, such 'brute force' manual imagery survey methods are both time- and labor-intensive, as well as prone to inter-observer differences in sensitivity and specificity. The development of self-supervised learning methods offers a scalable learning scheme for locating archaeological features using unlabeled satellite and historical aerial images. However, archaeological features are generally only visible in a very small proportion relative to the landscape, while the modern contrastive-supervised learning approach typically yields an inferior performance on highly imbalanced datasets. In this work, we propose a framework to address this long-tail problem. As opposed to the existing contrastive learning approaches that treat the labelled and unlabeled data separately, our proposed method reforms the learning paradigm under a semi-supervised setting in order to utilize the precious annotated data (<7% in our setting). Specifically, the highly unbalanced nature of the data is employed as the prior knowledge in order to form pseudo negative pairs by ranking the similarities between unannotated image patches and annotated anchor images. In this study, we used 95,358 unlabeled images and 5,830 labelled images in order to solve the issues associated with detecting ancient buildings from a long-tailed satellite image dataset. From the results, our semi-supervised contrastive learning model achieved a promising testing balanced accuracy of 79.0%, which is a 3.8% improvement as compared to other state-of-the-art approaches.
Submitted 15 April, 2023; v1 submitted 13 December, 2021;
originally announced December 2021.
-
The Pauli and Lévy-Leblond Equations, and the Spin Current Density
Authors:
James M. Wilkes
Abstract:
We review the literature on the Pauli equation and its current density, discussing the progression from the original phenomenological version of Pauli to its derivation by Lévy-Leblond from a linearization of the Schrödinger equation. It was established conclusively by Lévy-Leblond's work that the spin of a spin-1/2 particle such as an electron is non-relativistic in nature, contrary to what was often stated following Dirac's derivation of a relativistic wave equation, and his subsequent demonstration that Pauli's spin interaction term appeared in the non-relativistic limit. In this limit, the Gordon decomposition of the associated probability current density was found to contain a spin-dependent term. Such a term does not follow, however, from the usual derivation of the current density from the Pauli equation, although various physically motivated but otherwise ad hoc explanations were put forward to account for it. We comment on the only exception to these of which we are aware implying the spin term in the current was in fact non-relativistic in nature. However, the earlier work of Lévy-Leblond had already shown, with no additional assumptions, that this term was a prominent feature of the current density derived from his equation. Hence, just as with the spin itself, the spin current was non-relativistic, claims to the contrary notwithstanding. We present a somewhat simplified derivation of the Lévy-Leblond equation and its current density, commenting on possibilities for experimental work that might indicate measurable consequences of the spin term in the current density.
Submitted 14 October, 2019; v1 submitted 8 August, 2019;
originally announced August 2019.
-
Large-scale silicon quantum photonics implementing arbitrary two-qubit processing
Authors:
Xiaogang Qiang,
Xiaoqi Zhou,
Jianwei Wang,
Callum M. Wilkes,
Thomas Loke,
Sean O'Gara,
Laurent Kling,
Graham D. Marshall,
Raffaele Santagati,
Timothy C. Ralph,
Jingbo B. Wang,
Jeremy L. O'Brien,
Mark G. Thompson,
Jonathan C. F. Matthews
Abstract:
Integrated optics is an engineering solution proposed for exquisite control of photonic quantum information. Here we use silicon photonics and the linear combination of quantum operators scheme to realise a fully programmable two-qubit quantum processor. The device is fabricated with readily available CMOS based processing and comprises four nonlinear photon-sources, four filters, eighty-two beam splitters and fifty-eight individually addressable phase shifters. To demonstrate performance, we programmed the device to implement ninety-eight various two-qubit unitary operations (with average quantum process fidelity of 93.2$\pm$4.5%), a two-qubit quantum approximate optimization algorithm and efficient simulation of Szegedy directed quantum walks. This fosters further use of the linear combination architecture with silicon photonics for future photonic quantum processors.
Submitted 25 September, 2018;
originally announced September 2018.
-
60 dB high-extinction auto-configured Mach-Zehnder interferometer
Authors:
Callum M. Wilkes,
Xiaogang Qiang,
Jianwei Wang,
Raffaele Santagati,
Stefano Paesani,
Xiaoqi Zhou,
David A. B. Miller,
Graham D. Marshall,
Mark G. Thompson,
Jeremy L. O'Brien
Abstract:
Imperfections in integrated photonics manufacturing have a detrimental effect on the maximal achievable visibility in interferometric architectures. These limits have profound implications for further photonics technological developments and in particular for quantum photonics technologies. Active optimisation approaches, together with reconfigurable photonics, have been proposed as a solution to overcome this. In this paper, we demonstrate an ultra-high (>60 dB) extinction ratio in a silicon photonic device consisting of cascaded Mach-Zehnder interferometers, in which additional interferometers function as variable beamsplitters. The imperfections of fabricated beamsplitters are compensated using an automated progressive optimization algorithm with no requirement for pre-calibration. This work shows the possibility of integrating and accurately controlling linear-optical components for large-scale quantum information processing and other applications.
Submitted 1 September, 2016;
originally announced September 2016.
-
An explanation of the "negative neutrino mass squared" anomaly in tritium $β$-decay based on a theory of mass
Authors:
R. L. Ingraham,
G. A. Luna-Acosta,
J. M. Wilkes
Abstract:
A proposed solution of the anomalous behavior of the electron spectrum near the endpoint of tritium $β$-decay is offered. It is based on a new theory of mass in which mass becomes a dynamical variable, and the electron in the tritium $β$-decay has a narrow mass distribution. The predicted Kurie plots explain the main feature ("$m_ν^2 < 0$") of this anomalous behavior.
Submitted 5 December, 2000;
originally announced December 2000.