-
Optimizing Cloud-to-GPU Throughput for Deep Learning With Earth Observation Data
Authors:
Akram Zaytar,
Caleb Robinson,
Girmaw Abebe Tadesse,
Tammy Glazer,
Gilles Hacheme,
Anthony Ortiz,
Rahul M Dodhia,
Juan M Lavista Ferres
Abstract:
Training deep learning models on petabyte-scale Earth observation (EO) data requires separating compute resources from data storage. However, standard PyTorch data loaders cannot keep modern GPUs utilized when streaming GeoTIFF files directly from cloud storage. In this work, we benchmark GeoTIFF loading throughput from both cloud object storage and local SSD, systematically testing different load…
▽ More
Training deep learning models on petabyte-scale Earth observation (EO) data requires separating compute resources from data storage. However, standard PyTorch data loaders cannot keep modern GPUs utilized when streaming GeoTIFF files directly from cloud storage. In this work, we benchmark GeoTIFF loading throughput from both cloud object storage and local SSD, systematically testing different loader configurations and data parameters. We focus on tile-aligned reads and worker thread pools, using Bayesian optimization to find optimal settings for each storage type. Our optimized configurations increase remote data loading throughput by 20x and local throughput by 4x compared to default settings. On three public EO benchmarks, models trained with optimized remote loading achieve the same accuracy as local training within identical time budgets. We improve validation IoU by 6-15% and maintain 85-95% GPU utilization versus 0-30% with standard configurations. Code is publicly available at https://github.com/microsoft/pytorch-cloud-geotiff-optimization
△ Less
Submitted 6 June, 2025;
originally announced June 2025.
-
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models
Authors:
Gilles Quentin Hacheme,
Girmaw Abebe Tadesse,
Caleb Robinson,
Akram Zaytar,
Rahul Dodhia,
Juan M. Lavista Ferres
Abstract:
Classifying geospatial imagery remains a major bottleneck for applications such as disaster response and land-use monitoring-particularly in regions where annotated data is scarce or unavailable. Existing tools (e.g., RS-CLIP) that claim zero-shot classification capabilities for satellite imagery nonetheless rely on task-specific pretraining and adaptation to reach competitive performance. We intr…
▽ More
Classifying geospatial imagery remains a major bottleneck for applications such as disaster response and land-use monitoring-particularly in regions where annotated data is scarce or unavailable. Existing tools (e.g., RS-CLIP) that claim zero-shot classification capabilities for satellite imagery nonetheless rely on task-specific pretraining and adaptation to reach competitive performance. We introduce GeoVision Labeler (GVL), a strictly zero-shot classification framework: a vision Large Language Model (vLLM) generates rich, human-readable image descriptions, which are then mapped to user-defined classes by a conventional Large Language Model (LLM). This modular, and interpretable pipeline enables flexible image classification for a large range of use cases. We evaluated GVL across three benchmarks-SpaceNet v7, UC Merced, and RESISC45. It achieves up to 93.2% zero-shot accuracy on the binary Buildings vs. No Buildings task on SpaceNet v7. For complex multi-class classification tasks (UC Merced, RESISC45), we implemented a recursive LLM-driven clustering to form meta-classes at successive depths, followed by hierarchical classification-first resolving coarse groups, then finer distinctions-to deliver competitive zero-shot performance. GVL is open-sourced at https://github.com/microsoft/geo-vision-labeler to catalyze adoption in real-world geospatial workflows.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Authors:
Mequanent Argaw Muluneh,
Yan-Tsung Peng,
Worku Abebe Degife,
Nigussie Abate Tadesse,
Aknachew Mebreku Demeku,
Li Su
Abstract:
Computational music research plays a critical role in advancing music production, distribution, and understanding across various musical styles worldwide. Despite the immense cultural and religious significance, the Ethiopian Orthodox Tewahedo Church (EOTC) chants are relatively underrepresented in computational music research. This paper contributes to this field by introducing a new dataset spec…
▽ More
Computational music research plays a critical role in advancing music production, distribution, and understanding across various musical styles worldwide. Despite the immense cultural and religious significance, the Ethiopian Orthodox Tewahedo Church (EOTC) chants are relatively underrepresented in computational music research. This paper contributes to this field by introducing a new dataset specifically tailored for analyzing EOTC chants, also known as Yaredawi Zema. This work provides a comprehensive overview of a 10-hour dataset, 369 instances, creation, and curation process, including rigorous quality assurance measures. Our dataset has a detailed word-level temporal boundary and reading tone annotation along with the corresponding chanting mode label of audios. Moreover, we have also identified the chanting options associated with multiple chanting notations in the manuscript by annotating them accordingly. Our goal in making this dataset available to the public 1 is to encourage more research and study of EOTC chants, including lyrics transcription, lyric-to-audio alignment, and music generation tasks. Such research work will advance knowledge and efforts to preserve this distinctive liturgical music, a priceless cultural artifact for the Ethiopian people.
△ Less
Submitted 25 December, 2024;
originally announced December 2024.
-
Distribution Shifts at Scale: Out-of-distribution Detection in Earth Observation
Authors:
Burak Ekim,
Girmaw Abebe Tadesse,
Caleb Robinson,
Gilles Hacheme,
Michael Schmitt,
Rahul Dodhia,
Juan M. Lavista Ferres
Abstract:
Training robust deep learning models is crucial in Earth Observation, where globally deployed models often face distribution shifts that degrade performance, especially in low-data regions. Out-of-distribution (OOD) detection addresses this by identifying inputs that deviate from in-distribution (ID) data. However, existing methods either assume access to OOD data or compromise primary task perfor…
▽ More
Training robust deep learning models is crucial in Earth Observation, where globally deployed models often face distribution shifts that degrade performance, especially in low-data regions. Out-of-distribution (OOD) detection addresses this by identifying inputs that deviate from in-distribution (ID) data. However, existing methods either assume access to OOD data or compromise primary task performance, limiting real-world use. We introduce TARDIS, a post-hoc OOD detection method designed for scalable geospatial deployment. Our core innovation lies in generating surrogate distribution labels by leveraging ID data within the feature space. TARDIS takes a pre-trained model, ID data, and data from an unknown distribution (WILD), separates WILD into surrogate ID and OOD labels based on internal activations, and trains a binary classifier to detect distribution shifts. We validate on EuroSAT and xBD across 17 setups covering covariate and semantic shifts, showing near-upper-bound surrogate labeling performance in 13 cases and matching the performance of top post-hoc activation- and scoring-based methods. Finally, deploying TARDIS on Fields of the World reveals actionable insights into pre-trained model behavior at scale. The code is available at \href{https://github.com/microsoft/geospatial-ood-detection}{https://github.com/microsoft/geospatial-ood-detection}
△ Less
Submitted 8 April, 2025; v1 submitted 17 December, 2024;
originally announced December 2024.
-
Sims: An Interactive Tool for Geospatial Matching and Clustering
Authors:
Akram Zaytar,
Girmaw Abebe Tadesse,
Caleb Robinson,
Eduardo G. Bendito,
Medha Devare,
Meklit Chernet,
Gilles Q. Hacheme,
Rahul Dodhia,
Juan M. Lavista Ferres
Abstract:
Acquiring, processing, and visualizing geospatial data requires significant computing resources, especially for large spatio-temporal domains. This challenge hinders the rapid discovery of predictive features, which is essential for advancing geospatial modeling. To address this, we developed Similarity Search (Sims), a no-code web tool that allows users to perform clustering and similarity search…
▽ More
Acquiring, processing, and visualizing geospatial data requires significant computing resources, especially for large spatio-temporal domains. This challenge hinders the rapid discovery of predictive features, which is essential for advancing geospatial modeling. To address this, we developed Similarity Search (Sims), a no-code web tool that allows users to perform clustering and similarity search over defined regions of interest using Google Earth Engine as a backend. Sims is designed to complement existing modeling tools by focusing on feature exploration rather than model creation. We demonstrate the utility of Sims through a case study analyzing simulated maize yield data in Rwanda, where we evaluate how different combinations of soil, weather, and agronomic features affect the clustering of yield response zones. Sims is open source and available at https://github.com/microsoft/Sims
△ Less
Submitted 20 December, 2024; v1 submitted 13 December, 2024;
originally announced December 2024.
-
Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps
Authors:
Girmaw Abebe Tadesse,
Caleb Robinson,
Charles Mwangi,
Esther Maina,
Joshua Nyakundi,
Luana Marotti,
Gilles Quentin Hacheme,
Hamed Alemohammad,
Rahul Dodhia,
Juan M. Lavista Ferres
Abstract:
In 2023, 58.0% of the African population experienced moderate to severe food insecurity, with 21.6% facing severe food insecurity. Land-use and land-cover maps provide crucial insights for addressing food insecurity by improving agricultural efforts, including mapping and monitoring crop types and estimating yield. The development of global land-cover maps has been facilitated by the increasing av…
▽ More
In 2023, 58.0% of the African population experienced moderate to severe food insecurity, with 21.6% facing severe food insecurity. Land-use and land-cover maps provide crucial insights for addressing food insecurity by improving agricultural efforts, including mapping and monitoring crop types and estimating yield. The development of global land-cover maps has been facilitated by the increasing availability of earth observation data and advancements in geospatial machine learning. However, these global maps exhibit lower accuracy and inconsistencies in Africa, partly due to the lack of representative training data. To address this issue, we propose a data-centric framework with a teacher-student model setup, which uses diverse data sources of satellite images and label examples to produce local land-cover maps. Our method trains a high-resolution teacher model on images with a resolution of 0.331 m/pixel and a low-resolution student model on publicly available images with a resolution of 10 m/pixel. The student model also utilizes the teacher model's output as its weak label examples through knowledge transfer. We evaluated our framework using Murang'a county in Kenya, renowned for its agricultural productivity, as a use case. Our local models achieved higher quality maps, with improvements of 0.14 in the F1 score and 0.21 in Intersection-over-Union, compared to the best global model. Our evaluation also revealed inconsistencies in existing global maps, with a maximum agreement rate of 0.30 among themselves. Our work provides valuable guidance to decision-makers for driving informed decisions to enhance food security.
△ Less
Submitted 11 December, 2024; v1 submitted 1 December, 2024;
originally announced December 2024.
-
AI and the Future of Work in Africa White Paper
Authors:
Jacki O'Neill,
Vukosi Marivate,
Barbara Glover,
Winnie Karanu,
Girmaw Abebe Tadesse,
Akua Gyekye,
Anne Makena,
Wesley Rosslyn-Smith,
Matthew Grollnek,
Charity Wayua,
Rehema Baguma,
Angel Maduke,
Sarah Spencer,
Daniel Kandie,
Dennis Ndege Maari,
Natasha Mutangana,
Maxamed Axmed,
Nyambura Kamau,
Muhammad Adamu,
Frank Swaniker,
Brian Gatuguti,
Jonathan Donner,
Mark Graham,
Janet Mumo,
Caroline Mbindyo
, et al. (50 additional authors not shown)
Abstract:
This white paper is the output of a multidisciplinary workshop in Nairobi (Nov 2023). Led by a cross-organisational team including Microsoft Research, NEPAD, Lelapa AI, and University of Oxford. The workshop brought together diverse thought-leaders from various sectors and backgrounds to discuss the implications of Generative AI for the future of work in Africa. Discussions centred around four key…
▽ More
This white paper is the output of a multidisciplinary workshop in Nairobi (Nov 2023). Led by a cross-organisational team including Microsoft Research, NEPAD, Lelapa AI, and University of Oxford. The workshop brought together diverse thought-leaders from various sectors and backgrounds to discuss the implications of Generative AI for the future of work in Africa. Discussions centred around four key themes: Macroeconomic Impacts; Jobs, Skills and Labour Markets; Workers' Perspectives and Africa-Centris AI Platforms. The white paper provides an overview of the current state and trends of generative AI and its applications in different domains, as well as the challenges and risks associated with its adoption and regulation. It represents a diverse set of perspectives to create a set of insights and recommendations which aim to encourage debate and collaborative action towards creating a dignified future of work for everyone across Africa.
△ Less
Submitted 15 November, 2024;
originally announced November 2024.
-
Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning
Authors:
Girmaw Abebe Tadesse,
Caleb Robinson,
Gilles Quentin Hacheme,
Akram Zaytar,
Rahul Dodhia,
Tsering Wangyal Shawa,
Juan M. Lavista Ferres,
Emmanuel H. Kreike
Abstract:
This study explores object detection in historical aerial photographs of Namibia to identify long-term environmental changes. Specifically, we aim to identify key objects -- Waterholes, Omuti homesteads, and Big trees -- around Oshikango in Namibia using sub-meter gray-scale aerial imagery from 1943 and 1972. In this work, we propose a workflow for analyzing historical aerial imagery using a deep…
▽ More
This study explores object detection in historical aerial photographs of Namibia to identify long-term environmental changes. Specifically, we aim to identify key objects -- Waterholes, Omuti homesteads, and Big trees -- around Oshikango in Namibia using sub-meter gray-scale aerial imagery from 1943 and 1972. In this work, we propose a workflow for analyzing historical aerial imagery using a deep semantic segmentation model on sparse hand-labels. To this end, we employ a number of strategies including class-weighting, pseudo-labeling and empirical p-value-based filtering to balance skewed and sparse representations of objects in the ground truth data. Results demonstrate the benefits of these different training strategies resulting in an average $F_1=0.661$ and $F_1=0.755$ over the three objects of interest for the 1943 and 1972 imagery, respectively. We also identified that the average size of Waterhole and Big trees increased while the average size of Omuti homesteads decreased between 1943 and 1972 reflecting some of the local effects of the massive post-Second World War economic, agricultural, demographic, and environmental changes. This work also highlights the untapped potential of historical aerial photographs in understanding long-term environmental changes beyond Namibia (and Africa). With the lack of adequate satellite technology in the past, archival aerial photography offers a great alternative to uncover decades-long environmental changes.
△ Less
Submitted 21 April, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
Bootstrapping Rare Object Detection in High-Resolution Satellite Imagery
Authors:
Akram Zaytar,
Caleb Robinson,
Gilles Q. Hacheme,
Girmaw A. Tadesse,
Rahul Dodhia,
Juan M. Lavista Ferres,
Lacey F. Hughey,
Jared A. Stabach,
Irene Amoke
Abstract:
Rare object detection is a fundamental task in applied geospatial machine learning, however is often challenging due to large amounts of high-resolution satellite or aerial imagery and few or no labeled positive samples to start with. This paper addresses the problem of bootstrapping such a rare object detection task assuming there is no labeled data and no spatial prior over the area of interest.…
▽ More
Rare object detection is a fundamental task in applied geospatial machine learning, however is often challenging due to large amounts of high-resolution satellite or aerial imagery and few or no labeled positive samples to start with. This paper addresses the problem of bootstrapping such a rare object detection task assuming there is no labeled data and no spatial prior over the area of interest. We propose novel offline and online cluster-based approaches for sampling patches that are significantly more efficient, in terms of exposing positive samples to a human annotator, than random sampling. We apply our methods for identifying bomas, or small enclosures for herd animals, in the Serengeti Mara region of Kenya and Tanzania. We demonstrate a significant enhancement in detection efficiency, achieving a positive sampling rate increase from 2% (random) to 30%. This advancement enables effective machine learning mapping even with minimal labeling budgets, exemplified by an F1 score on the boma detection task of 0.51 with a budget of 300 total patches.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Weak Labeling for Cropland Mapping in Africa
Authors:
Gilles Quentin Hacheme,
Akram Zaytar,
Girmaw Abebe Tadesse,
Caleb Robinson,
Rahul Dodhia,
Juan M. Lavista Ferres,
Stephen Wood
Abstract:
Cropland mapping can play a vital role in addressing environmental, agricultural, and food security challenges. However, in the context of Africa, practical applications are often hindered by the limited availability of high-resolution cropland maps. Such maps typically require extensive human labeling, thereby creating a scalability bottleneck. To address this, we propose an approach that utilize…
▽ More
Cropland mapping can play a vital role in addressing environmental, agricultural, and food security challenges. However, in the context of Africa, practical applications are often hindered by the limited availability of high-resolution cropland maps. Such maps typically require extensive human labeling, thereby creating a scalability bottleneck. To address this, we propose an approach that utilizes unsupervised object clustering to refine existing weak labels, such as those obtained from global cropland maps. The refined labels, in conjunction with sparse human annotations, serve as training data for a semantic segmentation network designed to identify cropland areas. We conduct experiments to demonstrate the benefits of the improved weak labels generated by our method. In a scenario where we train our model with only 33 human-annotated labels, the F_1 score for the cropland category increases from 0.53 to 0.84 when we add the mined negative labels.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
Efficient Representation of the Activation Space in Deep Neural Networks
Authors:
Tanya Akumu,
Celia Cintas,
Girmaw Abebe Tadesse,
Adebayo Oshingbesan,
Skyler Speakman,
Edward McFowland III
Abstract:
The representations of the activation space of deep neural networks (DNNs) are widely utilized for tasks like natural language processing, anomaly detection and speech recognition. Due to the diverse nature of these tasks and the large size of DNNs, an efficient and task-independent representation of activations becomes crucial. Empirical p-values have been used to quantify the relative strength o…
▽ More
The representations of the activation space of deep neural networks (DNNs) are widely utilized for tasks like natural language processing, anomaly detection and speech recognition. Due to the diverse nature of these tasks and the large size of DNNs, an efficient and task-independent representation of activations becomes crucial. Empirical p-values have been used to quantify the relative strength of an observed node activation compared to activations created by already-known inputs. Nonetheless, keeping raw data for these calculations increases memory resource consumption and raises privacy concerns. To this end, we propose a model-agnostic framework for creating representations of activations in DNNs using node-specific histograms to compute p-values of observed activations without retaining already-known inputs. Our proposed approach demonstrates promising potential when validated with multiple network architectures across various downstream tasks and compared with the kernel density estimates and brute-force empirical baselines. In addition, the framework reduces memory usage by 30% with up to 4 times faster p-value computing time while maintaining state of-the-art detection power in downstream tasks such as the detection of adversarial attacks and synthesized content. Moreover, as we do not persist raw data at inference time, we could potentially reduce susceptibility to attacks and privacy issues.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
DMLR: Data-centric Machine Learning Research -- Past, Present and Future
Authors:
Luis Oala,
Manil Maskey,
Lilith Bat-Leah,
Alicia Parrish,
Nezihe Merve Gürel,
Tzu-Sheng Kuo,
Yang Liu,
Rotem Dror,
Danilo Brajovic,
Xiaozhe Yao,
Max Bartolo,
William A Gaviria Rojas,
Ryan Hileman,
Rainier Aliment,
Michael W. Mahoney,
Meg Risdal,
Matthew Lease,
Wojciech Samek,
Debojyoti Dutta,
Curtis G Northcutt,
Cody Coleman,
Braden Hancock,
Bernard Koch,
Girmaw Abebe Tadesse,
Bojan Karlaš
, et al. (13 additional authors not shown)
Abstract:
Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods tow…
▽ More
Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods towards positive scientific, societal and business impact.
△ Less
Submitted 1 June, 2024; v1 submitted 21 November, 2023;
originally announced November 2023.
-
Domain-agnostic and Multi-level Evaluation of Generative Models
Authors:
Girmaw Abebe Tadesse,
Jannis Born,
Celia Cintas,
William Ogallo,
Dmitry Zubarev,
Matteo Manica,
Komminist Weldemariam
Abstract:
While the capabilities of generative models heavily improved in different domains (images, text, graphs, molecules, etc.), their evaluation metrics largely remain based on simplified quantities or manual inspection with limited practicality. To this end, we propose a framework for Multi-level Performance Evaluation of Generative mOdels (MPEGO), which could be employed across different domains. MPE…
▽ More
While the capabilities of generative models heavily improved in different domains (images, text, graphs, molecules, etc.), their evaluation metrics largely remain based on simplified quantities or manual inspection with limited practicality. To this end, we propose a framework for Multi-level Performance Evaluation of Generative mOdels (MPEGO), which could be employed across different domains. MPEGO aims to quantify generation performance hierarchically, starting from a sub-feature-based low-level evaluation to a global features-based high-level evaluation. MPEGO offers great customizability as the employed features are entirely user-driven and can thus be highly domain/problem-specific while being arbitrarily complex (e.g., outcomes of experimental procedures). We validate MPEGO using multiple generative models across several datasets from the material discovery domain. An ablation study is conducted to study the plausibility of intermediate steps in MPEGO. Results demonstrate that MPEGO provides a flexible, user-driven, and multi-level evaluation framework, with practical insights on the generation quality. The framework, source code, and experiments will be available at https://github.com/GT4SD/mpego.
△ Less
Submitted 20 January, 2023;
originally announced January 2023.
-
BON: An extended public domain dataset for human activity recognition
Authors:
Girmaw Abebe Tadesse,
Oliver Bent,
Komminist Weldemariam,
Md. Abrar Istiak,
Taufiq Hasan,
Andrea Cavallaro
Abstract:
Body-worn first-person vision (FPV) camera enables to extract a rich source of information on the environment from the subject's viewpoint. However, the research progress in wearable camera-based egocentric office activity understanding is slow compared to other activity environments (e.g., kitchen and outdoor ambulatory), mainly due to the lack of adequate datasets to train more sophisticated (e.…
▽ More
Body-worn first-person vision (FPV) camera enables to extract a rich source of information on the environment from the subject's viewpoint. However, the research progress in wearable camera-based egocentric office activity understanding is slow compared to other activity environments (e.g., kitchen and outdoor ambulatory), mainly due to the lack of adequate datasets to train more sophisticated (e.g., deep learning) models for human activity recognition in office environments. This paper provides details of a large and publicly available office activity dataset (BON) collected in different office settings across three geographical locations: Barcelona (Spain), Oxford (UK) and Nairobi (Kenya), using a chest-mounted GoPro Hero camera. The BON dataset contains eighteen common office activities that can be categorised into person-to-person interactions (e.g., Chat with colleagues), person-to-object (e.g., Writing on a whiteboard), and proprioceptive (e.g., Walking). Annotation is provided for each segment of video with 5-seconds duration. Generally, BON contains 25 subjects and 2639 total segments. In order to facilitate further research in the sub-domain, we have also provided results that could be used as baselines for future studies.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
Existence and convergence of commutative mappings on some results of fixed point theory in a class of generalized non-expansive mappings
Authors:
Gezahegn Anberber Tadesse
Abstract:
In this paper, we introduce a commutative mappings satisfying the class of generalized non-expansive mappings which is wider than the class of mappings satisfying the condition (C), so called Condition B gamma, mu. The results obtained in this paper extend and generalized non-expansive mappings and other results in this direction. Different properties and some fixed point results for the mappings…
▽ More
In this paper, we introduce a commutative mappings satisfying the class of generalized non-expansive mappings which is wider than the class of mappings satisfying the condition (C), so called Condition B gamma, mu. The results obtained in this paper extend and generalized non-expansive mappings and other results in this direction. Different properties and some fixed point results for the mappings are obtained here.
Keywords: Non expansive mappings, Condition (C), Condition B gamma mu, Commuting mappings, Fixed point.
△ Less
Submitted 8 July, 2022; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Model-free feature selection to facilitate automatic discovery of divergent subgroups in tabular data
Authors:
Girmaw Abebe Tadesse,
William Ogallo,
Celia Cintas,
Skyler Speakman
Abstract:
Data-centric AI encourages the need of cleaning and understanding of data in order to achieve trustworthy AI. Existing technologies, such as AutoML, make it easier to design and train models automatically, but there is a lack of a similar level of capabilities to extract data-centric insights. Manual stratification of tabular data per a feature (e.g., gender) is limited to scale up for higher feat…
▽ More
Data-centric AI encourages the need of cleaning and understanding of data in order to achieve trustworthy AI. Existing technologies, such as AutoML, make it easier to design and train models automatically, but there is a lack of a similar level of capabilities to extract data-centric insights. Manual stratification of tabular data per a feature (e.g., gender) is limited to scale up for higher feature dimension, which could be addressed using automatic discovery of divergent subgroups. Nonetheless, these automatic discovery techniques often search across potentially exponential combinations of features that could be simplified using a preceding feature selection step. Existing feature selection techniques for tabular data often involve fitting a particular model in order to select important features. However, such model-based selection is prone to model-bias and spurious correlations in addition to requiring extra resource to design, fine-tune and train a model. In this paper, we propose a model-free and sparsity-based automatic feature selection (SAFS) framework to facilitate automatic discovery of divergent subgroups. Different from filter-based selection techniques, we exploit the sparsity of objective measures among feature values to rank and select features. We validated SAFS across two publicly available datasets (MIMIC-III and Allstate Claims) and compared it with six existing feature selection methods. SAFS achieves a reduction of feature selection time by a factor of 81x and 104x, averaged cross the existing methods in the MIMIC-III and Claims datasets respectively. SAFS-selected features are also shown to achieve competitive detection performance, e.g., 18.3% of features selected by SAFS in the Claims dataset detected divergent samples similar to those detected by using the whole features with a Jaccard similarity of 0.95 but with a 16x reduction in detection time.
△ Less
Submitted 8 March, 2022;
originally announced March 2022.
-
Towards Creativity Characterization of Generative Models via Group-based Subset Scanning
Authors:
Celia Cintas,
Payel Das,
Brian Quanz,
Girmaw Abebe Tadesse,
Skyler Speakman,
Pin-Yu Chen
Abstract:
Deep generative models, such as Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs), have been employed widely in computational creativity research. However, such models discourage out-of-distribution generation to avoid spurious sample generation, thereby limiting their creativity. Thus, incorporating research on human creativity into generative deep learning techniques pre…
▽ More
Deep generative models, such as Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs), have been employed widely in computational creativity research. However, such models discourage out-of-distribution generation to avoid spurious sample generation, thereby limiting their creativity. Thus, incorporating research on human creativity into generative deep learning techniques presents an opportunity to make their outputs more compelling and human-like. As we see the emergence of generative models directed toward creativity research, a need for machine learning-based surrogate metrics to characterize creative output from these models is imperative. We propose group-based subset scanning to identify, quantify, and characterize creative processes by detecting a subset of anomalous node-activations in the hidden layers of the generative models. Our experiments on the standard image benchmarks, and their "creatively generated" variants, reveal that the proposed subset scores distribution is more useful for detecting creative processes in the activation space rather than the pixel space. Further, we found that creative samples generate larger subsets of anomalies than normal or non-creative samples across datasets. The node activations highlighted during the creative decoding process are different from those responsible for the normal sample generation. Lastly, we assess if the images from the subsets selected by our method were also found creative by human evaluators, presenting a link between creativity perception in humans and node activations within deep neural nets.
△ Less
Submitted 26 May, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Sparsity-based Feature Selection for Anomalous Subgroup Discovery
Authors:
Girmaw Abebe Tadesse,
William Ogallo,
Catherine Wanjiru,
Charles Wachira,
Isaiah Onando Mulang',
Vibha Anand,
Aisha Walcott-Bryant,
Skyler Speakman
Abstract:
Anomalous pattern detection aims to identify instances where deviation from normalcy is evident, and is widely applicable across domains. Multiple anomalous detection techniques have been proposed in the state of the art. However, there is a common lack of a principled and scalable feature selection method for efficient discovery. Existing feature selection techniques are often conducted by optimi…
▽ More
Anomalous pattern detection aims to identify instances where deviation from normalcy is evident, and is widely applicable across domains. Multiple anomalous detection techniques have been proposed in the state of the art. However, there is a common lack of a principled and scalable feature selection method for efficient discovery. Existing feature selection techniques are often conducted by optimizing the performance of prediction outcomes rather than its systemic deviations from the expected. In this paper, we proposed a sparsity-based automated feature selection (SAFS) framework, which encodes systemic outcome deviations via the sparsity of feature-driven odds ratios. SAFS is a model-agnostic approach with usability across different discovery techniques. SAFS achieves more than $3\times$ reduction in computation time while maintaining detection performance when validated on publicly available critical care dataset. SAFS also results in a superior performance when compared against multiple baselines for feature selection.
△ Less
Submitted 6 January, 2022;
originally announced January 2022.
-
Post-discovery Analysis of Anomalous Subsets
Authors:
Isaiah Onando Mulang',
William Ogallo,
Girmaw Abebe Tadesse,
Aisha Walcott-Bryant
Abstract:
Analyzing the behaviour of a population in response to disease and interventions is critical to unearth variability in healthcare as well as understand sub-populations that require specialized attention, but also to assist in designing future interventions. Two aspects become very essential in such analysis namely: i) Discovery of differentiating patterns exhibited by sub-populations, and ii) Char…
▽ More
Analyzing the behaviour of a population in response to disease and interventions is critical to unearth variability in healthcare as well as understand sub-populations that require specialized attention, but also to assist in designing future interventions. Two aspects become very essential in such analysis namely: i) Discovery of differentiating patterns exhibited by sub-populations, and ii) Characterization of the identified subpopulations. For the discovery phase, an array of approaches in the anomalous pattern detection literature have been employed to reveal differentiating patterns, especially to identify anomalous subgroups. However, these techniques are limited to describing the anomalous subgroups and offer little in form of insightful characterization, thereby limiting interpretability and understanding of these data-driven techniques in clinical practices. In this work, we propose an analysis of differentiated output (rather than discovery) and quantify anomalousness similarly to the counter-factual setting. To this end we design an approach to perform post-discovery analysis of anomalous subsets, in which we initially identify the most important features on the anomalousness of the subsets, then by perturbation, the approach seeks to identify the least number of changes necessary to lose anomalousness. Our approach is presented and the evaluation results on the 2019 MarketScan Commercial Claims and Medicare data, show that extra insights can be obtained by extrapolated examination of the identified subgroups.
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
Automated Supervised Feature Selection for Differentiated Patterns of Care
Authors:
Catherine Wanjiru,
William Ogallo,
Girmaw Abebe Tadesse,
Charles Wachira,
Isaiah Onando Mulang',
Aisha Walcott-Bryant
Abstract:
An automated feature selection pipeline was developed using several state-of-the-art feature selection techniques to select optimal features for Differentiating Patterns of Care (DPOC). The pipeline included three types of feature selection techniques; Filters, Wrappers and Embedded methods to select the top K features. Five different datasets with binary dependent variables were used and their di…
▽ More
An automated feature selection pipeline was developed using several state-of-the-art feature selection techniques to select optimal features for Differentiating Patterns of Care (DPOC). The pipeline included three types of feature selection techniques; Filters, Wrappers and Embedded methods to select the top K features. Five different datasets with binary dependent variables were used and their different top K optimal features selected. The selected features were tested in the existing multi-dimensional subset scanning (MDSS) where the most anomalous subpopulations, most anomalous subsets, propensity scores, and effect of measures were recorded to test their performance. This performance was compared with four similar metrics gained after using all covariates in the dataset in the MDSS pipeline. We found out that despite the different feature selection techniques used, the data distribution is key to note when determining the technique to use.
△ Less
Submitted 5 November, 2021;
originally announced November 2021.
-
Pattern Detection in the Activation Space for Identifying Synthesized Content
Authors:
Celia Cintas,
Skyler Speakman,
Girmaw Abebe Tadesse,
Victor Akinwande,
Edward McFowland III,
Komminist Weldemariam
Abstract:
Generative Adversarial Networks (GANs) have recently achieved unprecedented success in photo-realistic image synthesis from low-dimensional random noise. The ability to synthesize high-quality content at a large scale brings potential risks as the generated samples may lead to misinformation that can create severe social, political, health, and business hazards. We propose SubsetGAN to identify ge…
▽ More
Generative Adversarial Networks (GANs) have recently achieved unprecedented success in photo-realistic image synthesis from low-dimensional random noise. The ability to synthesize high-quality content at a large scale brings potential risks as the generated samples may lead to misinformation that can create severe social, political, health, and business hazards. We propose SubsetGAN to identify generated content by detecting a subset of anomalous node-activations in the inner layers of pre-trained neural networks. These nodes, as a group, maximize a non-parametric measure of divergence away from the expected distribution of activations created from real data. This enable us to identify synthesised images without prior knowledge of their distribution. SubsetGAN efficiently scores subsets of nodes and returns the group of nodes within the pre-trained classifier that contributed to the maximum score. The classifier can be a general fake classifier trained over samples from multiple sources or the discriminator network from different GANs. Our approach shows consistently higher detection power than existing detection methods across several state-of-the-art GANs (PGGAN, StarGAN, and CycleGAN) and over different proportions of generated content.
△ Less
Submitted 27 May, 2021; v1 submitted 26 May, 2021;
originally announced May 2021.
-
Out-of-Distribution Detection in Dermatology using Input Perturbation and Subset Scanning
Authors:
Hannah Kim,
Girmaw Abebe Tadesse,
Celia Cintas,
Skyler Speakman,
Kush Varshney
Abstract:
Recent advances in deep learning have led to breakthroughs in the development of automated skin disease classification. As we observe an increasing interest in these models in the dermatology space, it is crucial to address aspects such as the robustness towards input data distribution shifts. Current skin disease models could make incorrect inferences for test samples from different hardware devi…
▽ More
Recent advances in deep learning have led to breakthroughs in the development of automated skin disease classification. As we observe an increasing interest in these models in the dermatology space, it is crucial to address aspects such as the robustness towards input data distribution shifts. Current skin disease models could make incorrect inferences for test samples from different hardware devices and clinical settings or unknown disease samples, which are out-of-distribution (OOD) from the training samples. To this end, we propose a simple yet effective approach that detect these OOD samples prior to making any decision. The detection is performed via scanning in the latent space representation (e.g., activations of the inner layers of any pre-trained skin disease classifier). The input samples could also perturbed to maximise divergence of OOD samples. We validate our ODD detection approach in two use cases: 1) identify samples collected from different protocols, and 2) detect samples from unknown disease classes. Additionally, we evaluate the performance of the proposed approach and compare it with other state-of-the-art methods. Furthermore, data-driven dermatology applications may deepen the disparity in clinical care across racial and ethnic groups since most datasets are reported to suffer from bias in skin tone distribution. Therefore, we also evaluate the fairness of these OOD detection methods across different skin tones. Our experiments resulted in competitive performance across multiple datasets in detecting OOD samples, which could be used (in the future) to design more effective transfer learning techniques prior to inferring on these samples.
△ Less
Submitted 2 June, 2021; v1 submitted 24 May, 2021;
originally announced May 2021.
-
DeepMI: Deep Multi-lead ECG Fusion for Identifying Myocardial Infarction and its Occurrence-time
Authors:
Girmaw Abebe Tadesse,
Hamza Javed,
Yong Liu,
Jin Liu,
Jiyan Chen,
Komminist Weldemariam,
Tingting Zhu
Abstract:
Myocardial Infarction (MI) has the highest mortality of all cardiovascular diseases (CVDs). Detection of MI and information regarding its occurrence-time in particular, would enable timely interventions that may improve patient outcomes, thereby reducing the global rise in CVD deaths. Electrocardiogram (ECG) recordings are currently used to screen MI patients. However, manual inspection of ECGs is…
▽ More
Myocardial Infarction (MI) has the highest mortality of all cardiovascular diseases (CVDs). Detection of MI and information regarding its occurrence-time in particular, would enable timely interventions that may improve patient outcomes, thereby reducing the global rise in CVD deaths. Electrocardiogram (ECG) recordings are currently used to screen MI patients. However, manual inspection of ECGs is time-consuming and prone to subjective bias. Machine learning methods have been adopted for automated ECG diagnosis, but most approaches require extraction of ECG beats or consider leads independently of one another. We propose an end-to-end deep learning approach, DeepMI, to classify MI from normal cases as well as identifying the time-occurrence of MI (defined as acute, recent and old), using a collection of fusion strategies on 12 ECG leads at data-, feature-, and decision-level. In order to minimise computational overhead, we employ transfer learning using existing computer vision networks. Moreover, we use recurrent neural networks to encode the longitudinal information inherent in ECGs. We validated DeepMI on a dataset collected from 17,381 patients, in which over 323,000 samples were extracted per ECG lead. We were able to classify normal cases as well as acute, recent and old onset cases of MI, with AUROCs of 96.7%, 82.9%, 68.6% and 73.8%, respectively. We have demonstrated a multi-lead fusion approach to detect the presence and occurrence-time of MI. Our end-to-end framework provides flexibility for different levels of multi-lead ECG fusion and performs feature extraction via transfer learning.
△ Less
Submitted 31 March, 2021;
originally announced April 2021.
-
Prediction of neonatal mortality in Sub-Saharan African countries using data-level linkage of multiple surveys
Authors:
Girmaw Abebe Tadesse,
Celia Cintas,
Skyler Speakman,
Komminist Weldemariam
Abstract:
Existing datasets available to address crucial problems, such as child mortality and family planning discontinuation in developing countries, are not ample for data-driven approaches. This is partly due to disjoint data collection efforts employed across locations, times, and variations of modalities. On the other hand, state-of-the-art methods for small data problem are confined to image modaliti…
▽ More
Existing datasets available to address crucial problems, such as child mortality and family planning discontinuation in developing countries, are not ample for data-driven approaches. This is partly due to disjoint data collection efforts employed across locations, times, and variations of modalities. On the other hand, state-of-the-art methods for small data problem are confined to image modalities. In this work, we proposed a data-level linkage of disjoint surveys across Sub-Saharan African countries to improve prediction performance of neonatal death and provide cross-domain explainability.
△ Less
Submitted 25 November, 2020;
originally announced November 2020.
-
Limits of Transfer Learning
Authors:
Jake Williams,
Abel Tadesse,
Tyler Sam,
Huey Sun,
George D. Montanez
Abstract:
Transfer learning involves taking information and insight from one problem domain and applying it to a new problem domain. Although widely used in practice, theory for transfer learning remains less well-developed. To address this, we prove several novel results related to transfer learning, showing the need to carefully select which sets of information to transfer and the need for dependence betw…
▽ More
Transfer learning involves taking information and insight from one problem domain and applying it to a new problem domain. Although widely used in practice, theory for transfer learning remains less well-developed. To address this, we prove several novel results related to transfer learning, showing the need to carefully select which sets of information to transfer and the need for dependence between transferred information and target problems. Furthermore, we prove how the degree of probabilistic change in an algorithm using transfer learning places an upper bound on the amount of improvement possible. These results build on the algorithmic search framework for machine learning, allowing the results to apply to a wide range of learning problems using transfer.
△ Less
Submitted 22 June, 2020;
originally announced June 2020.
-
Decomposable Probability-of-Success Metrics in Algorithmic Search
Authors:
Tyler Sam,
Jake Williams,
Abel Tadesse,
Huey Sun,
George Montanez
Abstract:
Previous studies have used a specific success metric within an algorithmic search framework to prove machine learning impossibility results. However, this specific success metric prevents us from applying these results on other forms of machine learning, e.g. transfer learning. We define decomposable metrics as a category of success metrics for search problems which can be expressed as a linear op…
▽ More
Previous studies have used a specific success metric within an algorithmic search framework to prove machine learning impossibility results. However, this specific success metric prevents us from applying these results on other forms of machine learning, e.g. transfer learning. We define decomposable metrics as a category of success metrics for search problems which can be expressed as a linear operation on a probability distribution to solve this issue. Using an arbitrary decomposable metric to measure the success of a search, we demonstrate theorems which bound success in various ways, generalizing several existing results in the literature.
△ Less
Submitted 3 January, 2020;
originally announced January 2020.
-
Severity Detection Tool for Patients with Infectious Disease
Authors:
Girmaw Abebe Tadesse,
Tingting Zhu,
Nhan Le Nguyen Thanh,
Nguyen Thanh Hung,
Ha Thi Hai Duong,
Truong Huu Khanh,
Pham Van Quang,
Duc Duong Tran,
LamMinh Yen,
H Rogier Van Doorn,
Nguyen Van Hao,
John Prince,
Hamza Javed,
DaniKiyasseh,
Le Van Tan,
Louise Thwaites,
David A. Clifton
Abstract:
Hand, foot and mouth disease (HFMD) and tetanus are serious infectious diseases in low and middle income countries. Tetanus in particular has a high mortality rate and its treatment is resource-demanding. Furthermore, HFMD often affects a large number of infants and young children. As a result, its treatment consumes enormous healthcare resources, especially when outbreaks occur. Autonomic nervous…
▽ More
Hand, foot and mouth disease (HFMD) and tetanus are serious infectious diseases in low and middle income countries. Tetanus in particular has a high mortality rate and its treatment is resource-demanding. Furthermore, HFMD often affects a large number of infants and young children. As a result, its treatment consumes enormous healthcare resources, especially when outbreaks occur. Autonomic nervous system dysfunction (ANSD) is the main cause of death for both HFMD and tetanus patients. However, early detection of ANSD is a difficult and challenging problem. In this paper, we aim to provide a proof-of-principle to detect the ANSD level automatically by applying machine learning techniques to physiological patient data, such as electrocardiogram (ECG) and photoplethysmogram (PPG) waveforms, which can be collected using low-cost wearable sensors. Efficient features are extracted that encode variations in the waveforms in the time and frequency domains. A support vector machine is employed to classify the ANSD levels. The proposed approach is validated on multiple datasets of HFMD and tetanus patients in Vietnam. Results show that encouraging performance is achieved in classifying ANSD levels. Moreover, the proposed features are simple, more generalisable and outperformed the standard heart rate variability (HRV) analysis. The proposed approach would facilitate both the diagnosis and treatment of infectious diseases in low and middle income countries, and thereby improve overall patient care.
△ Less
Submitted 10 December, 2019;
originally announced December 2019.
-
Multiple Aging Mechanisms in Ferroelectric Deuterated Potassium Dihydrogen Phosphate
Authors:
Gregory A. Fields,
Samuel F. Cieszynski,
Bo Zhao,
Kidan A. Tadesse,
Eugene V. Colla,
M. B. Weissman
Abstract:
The anomalously large dielectric aging in ferroelectric partially deuterated potassium dihydrogen phosphate (DKDP) is found to have multiple distinct mechanisms. Two components cause decreases in dielectric response over a limited range of fields around the aging field. A large fraction of this aging occurs on time scales of ~1000s after a field change, as expected for a hydrogen/deuterium diffusi…
▽ More
The anomalously large dielectric aging in ferroelectric partially deuterated potassium dihydrogen phosphate (DKDP) is found to have multiple distinct mechanisms. Two components cause decreases in dielectric response over a limited range of fields around the aging field. A large fraction of this aging occurs on time scales of ~1000s after a field change, as expected for a hydrogen/deuterium diffusion mechanism. A slower component can give almost complete loss of domain-wall dielectric response at the aging field after weeks of aging. There is also a particularly unusual aging in which the dielectric response increases with time after rapid cooling.
△ Less
Submitted 21 April, 2019; v1 submitted 2 December, 2018;
originally announced December 2018.
-
Acousto-optic modulation of a photonic crystal nanocavity with Lamb waves in microwave K band
Authors:
Semere A. Tadesse,
Huan Li,
Qiyu Liu,
Mo Li
Abstract:
Integrating nanoscale electromechanical transducers and nanophotonic devices potentially can enable new acousto-optic devices to reach unprecedented high frequencies and modulation efficiency. Here, we demonstrate acousto-optic modulation of a photonic crystal nanocavity using Lamb waves with frequency up to 19 GHz, reaching the microwave K band. The devices are fabricated in suspended aluminum ni…
▽ More
Integrating nanoscale electromechanical transducers and nanophotonic devices potentially can enable new acousto-optic devices to reach unprecedented high frequencies and modulation efficiency. Here, we demonstrate acousto-optic modulation of a photonic crystal nanocavity using Lamb waves with frequency up to 19 GHz, reaching the microwave K band. The devices are fabricated in suspended aluminum nitride membrane. Excitation of acoustic waves is achieved with interdigital transducers with periods as small as 300 nm. Confining both acoustic wave and optical wave within the thickness of the membrane leads to improved acousto-optic modulation efficiency in the new devices than that obtained in previous surface acoustic wave devices. Our system demonstrates a novel scalable optomechanical platform where strong acousto-optic coupling between cavity-confined photons and high frequency traveling phonons can be explored.
△ Less
Submitted 7 August, 2015;
originally announced August 2015.
-
Nanophotonic cavity optomechanics with propagating phonons in microwave Ku band
Authors:
Huan Li,
Semere A. Tadesse,
Qiyu Liu,
Mo Li
Abstract:
Sideband-resolved coupling between multiple photonic nanocavities and propagating mechanical waves in microwave Ku-band is demonstrated. Coherent and strong photon-phonon interaction is manifested with optomechanically induced transparency and absorption, and phase-coherent interaction in multiple cavities. Inside an echo chamber it is shown that a phonon pulse can interact with an embedded nanoca…
▽ More
Sideband-resolved coupling between multiple photonic nanocavities and propagating mechanical waves in microwave Ku-band is demonstrated. Coherent and strong photon-phonon interaction is manifested with optomechanically induced transparency and absorption, and phase-coherent interaction in multiple cavities. Inside an echo chamber it is shown that a phonon pulse can interact with an embedded nanocavity for multiple times. Our device provides a scalable platform to optomechanically couple phonons and photons for microwave photonics and quantum photonics.
△ Less
Submitted 2 July, 2015;
originally announced July 2015.
-
Sub-optical wavelength acoustic wave modulation of integrated photonic resonators at microwave frequencies
Authors:
Semere Ayalew Tadesse,
Mo Li
Abstract:
Light-sound interactions have long been exploited in various acousto-optic devices based on bulk crystalline materials. Conventionally these devices operate in megahertz frequency range where the acoustic wavelength is much longer than the optical wavelength and a long interaction length is required to attain significant coupling. With nanoscale transducers, acoustic waves with sub-optical wavelen…
▽ More
Light-sound interactions have long been exploited in various acousto-optic devices based on bulk crystalline materials. Conventionally these devices operate in megahertz frequency range where the acoustic wavelength is much longer than the optical wavelength and a long interaction length is required to attain significant coupling. With nanoscale transducers, acoustic waves with sub-optical wavelengths can now be excited to induce strong acousto-optic coupling in nanophotonic devices. Here we demonstrate microwave frequency surface acoustic wave transducers co-integrated with nanophotonic resonators on piezoelectric aluminum nitride substrates. Acousto-optic modulation of the resonance modes at above 10 GHz with the acoustic wavelength significantly below the optical wavelength is achieved. The phase and modal matching conditions in this scheme are investigated for efficient modulation. The new acousto-optic platform can lead to novel optical devices based on nonlinear Brillouin processes and provides a direct, wideband link between optical and microwave photons for microwave photonics and quantum optomechanics.
△ Less
Submitted 3 October, 2014;
originally announced October 2014.