Skip to main content

Showing 1–29 of 29 results for author: Ferres, J M L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.20241  [pdf

    cs.LG cs.DC

    Energy Use of AI Inference: Efficiency Pathways and Test-Time Compute

    Authors: Felipe Oviedo, Fiodar Kazhamiaka, Esha Choukse, Allen Kim, Amy Luers, Melanie Nakagawa, Ricardo Bianchini, Juan M. Lavista Ferres

    Abstract: As AI inference scales to billions of queries and emerging reasoning and agentic workflows increase token demand, reliable estimates of per-query energy use are increasingly important for capacity planning, emissions accounting, and efficiency prioritization. Many public estimates are inconsistent and overstate energy use, because they extrapolate from limited benchmarks and fail to reflect effici… ▽ More

    Submitted 24 September, 2025; originally announced September 2025.

    Comments: A preprint version with DOI is available at Zenodo: https://doi.org/10.5281/zenodo.17188770

  2. arXiv:2508.10219  [pdf, ps, other

    cs.LG cs.CV

    AI-Driven Detection and Analysis of Handwriting on Seized Ivory: A Tool to Uncover Criminal Networks in the Illicit Wildlife Trade

    Authors: Will Fein, Ryan J. Horwitz, John E. Brown III, Amit Misra, Felipe Oviedo, Kevin White, Juan M. Lavista Ferres, Samuel K. Wasser

    Abstract: The transnational ivory trade continues to drive the decline of elephant populations across Africa, and trafficking networks remain difficult to disrupt. Tusks seized by law enforcement officials carry forensic information on the traffickers responsible for their export, including DNA evidence and handwritten markings made by traffickers. For 20 years, analyses of tusk DNA have identified where el… ▽ More

    Submitted 15 August, 2025; v1 submitted 13 August, 2025; originally announced August 2025.

    Comments: Submitted. 13 pages, 5 figures, 4 tables

  3. arXiv:2507.08605  [pdf, ps, other

    cs.LG

    Remote Sensing Reveals Adoption of Sustainable Rice Farming Practices Across Punjab, India

    Authors: Ando Shah, Rajveer Singh, Akram Zaytar, Girmaw Abebe Tadesse, Caleb Robinson, Negar Tafti, Stephen A. Wood, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: Rice cultivation consumes 24-30% of global freshwater, creating critical water management challenges in major rice-producing regions. Sustainable irrigation practices like direct seeded rice (DSR) and alternate wetting and drying (AWD) can reduce water use by 20-40% while maintaining yields, helping secure long-term agricultural productivity as water scarcity intensifies - a key component of the Z… ▽ More

    Submitted 11 July, 2025; originally announced July 2025.

    Comments: Dataset and code will be published shortly and links updated in v2

  4. arXiv:2506.06235  [pdf, ps, other

    cs.CV

    Optimizing Cloud-to-GPU Throughput for Deep Learning With Earth Observation Data

    Authors: Akram Zaytar, Caleb Robinson, Girmaw Abebe Tadesse, Tammy Glazer, Gilles Hacheme, Anthony Ortiz, Rahul M Dodhia, Juan M Lavista Ferres

    Abstract: Training deep learning models on petabyte-scale Earth observation (EO) data requires separating compute resources from data storage. However, standard PyTorch data loaders cannot keep modern GPUs utilized when streaming GeoTIFF files directly from cloud storage. In this work, we benchmark GeoTIFF loading throughput from both cloud object storage and local SSD, systematically testing different load… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  5. arXiv:2505.24340  [pdf, other

    cs.CV cs.CL cs.LG

    GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models

    Authors: Gilles Quentin Hacheme, Girmaw Abebe Tadesse, Caleb Robinson, Akram Zaytar, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: Classifying geospatial imagery remains a major bottleneck for applications such as disaster response and land-use monitoring-particularly in regions where annotated data is scarce or unavailable. Existing tools (e.g., RS-CLIP) that claim zero-shot classification capabilities for satellite imagery nonetheless rely on task-specific pretraining and adaptation to reach competitive performance. We intr… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    ACM Class: I.2.10; I.2.7; I.4.8; I.5.3

  6. arXiv:2505.01225  [pdf, ps, other

    cs.CV

    Core-Set Selection for Data-efficient Land Cover Segmentation

    Authors: Keiller Nogueira, Akram Zaytar, Wanli Ma, Ribana Roscher, Ronny Hänsch, Caleb Robinson, Anthony Ortiz, Simone Nsutezo, Rahul Dodhia, Juan M. Lavista Ferres, Oktay Karakuş, Paul L. Rosin

    Abstract: The increasing accessibility of remotely sensed data and the potential of such data to inform large-scale decision-making has driven the development of deep learning models for many Earth Observation tasks. Traditionally, such models must be trained on large datasets. However, the common assumption that broadly larger datasets lead to better outcomes tends to overlook the complexities of the data… ▽ More

    Submitted 1 August, 2025; v1 submitted 2 May, 2025; originally announced May 2025.

  7. arXiv:2503.14860  [pdf, other

    cs.LG cs.CV

    Global Renewables Watch: A Temporal Dataset of Solar and Wind Energy Derived from Satellite Imagery

    Authors: Caleb Robinson, Anthony Ortiz, Allen Kim, Rahul Dodhia, Andrew Zolli, Shivaprakash K Nagaraju, James Oakleaf, Joe Kiesecker, Juan M. Lavista Ferres

    Abstract: We present a comprehensive global temporal dataset of commercial solar photovoltaic (PV) farms and onshore wind turbines, derived from high-resolution satellite imagery analyzed quarterly from the fourth quarter of 2017 to the second quarter of 2024. We create this dataset by training deep learning-based segmentation models to identify these renewable energy installations from satellite imagery, t… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  8. arXiv:2501.08490  [pdf, other

    cs.CV cs.LG

    FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing

    Authors: Isaac Corley, Simone Fobi Nsutezo, Anthony Ortiz, Caleb Robinson, Rahul Dodhia, Juan M. Lavista Ferres, Peyman Najafirad

    Abstract: Remote sensing imagery is dense with objects and contextual visual information. There is a recent trend to combine paired satellite images and text captions for pretraining performant encoders for downstream tasks. However, while contrastive image-text methods like CLIP enable vision-language alignment and zero-shot classification ability, vision-only downstream performance tends to degrade compar… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

  9. arXiv:2412.13394  [pdf, other

    cs.CV cs.AI cs.LG

    Distribution Shifts at Scale: Out-of-distribution Detection in Earth Observation

    Authors: Burak Ekim, Girmaw Abebe Tadesse, Caleb Robinson, Gilles Hacheme, Michael Schmitt, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: Training robust deep learning models is crucial in Earth Observation, where globally deployed models often face distribution shifts that degrade performance, especially in low-data regions. Out-of-distribution (OOD) detection addresses this by identifying inputs that deviate from in-distribution (ID) data. However, existing methods either assume access to OOD data or compromise primary task perfor… ▽ More

    Submitted 8 April, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

  10. arXiv:2412.10184  [pdf, other

    cs.CV cs.LG physics.geo-ph

    Sims: An Interactive Tool for Geospatial Matching and Clustering

    Authors: Akram Zaytar, Girmaw Abebe Tadesse, Caleb Robinson, Eduardo G. Bendito, Medha Devare, Meklit Chernet, Gilles Q. Hacheme, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: Acquiring, processing, and visualizing geospatial data requires significant computing resources, especially for large spatio-temporal domains. This challenge hinders the rapid discovery of predictive features, which is essential for advancing geospatial modeling. To address this, we developed Similarity Search (Sims), a no-code web tool that allows users to perform clustering and similarity search… ▽ More

    Submitted 20 December, 2024; v1 submitted 13 December, 2024; originally announced December 2024.

  11. arXiv:2412.07944  [pdf, other

    cs.CV

    PGRID: Power Grid Reconstruction in Informal Developments Using High-Resolution Aerial Imagery

    Authors: Simone Fobi Nsutezo, Amrita Gupta, Duncan Kebut, Seema Iyer, Luana Marotti, Rahul Dodhia, Juan M. Lavista Ferres, Anthony Ortiz

    Abstract: As of 2023, a record 117 million people have been displaced worldwide, more than double the number from a decade ago [22]. Of these, 32 million are refugees under the UNHCR mandate, with 8.7 million residing in refugee camps. A critical issue faced by these populations is the lack of access to electricity, with 80% of the 8.7 million refugees and displaced persons in camps globally relying on trad… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted to WACV 2025 IEEE/CVF Winter Conference

  12. arXiv:2412.00777  [pdf, other

    cs.CV cs.AI

    Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps

    Authors: Girmaw Abebe Tadesse, Caleb Robinson, Charles Mwangi, Esther Maina, Joshua Nyakundi, Luana Marotti, Gilles Quentin Hacheme, Hamed Alemohammad, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: In 2023, 58.0% of the African population experienced moderate to severe food insecurity, with 21.6% facing severe food insecurity. Land-use and land-cover maps provide crucial insights for addressing food insecurity by improving agricultural efforts, including mapping and monitoring crop types and estimating yield. The development of global land-cover maps has been facilitated by the increasing av… ▽ More

    Submitted 11 December, 2024; v1 submitted 1 December, 2024; originally announced December 2024.

  13. arXiv:2409.16252  [pdf, other

    cs.CV cs.AI cs.LG

    Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation

    Authors: Hannah Kerner, Snehal Chaudhari, Aninda Ghosh, Caleb Robinson, Adeel Ahmad, Eddie Choi, Nathan Jacobs, Chris Holmes, Matthias Mohr, Rahul Dodhia, Juan M. Lavista Ferres, Jennifer Marcus

    Abstract: Crop field boundaries are foundational datasets for agricultural monitoring and assessments but are expensive to collect manually. Machine learning (ML) methods for automatically extracting field boundaries from remotely sensed images could help realize the demand for these datasets at a global scale. However, current ML methods for field instance segmentation lack sufficient geographic coverage,… ▽ More

    Submitted 19 December, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: Accepted at the AAAI-2025 Artificial Intelligence for Social Impact (AISI) track

  14. arXiv:2405.12930  [pdf, other

    cs.CV cs.LG

    Pytorch-Wildlife: A Collaborative Deep Learning Framework for Conservation

    Authors: Andres Hernandez, Zhongqi Miao, Luisa Vargas, Sara Beery, Rahul Dodhia, Pablo Arbelaez, Juan M. Lavista Ferres

    Abstract: The alarming decline in global biodiversity, driven by various factors, underscores the urgent need for large-scale wildlife monitoring. In response, scientists have turned to automated deep learning methods for data processing in wildlife monitoring. However, applying these advanced methods in real-world scenarios is challenging due to their complexity and the need for specialized knowledge, prim… ▽ More

    Submitted 28 November, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Pytorch-Wildlife is available at https://github.com/microsoft/CameraTraps

  15. arXiv:2404.08544  [pdf, other

    cs.CV cs.AI

    Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning

    Authors: Girmaw Abebe Tadesse, Caleb Robinson, Gilles Quentin Hacheme, Akram Zaytar, Rahul Dodhia, Tsering Wangyal Shawa, Juan M. Lavista Ferres, Emmanuel H. Kreike

    Abstract: This study explores object detection in historical aerial photographs of Namibia to identify long-term environmental changes. Specifically, we aim to identify key objects -- Waterholes, Omuti homesteads, and Big trees -- around Oshikango in Namibia using sub-meter gray-scale aerial imagery from 1943 and 1972. In this work, we propose a workflow for analyzing historical aerial imagery using a deep… ▽ More

    Submitted 21 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  16. arXiv:2403.02736  [pdf, other

    cs.CV cs.AI

    Bootstrapping Rare Object Detection in High-Resolution Satellite Imagery

    Authors: Akram Zaytar, Caleb Robinson, Gilles Q. Hacheme, Girmaw A. Tadesse, Rahul Dodhia, Juan M. Lavista Ferres, Lacey F. Hughey, Jared A. Stabach, Irene Amoke

    Abstract: Rare object detection is a fundamental task in applied geospatial machine learning, however is often challenging due to large amounts of high-resolution satellite or aerial imagery and few or no labeled positive samples to start with. This paper addresses the problem of bootstrapping such a rare object detection task assuming there is no labeled data and no spatial prior over the area of interest.… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  17. arXiv:2401.07014  [pdf, other

    cs.CV cs.AI

    Weak Labeling for Cropland Mapping in Africa

    Authors: Gilles Quentin Hacheme, Akram Zaytar, Girmaw Abebe Tadesse, Caleb Robinson, Rahul Dodhia, Juan M. Lavista Ferres, Stephen Wood

    Abstract: Cropland mapping can play a vital role in addressing environmental, agricultural, and food security challenges. However, in the context of Africa, practical applications are often hindered by the limited availability of high-resolution cropland maps. Such maps typically require extensive human labeling, thereby creating a scalability bottleneck. To address this, we propose an approach that utilize… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 5 pages

  18. arXiv:2401.06762  [pdf, other

    cs.CV cs.LG

    Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery

    Authors: Caleb Robinson, Isaac Corley, Anthony Ortiz, Rahul Dodhia, Juan M. Lavista Ferres, Peyman Najafirad

    Abstract: Fully understanding a complex high-resolution satellite or aerial imagery scene often requires spatial reasoning over a broad relevant context. The human object recognition system is able to understand object in a scene over a long-range relevant context. For example, if a human observes an aerial scene that shows sections of road broken up by tree canopy, then they will be unlikely to conclude th… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: In submission to IGARSS 2024

  19. arXiv:2312.06153  [pdf, other

    cs.LG cs.AI cs.HC

    Open Datasheets: Machine-readable Documentation for Open Datasets and Responsible AI Assessments

    Authors: Anthony Cintron Roman, Jennifer Wortman Vaughan, Valerie See, Steph Ballard, Jehu Torres, Caleb Robinson, Juan M. Lavista Ferres

    Abstract: This paper introduces a no-code, machine-readable documentation framework for open datasets, with a focus on responsible AI (RAI) considerations. The framework aims to improve comprehensibility, and usability of open datasets, facilitating easier discovery and use, better understanding of content and context, and evaluation of dataset quality and accuracy. The proposed framework is designed to str… ▽ More

    Submitted 27 March, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  20. arXiv:2306.12589  [pdf, other

    cs.CV cs.LG

    Rapid building damage assessment workflow: An implementation for the 2023 Rolling Fork, Mississippi tornado event

    Authors: Caleb Robinson, Simone Fobi Nsutezo, Anthony Ortiz, Tina Sederholm, Rahul Dodhia, Cameron Birge, Kasie Richards, Kris Pitcher, Paulo Duarte, Juan M. Lavista Ferres

    Abstract: Rapid and accurate building damage assessments from high-resolution satellite imagery following a natural disaster is essential to inform and optimize first responder efforts. However, performing such building damage assessments in an automated manner is non-trivial due to the challenges posed by variations in disaster-specific damage, diversity in satellite imagery, and the dearth of extensive, l… ▽ More

    Submitted 24 August, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: Accepted at the 2023 ICCV Humanitarian Assistance and Disaster Response workshop

  21. arXiv:2306.06191  [pdf, other

    cs.LG cs.IR

    Open Data on GitHub: Unlocking the Potential of AI

    Authors: Anthony Cintron Roman, Kevin Xu, Arfon Smith, Jehu Torres Vega, Caleb Robinson, Juan M Lavista Ferres

    Abstract: GitHub is the world's largest platform for collaborative software development, with over 100 million users. GitHub is also used extensively for open data collaboration, hosting more than 800 million open data files, totaling 142 terabytes of data. This study highlights the potential of open data on GitHub and demonstrates how it can accelerate AI research. We analyze the existing landscape of open… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: In submission to NeurIPS 2023 Track Datasets and Benchmarks

  22. arXiv:2305.13456  [pdf, other

    cs.CV cs.LG

    Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters

    Authors: Isaac Corley, Caleb Robinson, Rahul Dodhia, Juan M. Lavista Ferres, Peyman Najafirad

    Abstract: Research in self-supervised learning (SSL) with natural images has progressed rapidly in recent years and is now increasingly being applied to and benchmarked with datasets containing remotely sensed imagery. A common benchmark case is to evaluate SSL pre-trained model embeddings on datasets of remotely sensed imagery with small patch sizes, e.g., 32x32 pixels, whereas standard SSL pre-training ta… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  23. arXiv:2206.05377  [pdf, other

    cs.CV cs.LG

    Fast building segmentation from satellite imagery and few local labels

    Authors: Caleb Robinson, Anthony Ortiz, Hogeun Park, Nancy Lozano Gracia, Jon Kher Kaw, Tina Sederholm, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: Innovations in computer vision algorithms for satellite image analysis can enable us to explore global challenges such as urbanization and land use change at the planetary level. However, domain shift problems are a common occurrence when trying to replicate models that drive these analyses to new areas, particularly in the developing world. If a model is trained with imagery and labels from one l… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: Accepted at EarthVision 2022

  24. arXiv:2112.10988  [pdf, other

    cs.CV cs.LG

    Mapping industrial poultry operations at scale with deep learning and aerial imagery

    Authors: Caleb Robinson, Ben Chugg, Brandon Anderson, Juan M. Lavista Ferres, Daniel E. Ho

    Abstract: Concentrated Animal Feeding Operations (CAFOs) pose serious risks to air, water, and public health, but have proven to be challenging to regulate. The U.S. Government Accountability Office notes that a basic challenge is the lack of comprehensive location information on CAFOs. We use the USDA's National Agricultural Imagery Program (NAIP) 1m/pixel aerial imagery to detect poultry CAFOs across the… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

  25. arXiv:2111.08872  [pdf, other

    cs.CV cs.LG

    TorchGeo: Deep Learning With Geospatial Data

    Authors: Adam J. Stewart, Caleb Robinson, Isaac A. Corley, Anthony Ortiz, Juan M. Lavista Ferres, Arindam Banerjee

    Abstract: Remotely sensed geospatial data are critical for applications including precision agriculture, urban planning, disaster monitoring and response, and climate change research, among others. Deep learning methods are particularly promising for modeling many remote sensing tasks given the success of deep neural networks in similar computer vision tasks and the sheer volume of remotely sensed imagery a… ▽ More

    Submitted 17 September, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

  26. arXiv:2106.15448  [pdf, other

    cs.CV cs.LG

    Detecting Cattle and Elk in the Wild from Space

    Authors: Caleb Robinson, Anthony Ortiz, Lacey Hughey, Jared A. Stabach, Juan M. Lavista Ferres

    Abstract: Localizing and counting large ungulates -- hoofed mammals like cows and elk -- in very high-resolution satellite imagery is an important task for supporting ecological studies. Prior work has shown that this is feasible with deep learning based methods and sub-meter multi-spectral satellite imagery. We extend this line of work by proposing a baseline method, CowNet, that simultaneously estimates t… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: Presented at the KDD 2021 Fragile Earth Workshop

  27. arXiv:2104.11757  [pdf, ps, other

    cs.CY

    Becoming Good at AI for Good

    Authors: Meghana Kshirsagar, Caleb Robinson, Siyu Yang, Shahrzad Gholami, Ivan Klyuzhin, Sumit Mukherjee, Md Nasir, Anthony Ortiz, Felipe Oviedo, Darren Tanner, Anusua Trivedi, Yixi Xu, Ming Zhong, Bistra Dilkina, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: AI for good (AI4G) projects involve developing and applying artificial intelligence (AI) based solutions to further goals in areas such as sustainability, health, humanitarian aid, and social justice. Developing and deploying such solutions must be done in collaboration with partners who are experts in the domain in question and who already have experience in making progress towards such goals. Ba… ▽ More

    Submitted 3 May, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Accepted to AIES-2021

  28. arXiv:2103.09787  [pdf, other

    cs.CV

    Temporal Cluster Matching for Change Detection of Structures from Satellite Imagery

    Authors: Caleb Robinson, Anthony Ortiz, Juan M. Lavista Ferres, Brandon Anderson, Daniel E. Ho

    Abstract: Longitudinal studies are vital to understanding dynamic changes of the planet, but labels (e.g., buildings, facilities, roads) are often available only for a single point in time. We propose a general model, Temporal Cluster Matching (TCM), for detecting building changes in time series of remotely sensed imagery when footprint labels are observed only once. The intuition behind the model is that t… ▽ More

    Submitted 29 June, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: Published in ACM COMPASS 2021

  29. arXiv:2010.01485  [pdf, other

    eess.IV cs.CV cs.LG

    Improving Lesion Detection by exploring bias on Skin Lesion dataset

    Authors: Anusua Trivedi, Sreya Muppalla, Shreyaan Pathak, Azadeh Mobasher, Pawel Janowski, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: All datasets contain some biases, often unintentional, due to how they were acquired and annotated. These biases distort machine-learning models' performance, creating spurious correlations that the models can unfairly exploit, or, contrarily destroying clear correlations that the models could learn. With the popularity of deep learning models, automated skin lesion analysis is starting to play an… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.