-
ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks
Authors:
Akashah Shabbir,
Muhammad Akhtar Munir,
Akshay Dudhane,
Muhammad Umer Sheikh,
Muhammad Haris Khan,
Paolo Fraccaro,
Juan Bernabe Moreno,
Fahad Shahbaz Khan,
Salman Khan
Abstract:
Recent progress in large language models (LLMs) has enabled tool-augmented agents capable of solving complex real-world tasks through step-by-step reasoning. However, existing evaluations often focus on general-purpose or multimodal scenarios, leaving a gap in domain-specific benchmarks that assess tool-use capabilities in complex remote sensing use cases. We present ThinkGeo, an agentic benchmark designed to evaluate LLM-driven agents on remote sensing tasks via structured tool use and multi-step planning. Inspired by tool-interaction paradigms, ThinkGeo includes human-curated queries spanning a wide range of real-world applications such as urban planning, disaster assessment and change analysis, environmental monitoring, transportation analysis, aviation monitoring, recreational infrastructure, and industrial site analysis. Each query is grounded in satellite or aerial imagery and requires agents to reason through a diverse toolset. We implement a ReAct-style interaction loop and evaluate both open and closed-source LLMs (e.g., GPT-4o, Qwen2.5) on 436 structured agentic tasks. The benchmark reports both step-wise execution metrics and final answer correctness. Our analysis reveals notable disparities in tool accuracy and planning consistency across models. ThinkGeo provides the first extensive testbed for evaluating how tool-enabled LLMs handle spatial reasoning in remote sensing. Our code and dataset are publicly available.
Submitted 29 May, 2025;
originally announced May 2025.
-
PerSense: Personalized Instance Segmentation in Dense Images
Authors:
Muhammad Ibraheem Siddiqui,
Muhammad Umer Sheikh,
Hassan Abid,
Muhammad Haris Khan
Abstract:
The emergence of foundational models has significantly advanced segmentation approaches. However, existing models still face challenges in automatically segmenting personalized instances in dense scenarios, where severe occlusions, scale variations, and background clutter hinder precise instance delineation. To address this, we propose PerSense, an end-to-end, training-free, and model-agnostic one-shot framework for personalized instance segmentation in dense images. We first develop a new baseline that automatically generates instance-level point prompts through a novel Instance Detection Module (IDM), which leverages density maps that capture the spatial distribution of objects in an image. To reduce false positives, we design the Point Prompt Selection Module (PPSM), which refines the output of IDM based on an adaptive threshold. Both IDM and PPSM integrate seamlessly into our model-agnostic framework. Furthermore, we introduce a feedback mechanism that enables PerSense to improve the accuracy of density maps by automating the exemplar selection process for density map generation. Finally, to promote algorithmic advances and effective tools for this relatively underexplored task, we introduce PerSense-D, an evaluation benchmark exclusive to personalized instance segmentation in dense images. Our extensive experiments establish the superiority of PerSense in dense scenarios compared to SOTA approaches. Additionally, our qualitative findings demonstrate the adaptability of our framework to images captured in the wild.
Submitted 11 March, 2025; v1 submitted 22 May, 2024;
originally announced May 2024.
-
Advanced Antenna Techniques and High Order Sectorization with Novel Network Tessellation for Enhancing Macro Cell Capacity in DC-HSDPA Network
Authors:
Muhammad Usman Sheikh,
Jukka Lempiainen
Abstract:
Mobile operators commonly use macro cells with traditional wide-beam antennas for wider coverage in the cell, but future capacity demands cannot be met by these alone. Macro cells must deliver their maximum practical capacity by employing higher-order sectorization and by utilizing all possible antenna solutions, including smart antennas. This paper presents an enhanced tessellation for 6-sector sites and proposes a novel layout for 12-sector sites. The main target of this paper is to compare the performance of a conventional wide-beam antenna, a switched-beam smart antenna, an adaptive-beam antenna, and different network layouts in terms of received signal quality and user throughput. Splitting a macro cell into smaller micro or pico cells can improve network capacity, but this paper highlights the importance of higher-order sectorization and advanced antenna techniques for attaining a high Signal to Interference plus Noise Ratio (SINR), along with improved network capacity. System-level Monte Carlo simulations were performed for Dual Cell High Speed Downlink Packet Access (DC-HSDPA) technology with multiple (five) users per Transmission Time Interval (TTI) at different Intersite Distances (ISDs). The obtained results validate and estimate the gain of using smart antennas and higher-order sectorization with the proposed network layout.
Submitted 10 December, 2013;
originally announced December 2013.