Skip to main content

Showing 1–40 of 40 results for author: Sapkota, R

.
  1. arXiv:2506.08045  [pdf, ps, other

    cs.RO cs.AI

    UAVs Meet Agentic AI: A Multidomain Survey of Autonomous Aerial Intelligence and Agentic UAVs

    Authors: Ranjan Sapkota, Konstantinos I. Roumeliotis, Manoj Karkee

    Abstract: Agentic UAVs represent a new frontier in autonomous aerial intelligence, integrating perception, decision-making, memory, and collaborative planning to operate adaptively in complex, real-world environments. Driven by recent advances in Agentic AI, these systems surpass traditional UAVs by exhibiting goal-driven behavior, contextual reasoning, and interactive autonomy. We provide a comprehensive f… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: 40 pages, 6 Figures

  2. arXiv:2506.06627  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Lithography defined semiconductor moires with anomalous in-gap quantum Hall states

    Authors: Wei Pan, D. Bruce Burckel, Catalin D. Spataru, Keshab R. Sapkota, Aaron J. Muhowski, Samuel D. Hawkins, John F. Klem, Layla S. Smith, Doyle A. Temple, Zachery A. Enderson, Zhigang Jiang, Komalavalli Thirunavukkuarasu, Li Xiang, Mykhaylo Ozerov, Dmitry Smirnov, Chang Niu, Peide D. Ye, Praveen Pai, Fan Zhang

    Abstract: Quantum materials and phenomena have attracted great interest for their potential applications in next-generation microelectronics and quantum-information technologies. In one especially interesting class of quantum materials, moire superlattices (MSL) formed by twisted bilayers of 2D materials, a wide range of novel phenomena are observed. However, there exist daunting challenges such as reproduc… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: published by Nano Letters

  3. arXiv:2506.04133  [pdf, ps, other

    cs.AI

    TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems

    Authors: Shaina Raza, Ranjan Sapkota, Manoj Karkee, Christos Emmanouilidis

    Abstract: Agentic AI systems, built on large language models (LLMs) and deployed in multi-agent configurations, are redefining intelligent autonomy, collaboration and decision-making across enterprise and societal domains. This review presents a structured analysis of Trust, Risk, and Security Management (TRiSM) in the context of LLM-based agentic multi-agent systems (AMAS). We begin by examining the concep… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  4. arXiv:2505.21861  [pdf

    cond-mat.supr-con cond-mat.mes-hall cond-mat.mtrl-sci

    Robust and Symmetric Magnetic Field Dependency of Superconducting Diode Effect in Asymmetric Dirac Semimetal SQUIDs

    Authors: H. C. Travaglini, J. J. Cuozzo, K. R. Sapkota, I. A. Leahy, A. D. Rice, K. Alberi, W. Pan

    Abstract: The recent demonstration of the superconducting diode effect (SDE) has generated renewed interests in superconducting electronics in which devices such as compact superconducting diodes that can perform signal rectification where low-energy operations are needed. In this article, we present our results of robust and symmetric-in-magnetic-field SDE in asymmetric superconducting quantum interference… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: accepted to APL Quantum

  5. arXiv:2505.19443  [pdf, other

    cs.SE cs.AI cs.CL

    Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI

    Authors: Ranjan Sapkota, Konstantinos I. Roumeliotis, Manoj Karkee

    Abstract: This review presents a comprehensive analysis of two emerging paradigms in AI-assisted software development: vibe coding and agentic coding. While both leverage large language models (LLMs), they differ fundamentally in autonomy, architectural design, and the role of the developer. Vibe coding emphasizes intuitive, human-in-the-loop interaction through prompt-based, conversational workflows that s… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 35 Pages, 8 Figures, 6 Tables

  6. arXiv:2505.10468  [pdf, other

    cs.AI

    AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges

    Authors: Ranjan Sapkota, Konstantinos I. Roumeliotis, Manoj Karkee

    Abstract: This study critically distinguishes between AI Agents and Agentic AI, offering a structured conceptual taxonomy, application mapping, and challenge analysis to clarify their divergent design philosophies and capabilities. We begin by outlining the search strategy and foundational definitions, characterizing AI Agents as modular systems driven by Large Language Models (LLMs) and Large Image Models… ▽ More

    Submitted 27 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

    Comments: 36 pages, 14 figures, 11 tables

  7. arXiv:2505.04769  [pdf, other

    cs.CV

    Vision-Language-Action Models: Concepts, Progress, Applications and Challenges

    Authors: Ranjan Sapkota, Yang Cao, Konstantinos I. Roumeliotis, Manoj Karkee

    Abstract: Vision-Language-Action (VLA) models mark a transformative advancement in artificial intelligence, aiming to unify perception, natural language understanding, and embodied action within a single computational framework. This foundational review presents a comprehensive synthesis of recent advancements in Vision-Language-Action models, systematically organized across five thematic pillars that struc… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 36 pages, 18 Figures, 4 Tables

  8. arXiv:2504.20419  [pdf

    cs.CV

    Plant Disease Detection through Multimodal Large Language Models and Convolutional Neural Networks

    Authors: Konstantinos I. Roumeliotis, Ranjan Sapkota, Manoj Karkee, Nikolaos D. Tselikas, Dimitrios K. Nasiopoulos

    Abstract: Automation in agriculture plays a vital role in addressing challenges related to crop monitoring and disease management, particularly through early detection systems. This study investigates the effectiveness of combining multimodal Large Language Models (LLMs), specifically GPT-4o, with Convolutional Neural Networks (CNNs) for automated plant disease classification using leaf imagery. Leveraging… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  9. arXiv:2504.18738  [pdf, other

    cs.CV

    A Review of 3D Object Detection with Vision-Language Models

    Authors: Ranjan Sapkota, Konstantinos I Roumeliotis, Rahul Harsha Cheppally, Marco Flores Calero, Manoj Karkee

    Abstract: This review provides a systematic analysis of comprehensive survey of 3D object detection with vision-language models(VLMs) , a rapidly advancing area at the intersection of 3D vision and multimodal AI. By examining over 100 research papers, we provide the first systematic analysis dedicated to 3D object detection with vision-language models. We begin by outlining the unique challenges of 3D objec… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

  10. arXiv:2504.13099  [pdf, other

    cs.CV

    RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity

    Authors: Ranjan Sapkota, Rahul Harsha Cheppally, Ajay Sharda, Manoj Karkee

    Abstract: This study conducts a detailed comparison of RF-DETR object detection base model and YOLOv12 object detection model configurations for detecting greenfruits in a complex orchard environment marked by label ambiguity, occlusions, and background blending. A custom dataset was developed featuring both single-class (greenfruit) and multi-class (occluded and non-occluded greenfruits) annotations to ass… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  11. arXiv:2503.00057  [pdf, other

    cs.CV cs.CL

    Improved YOLOv12 with LLM-Generated Synthetic Data for Enhanced Apple Detection and Benchmarking Against YOLOv11 and YOLOv10

    Authors: Ranjan Sapkota, Manoj Karkee

    Abstract: This study evaluated the performance of the YOLOv12 object detection model, and compared against the performances YOLOv11 and YOLOv10 for apple detection in commercial orchards based on the model training completed entirely on synthetic images generated by Large Language Models (LLMs). The YOLOv12n configuration achieved the highest precision at 0.916, the highest recall at 0.969, and the highest… ▽ More

    Submitted 19 March, 2025; v1 submitted 26 February, 2025; originally announced March 2025.

    Comments: 8 pages, 5 Figures, 2 Tables

  12. arXiv:2502.18505  [pdf, other

    cs.SE cs.AI cs.CL

    Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models

    Authors: Ranjan Sapkota, Shaina Raza, Manoj Karkee

    Abstract: Despite increasing discussions on open-source Artificial Intelligence (AI), existing research lacks a discussion on the transparency and accessibility of state-of-the-art (SoTA) Large Language Models (LLMs). The Open Source Initiative (OSI) has recently released its first formal definition of open-source software. This definition, when combined with standard dictionary definitions and the sparse p… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

  13. arXiv:2502.08650  [pdf, other

    cs.CY

    Who is Responsible? The Data, Models, Users or Regulations? A Comprehensive Survey on Responsible Generative AI for a Sustainable Future

    Authors: Shaina Raza, Rizwan Qureshi, Anam Zahid, Joseph Fioresi, Ferhat Sadak, Muhammad Saeed, Ranjan Sapkota, Aditya Jain, Anas Zafar, Muneeb Ul Hassan, Aizan Zafar, Hasan Maqbool, Ashmal Vayani, Jia Wu, Maged Shoman

    Abstract: Responsible Artificial Intelligence (RAI) has emerged as a crucial framework for addressing ethical concerns in the development and deployment of Artificial Intelligence (AI) systems. A significant body of literature exists, primarily focusing on either RAI guidelines and principles or the technical aspects of RAI, largely within the realm of traditional AI. However, a notable gap persists in brid… ▽ More

    Submitted 28 April, 2025; v1 submitted 15 January, 2025; originally announced February 2025.

    Comments: under review

  14. arXiv:2501.18648  [pdf, other

    cs.CV

    Multimodal Large Language Models for Image, Text, and Speech Data Augmentation: A Survey

    Authors: Ranjan Sapkota, Shaina Raza, Maged Shoman, Achyut Paudel, Manoj Karkee

    Abstract: In the past five years, research has shifted from traditional Machine Learning (ML) and Deep Learning (DL) approaches to leveraging Large Language Models (LLMs) , including multimodality, for data augmentation to enhance generalization, and combat overfitting in training deep convolutional neural networks. However, while existing surveys predominantly focus on ML and DL techniques or limited modal… ▽ More

    Submitted 21 March, 2025; v1 submitted 29 January, 2025; originally announced January 2025.

    Comments: 52 pages

  15. arXiv:2501.16345  [pdf, other

    cs.LG cs.AI

    Self-Clustering Graph Transformer Approach to Model Resting-State Functional Brain Activity

    Authors: Bishal Thapaliya, Esra Akbas, Ram Sapkota, Bhaskar Ray, Vince Calhoun, Jingyu Liu

    Abstract: Resting-state functional magnetic resonance imaging (rs-fMRI) offers valuable insights into the human brain's functional organization and is a powerful tool for investigating the relationship between brain function and cognitive processes, as it allows for the functional organization of the brain to be captured without relying on a specific task or stimuli. In this study, we introduce a novel atte… ▽ More

    Submitted 7 February, 2025; v1 submitted 17 January, 2025; originally announced January 2025.

    Comments: 5 pages, 2 figures - Accepted under International Symposium on Biomedical Imaging (ISBI 2025) Conference

  16. arXiv:2412.05728  [pdf, other

    cs.CV

    Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards

    Authors: Ranjan Sapkota, Manoj Karkee

    Abstract: In this study, we developed a customized instance segmentation model by integrating the Convolutional Block Attention Module (CBAM) with the YOLO11 architecture. This model, trained on a mixed dataset of dormant and canopy season apple orchard images, aimed to enhance the segmentation of tree trunks and branches under varying seasonal conditions throughout the year. The model was individually vali… ▽ More

    Submitted 7 December, 2024; originally announced December 2024.

    Comments: 19 Pages, YOLOv11

  17. arXiv:2411.11285  [pdf, other

    cs.CV cs.AI

    Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development

    Authors: Ranjan Sapkota, Achyut Paudel, Manoj Karkee

    Abstract: Currently, deep learning-based instance segmentation for various applications (e.g., Agriculture) is predominantly performed using a labor-intensive process involving extensive field data collection using sophisticated sensors, followed by careful manual annotation of images, presenting significant logistical and financial challenges to researchers and organizations. The process also slows down th… ▽ More

    Submitted 27 February, 2025; v1 submitted 18 November, 2024; originally announced November 2024.

  18. arXiv:2410.19869  [pdf, other

    cs.CV

    Comparing YOLOv11 and YOLOv8 for instance segmentation of occluded and non-occluded immature green fruits in complex orchard environment

    Authors: Ranjan Sapkota, Manoj Karkee

    Abstract: This study conducted a comprehensive performance evaluation on YOLO11 (or YOLOv11) and YOLOv8, the latest in the "You Only Look Once" (YOLO) series, focusing on their instance segmentation capabilities for immature green apples in orchard environments. YOLO11n-seg achieved the highest mask precision across all categories with a notable score of 0.831, highlighting its effectiveness in fruit detect… ▽ More

    Submitted 26 January, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: 16 Pages, 10 Figures, 3 Tables

  19. arXiv:2410.19846  [pdf, other

    cs.CV

    YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning

    Authors: Ranjan Sapkota, Manoj Karkee

    Abstract: In this study, a robust method for 3D pose estimation of immature green apples (fruitlets) in commercial orchards was developed, utilizing the YOLO11(or YOLOv11) object detection and pose estimation algorithm alongside Vision Transformers (ViT) for depth estimation (Dense Prediction Transformer (DPT) and Depth Anything V2). For object detection and pose estimation, performance comparisons of YOLO1… ▽ More

    Submitted 30 March, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: 24 Pages, 13 Figures, 1 Table

  20. arXiv:2410.06085  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.supr-con

    Epitaxial aluminum layer on antimonide heterostructures for exploring Josephson junction effects

    Authors: W. Pan, K. R. Sapkota, P. Lu, A. J. Muhowski, W. M. Martinez, C. L. H. Sovinec, R. Reyna, J. P. Mendez, D. Mamaluy, S. D. Hawkins, J. F. Klem, L. S. L. Smith, D. A. Temple, Z. Enderson, Z. Jiang, E. Rossi

    Abstract: In this article, we present results of our recent work of epitaxially-grown aluminum (epi-Al) on antimonide heterostructures, where the epi-Al thin film is grown at either room temperature or below zero $^o$C. A sharp superconducting transition at $T \sim 1.3$ K is observed in these epi-Al films. We further show that supercurrent states are realized in Josephson junctions fabricated in the epi-Al/… ▽ More

    Submitted 17 April, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

    Journal ref: Materials Science and Engineering: B 318, 118285 (2025)

  21. A vision-based robotic system for precision pollination of apples

    Authors: Uddhav Bhattarai, Ranjan Sapkota, Safal Kshetri, Changki Mo, Matthew D. Whiting, Qin Zhang, Manoj Karkee

    Abstract: Global food production depends upon successful pollination, a process that relies on natural and managed pollinators. However, natural pollinators are declining due to factors such as climate change, habitat loss, and pesticide use. This paper presents an integrated robotic system for precision pollination in apples. The system consisted of a machine vision system to identify target flower cluster… ▽ More

    Submitted 9 March, 2025; v1 submitted 29 September, 2024; originally announced September 2024.

    Comments: Accepted for publication in Computers and Electronics in Agriculture

    Journal ref: Computers and Electronics in Agriculture, Volume 234, 2025

  22. arXiv:2407.19404  [pdf

    cond-mat.mtrl-sci

    Non-equilibrium States and Interactions in the Topological Insulator and Topological Crystalline Insulator Phases of NaCd4As3

    Authors: Tika R Kafle, Yingchao Zhang, Yi-yan Wang, Xun Shi, Na Li, Richa Sapkota, Jeremy Thurston, Wenjing You, Shunye Gao, Qingxin Dong, Kai Rossnagel, Gen-Fu Chen, James K Freericks, Henry C Kapteyn, Margaret M Murnane

    Abstract: Topological materials are of great interest because they can support metallic edge or surface states that are robust against perturbations, with the potential for technological applications. Here we experimentally explore the light-induced non-equilibrium properties of two distinct topological phases in NaCd4As3: a topological crystalline insulator (TCI) phase and a topological insulator (TI) phas… ▽ More

    Submitted 20 August, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

    Comments: 21 pages, 9 figures, added theoretical insight in the discussion section and modified abstract, corrected typos and rephrased sentences, results and figures unchanged, added a co-author involved in sample preparation

  23. arXiv:2407.18894  [pdf

    cond-mat.mtrl-sci

    Uncovering the Timescales of Spin Reorientation in $TbMn_{6}Sn_{6}$

    Authors: Sinéad A. Ryan, Anya Grafov, Na Li, Hans T. Nembach, Justin M. Shaw, Hari Bhandari, Tika Kafle, Richa Sapkota, Henry C. Kapteyn, Nirmal J. Ghimire, Margaret M. Murnane

    Abstract: $TbMn_{6}Sn_{6}… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  24. arXiv:2407.12040  [pdf, other

    cs.CV cs.AI

    Comprehensive Performance Evaluation of YOLOv12, YOLO11, YOLOv10, YOLOv9 and YOLOv8 on Detecting and Counting Fruitlet in Complex Orchard Environments

    Authors: Ranjan Sapkota, Zhichao Meng, Martin Churuvija, Xiaoqiang Du, Zenghong Ma, Manoj Karkee

    Abstract: This study systematically performed an extensive real-world evaluation of the performances of all configurations of YOLOv8, YOLOv9, YOLOv10, YOLO11( or YOLOv11), and YOLOv12 object detection algorithms in terms of precision, recall, mean Average Precision at 50\% Intersection over Union (mAP@50), and computational speeds including pre-processing, inference, and post-processing times immature green… ▽ More

    Submitted 25 February, 2025; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 16 figures, 9 figures, 3 tables

  25. YOLO advances to its genesis: a decadal and comprehensive review of the You Only Look Once (YOLO) series

    Authors: Ranjan Sapkota, Marco Flores Calero, Rizwan Qureshi, Chetan Badgujar, Upesh Nepal, Alwin Poulose, Peter Zeno, Uday Bhanu Prakash Vaddevolu, Sheheryar Khan, Maged Shoman, Hong Yan, Manoj Karkee

    Abstract: This review systematically examines the progression of the You Only Look Once (YOLO) object detection algorithms from YOLOv1 to the recently unveiled YOLOv12. Employing a reverse chronological analysis, this study examines the advancements introduced by YOLO algorithms, beginning with YOLOv12 and progressing through YOLO11 (or YOLOv11), YOLOv10, YOLOv9, YOLOv8, and subsequent versions to explore e… ▽ More

    Submitted 30 May, 2025; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Published in Artificial Intelligence Review as https://doi.org/10.1007/s10462-025-11253-3

    Journal ref: Artificial Intelligence Review, SpringerNature, 2025

  26. arXiv:2405.15805  [pdf, other

    q-bio.NC cs.AI cs.LG

    DSAM: A Deep Learning Framework for Analyzing Temporal and Spatial Dynamics in Brain Networks

    Authors: Bishal Thapaliya, Robyn Miller, Jiayu Chen, Yu-Ping Wang, Esra Akbas, Ram Sapkota, Bhaskar Ray, Pranav Suresh, Santosh Ghimire, Vince Calhoun, Jingyu Liu

    Abstract: Resting-state functional magnetic resonance imaging (rs-fMRI) is a noninvasive technique pivotal for understanding human neural mechanisms of intricate cognitive processes. Most rs-fMRI studies compute a single static functional connectivity matrix across brain regions of interest, or dynamic functional connectivity matrices with a sliding window approach. These approaches are at risk of oversimpl… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 18 Pages, 4 figures

    Journal ref: Medical Image Analysis, Volume 91, 2025, Article 102124

  27. Immature Green Apple Detection and Sizing in Commercial Orchards using YOLOv8 and Shape Fitting Techniques

    Authors: Ranjan Sapkota, Dawood Ahmed, Martin Churuvija, Manoj Karkee

    Abstract: Detecting and estimating size of apples during the early stages of growth is crucial for predicting yield, pest management, and making informed decisions related to crop-load management, harvest and post-harvest logistics, and marketing. Traditional fruit size measurement methods are laborious and timeconsuming. This study employs the state-of-the-art YOLOv8 object detection and instance segmentat… ▽ More

    Submitted 2 April, 2024; v1 submitted 8 December, 2023; originally announced January 2024.

  28. Comparing YOLOv8 and Mask RCNN for object segmentation in complex orchard environments

    Authors: Ranjan Sapkota, Dawood Ahmed, Manoj Karkee

    Abstract: Instance segmentation, an important image processing operation for automation in agriculture, is used to precisely delineate individual objects of interest within images, which provides foundational information for various automated or robotic tasks such as selective harvesting and precision pruning. This study compares the one-stage YOLOv8 and the two-stage Mask R-CNN machine learning models for… ▽ More

    Submitted 4 July, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 15 figures, 2 tables

  29. arXiv:2311.10755  [pdf

    cs.RO

    Robotic Pollination of Apples in Commercial Orchards

    Authors: Ranjan Sapkota, Dawood Ahmed, Salik Ram Khanal, Uddhav Bhattarai, Changki Mo, Matthew D. Whiting, Manoj Karkee

    Abstract: This research presents a novel, robotic pollination system designed for targeted pollination of apple flowers in modern fruiting wall orchards. Developed in response to the challenges of global colony collapse disorder, climate change, and the need for sustainable alternatives to traditional pollinators, the system utilizes a commercial manipulator, a vision system, and a spray nozzle for pollen a… ▽ More

    Submitted 3 February, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: 2 Page, 1 figure

  30. arXiv:2311.03520  [pdf, other

    cs.LG cs.AI q-bio.NC

    Brain Networks and Intelligence: A Graph Neural Network Based Approach to Resting State fMRI Data

    Authors: Bishal Thapaliya, Esra Akbas, Jiayu Chen, Raam Sapkota, Bhaskar Ray, Pranav Suresh, Vince Calhoun, Jingyu Liu

    Abstract: Resting-state functional magnetic resonance imaging (rsfMRI) is a powerful tool for investigating the relationship between brain function and cognitive processes as it allows for the functional organization of the brain to be captured without relying on a specific task or stimuli. In this paper, we present a novel modeling architecture called BrainRGIN for predicting intelligence (fluid, crystalli… ▽ More

    Submitted 27 October, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Journal ref: Medical Image Analysis, Volume 90, 2024, Article 102123

  31. arXiv:2307.08789  [pdf

    cs.CV

    Generative AI in Agriculture: Creating Image Datasets Using DALL.E's Advanced Large Language Model Capabilities

    Authors: Ranjan Sapkota, Manoj Karkee

    Abstract: This research investigated the role of artificial intelligence (AI), specifically the DALL.E model by OpenAI, in advancing data generation and visualization techniques in agriculture. DALL.E, an advanced AI image generator, works alongside ChatGPT's language processing to transform text descriptions and image clues into realistic visual representations of the content. The study used both approache… ▽ More

    Submitted 15 March, 2025; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 9 Figures, 1 table, 19 pages

  32. arXiv:2304.13282  [pdf

    cs.RO

    Machine Vision-Based Crop-Load Estimation Using YOLOv8

    Authors: Dawood Ahmed, Ranjan Sapkota, Martin Churuvija, Manoj Karkee

    Abstract: Labor shortages in fruit crop production have prompted the development of mechanized and automated machines as alternatives to labor-intensive orchard operations such as harvesting, pruning, and thinning. Agricultural robots capable of identifying tree canopy parts and estimating geometric and topological parameters, such as branch diameter, length, and angles, can optimize crop yields through aut… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  33. arXiv:2304.09351  [pdf

    cs.CV

    Machine Vision System for Early-stage Apple Flowers and Flower Clusters Detection for Precision Thinning and Pollination

    Authors: Salik Ram Khanal, Ranjan Sapkota, Dawood Ahmed, Uddhav Bhattarai, Manoj Karkee

    Abstract: Early-stage identification of fruit flowers that are in both opened and unopened condition in an orchard environment is significant information to perform crop load management operations such as flower thinning and pollination using automated and robotic platforms. These operations are important in tree-fruit agriculture to enhance fruit quality, manage crop load, and enhance the overall profit. T… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  34. Site-specific weed management in corn using UAS imagery analysis and computer vision techniques

    Authors: Ranjan Sapkota, John Stenger, Michael Ostlie, Paulo Flores

    Abstract: Currently, weed control in commercial corn production is performed without considering weed distribution information in the field. This kind of weed management practice leads to excessive amounts of chemical herbicides being applied in a given field. The objective of this study was to perform site-specific weed control (SSWC) in a corn field by 1) using an unmanned aerial system (UAS) to map the s… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: arXiv admin note: text overlap with arXiv:2204.12417, arXiv:2206.01734

    Report number: 6548

    Journal ref: Sci Rep 13, 6548 (2023)

  35. arXiv:2206.07201  [pdf, other

    cs.RO

    An autonomous robot for pruning modern, planar fruit trees

    Authors: Alexander You, Nidhi Parayil, Josyula Gopala Krishna, Uddhav Bhattarai, Ranjan Sapkota, Dawood Ahmed, Matthew Whiting, Manoj Karkee, Cindy M. Grimm, Joseph R. Davidson

    Abstract: Dormant pruning of fruit trees is an important task for maintaining tree health and ensuring high-quality fruit. Due to decreasing labor availability, pruning is a prime candidate for robotic automation. However, pruning also represents a uniquely difficult problem for robots, requiring robust systems for perception, pruning point determination, and manipulation that must operate under variable li… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  36. arXiv:2206.01734  [pdf

    cs.CV cs.AI eess.IV

    Using UAS Imagery and Computer Vision to Support Site-Specific Weed Control in Corn

    Authors: Ranjan Sapkota, Paulo Flores

    Abstract: Currently, weed control in a corn field is performed by a blanket application of herbicides that do not consider spatial distribution information of weeds and also uses an extensive amount of chemical herbicides. To reduce the amount of chemicals, we used drone-based high-resolution imagery and computer-vision techniques to perform site-specific weed control in corn.

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 16 Figures, 3 Tables,. arXiv admin note: substantial text overlap with arXiv:2204.12417

  37. arXiv:2204.12417   

    eess.IV

    UAS Imagery and Computer Vision for Site-Specific Weed Control in Corn

    Authors: Ranjan Sapkota, Paulo Flores

    Abstract: Currently, weed control in a corn field is performed by a blanket application of herbicides which do not consider spatial distribution information of weeds and also uses an extensive amount of chemical herbicides. In order to reduce the amount of chemicals, we used drone based high-resolution imagery and computer-vision techniwue to perform site-specific weed control in corn.

    Submitted 28 April, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: Mistakes found

  38. arXiv:1803.06593  [pdf

    cond-mat.mtrl-sci

    Bulk transport properties of Bismuth selenide thin films approaching the two-dimensional limit

    Authors: Yub Raj Sapkota, Dipanjan Mazumdar

    Abstract: We have investigated the transport properties of topological insulator Bi2Se3 thin films grown using magnetron sputtering with an emphasis on understanding the behavior as a function of thickness. We show that thickness has a strong influence on all aspects of transport as the two-dimensional limit is approached. Bulk resistivity and Hall mobility show disproportionately large changes below 6 quin… ▽ More

    Submitted 17 March, 2018; originally announced March 2018.

    Comments: 13 pages, 4 figures

  39. arXiv:1703.01019  [pdf

    cond-mat.mtrl-sci

    Optical evidence of blue shift in topological insulator bismuth selenide in the few-layer limit

    Authors: Yub Raj Sapkota, Asma Alkabsh, Aaron Walber, Hassana Samassekou, Dipanjan Mazumdar

    Abstract: Optical band gap properties of high-quality few-layer topological insulator Bi2Se3 thin films grown with magnetron sputtering are investigated using broadband absorption spectroscopy. We provide direct optical evidence of a rigid blue-shift to up to 0.5 eV in the band gap of Bi2Se3 as it approaches the two-dimensional limit. The onset of this behavior is most significant below six quintuple layers… ▽ More

    Submitted 2 March, 2017; originally announced March 2017.

  40. arXiv:1609.03609  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Estimation of spin relaxation lengths in spin valves of In and In2O3 nanostructures

    Authors: Keshab R Sapkota, Parshu Gyawali, Ian L. Pegg, John Philip

    Abstract: We report the electrical injection and detection of spin polarized current in lateral ferromagnet-nonmagnet-ferromagnet spin valve devices, ferromagnet being cobalt and nonmagnet being indium (In) or indium oxide (In2O3) nanostructures. The In nanostructures were grown by depositing pure In on lithographically pre-patterned structures. In2O3 nanostructures were obtained by oxidation of In nanostru… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.