Skip to main content

Showing 1–30 of 30 results for author: Venkatesh, K

.
  1. arXiv:2506.20496  [pdf, ps, other

    cs.RO

    Critical Anatomy-Preserving & Terrain-Augmenting Navigation (CAPTAiN): Application to Laminectomy Surgical Education

    Authors: Jonathan Wang, Hisashi Ishida, David Usevitch, Kesavan Venkatesh, Yi Wang, Mehran Armand, Rachel Bronheim, Amit Jain, Adnan Munawar

    Abstract: Surgical training remains a crucial milestone in modern medicine, with procedures such as laminectomy exemplifying the high risks involved. Laminectomy drilling requires precise manual control to mill bony tissue while preserving spinal segment integrity and avoiding breaches in the dura: the protective membrane surrounding the spinal cord. Despite unintended tears occurring in up to 11.3% of case… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  2. arXiv:2505.09858  [pdf, other

    cs.CV

    Mission Balance: Generating Under-represented Class Samples using Video Diffusion Models

    Authors: Danush Kumar Venkatesh, Isabel Funke, Micha Pfeiffer, Fiona Kolbinger, Hanna Maria Schmeiser, Juergen Weitz, Marius Distler, Stefanie Speidel

    Abstract: Computer-assisted interventions can improve intra-operative guidance, particularly through deep learning methods that harness the spatiotemporal information in surgical videos. However, the severe data imbalance often found in surgical video datasets hinders the development of high-performing models. In this work, we aim to overcome the data imbalance by synthesizing surgical videos. We propose a… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: Early accept at MICCAI 2025

  3. arXiv:2504.05306  [pdf, other

    cs.CV

    CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models

    Authors: Kavana Venkatesh, Connor Dunlop, Pinar Yanardag

    Abstract: Creativity in AI imagery remains a fundamental challenge, requiring not only the generation of visually compelling content but also the capacity to add novel, expressive, and artistically rich transformations to images. Unlike conventional editing tasks that rely on direct prompt-based modifications, creative image editing demands an autonomous, iterative approach that balances originality, cohere… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: Project URL: https://crea-diffusion.github.io

  4. arXiv:2504.00022  [pdf, other

    eess.IV cs.CV

    Autonomous AI for Multi-Pathology Detection in Chest X-Rays: A Multi-Site Study in the Indian Healthcare System

    Authors: Bargava Subramanian, Shajeev Jaikumar, Praveen Shastry, Naveen Kumarasami, Kalyan Sivasailam, Anandakumar D, Keerthana R, Mounigasri M, Kishore Prasath Venkatesh

    Abstract: Study Design: The study outlines the development of an autonomous AI system for chest X-ray (CXR) interpretation, trained on a vast dataset of over 5 million X rays sourced from healthcare systems across India. This AI system integrates advanced architectures including Vision Transformers, Faster R-CNN, and various U Net models (such as Attention U-Net, U-Net++, and Dense U-Net) to enable comprehe… ▽ More

    Submitted 2 April, 2025; v1 submitted 28 March, 2025; originally announced April 2025.

    Comments: 27 pages , 8 figures

    MSC Class: 68T07

  5. arXiv:2503.22176  [pdf, other

    eess.IV cs.CV

    A Multi-Site Study on AI-Driven Pathology Detection and Osteoarthritis Grading from Knee X-Ray

    Authors: Bargava Subramanian, Naveen Kumarasami, Praveen Shastry, Kalyan Sivasailam, Anandakumar D, Keerthana R, Mounigasri M, Abilaasha G, Kishore Prasath Venkatesh

    Abstract: Introduction: Bone health disorders like osteoarthritis and osteoporosis pose major global health challenges, often leading to delayed diagnoses due to limited diagnostic tools. This study presents an AI-powered system that analyzes knee X-rays to detect key pathologies, including joint space narrowing, sclerosis, osteophytes, tibial spikes, alignment issues, and soft tissue anomalies. It also gra… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

    Comments: 15 pages, 2 figures

    MSC Class: 68T07

  6. arXiv:2503.20316  [pdf, other

    eess.IV cs.CV

    AI-Driven MRI Spine Pathology Detection: A Comprehensive Deep Learning Approach for Automated Diagnosis in Diverse Clinical Settings

    Authors: Bargava Subramanian, Naveen Kumarasami, Praveen Shastry, Raghotham Sripadraj, Kalyan Sivasailam, Anandakumar D, Abinaya Ramachandran, Sudhir MP, Gunakutti G, Kishore Prasath Venkatesh

    Abstract: Study Design: This study presents the development of an autonomous AI system for MRI spine pathology detection, trained on a dataset of 2 million MRI spine scans sourced from diverse healthcare facilities across India. The AI system integrates advanced architectures, including Vision Transformers, U-Net with cross-attention, MedSAM, and Cascade R-CNN, enabling comprehensive classification, segment… ▽ More

    Submitted 28 March, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

    Comments: 20 pages , 3 figurea

    MSC Class: 68T07

  7. arXiv:2503.20306  [pdf, other

    eess.IV cs.CV

    3D Convolutional Neural Networks for Improved Detection of Intracranial bleeding in CT Imaging

    Authors: Bargava Subramanian, Naveen Kumarasami, Praveen Shastry, Kalyan Sivasailam, Anandakumar D, Elakkiya R, Harsha KG, Rithanya V, Harini T, Afshin Hussain, Kishore Prasath Venkatesh

    Abstract: Background: Intracranial bleeding (IB) is a life-threatening condition caused by traumatic brain injuries, including epidural, subdural, subarachnoid, and intraparenchymal hemorrhages. Rapid and accurate detection is crucial to prevent severe complications. Traditional imaging can be slow and prone to variability, especially in high-pressure scenarios. Artificial Intelligence (AI) provides a solut… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 12 pages,4 figures

    MSC Class: 68T07

  8. arXiv:2503.14538  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Vision-Language Models for Acute Tuberculosis Diagnosis: A Multimodal Approach Combining Imaging and Clinical Data

    Authors: Ananya Ganapthy, Praveen Shastry, Naveen Kumarasami, Anandakumar D, Keerthana R, Mounigasri M, Varshinipriya M, Kishore Prasath Venkatesh, Bargava Subramanian, Kalyan Sivasailam

    Abstract: Background: This study introduces a Vision-Language Model (VLM) leveraging SIGLIP and Gemma-3b architectures for automated acute tuberculosis (TB) screening. By integrating chest X-ray images and clinical notes, the model aims to enhance diagnostic accuracy and efficiency, particularly in resource-limited settings. Methods: The VLM combines visual data from chest X-rays with clinical context to… ▽ More

    Submitted 1 April, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: 11 pages, 3 figures

    MSC Class: 68T07; 68T45; 92C55; 92C50; 68U10

  9. arXiv:2503.14536  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Advancing Chronic Tuberculosis Diagnostics Using Vision-Language Models: A Multi modal Framework for Precision Analysis

    Authors: Praveen Shastry, Sowmya Chowdary Muthulur, Naveen Kumarasami, Anandakumar D, Mounigasri M, Keerthana R, Kishore Prasath Venkatesh, Bargava Subramanian, Kalyan Sivasailam, Revathi Ezhumalai, Abitha Marimuthu

    Abstract: Background: This study proposes a Vision-Language Model (VLM) leveraging the SIGLIP encoder and Gemma-3b transformer decoder to enhance automated chronic tuberculosis (TB) screening. By integrating chest X-ray images with clinical data, the model addresses the challenges of manual interpretation, improving diagnostic consistency and accessibility, particularly in resource-constrained settings. M… ▽ More

    Submitted 28 March, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: 10 pages , 3 figures

    MSC Class: 68T07; 92C55; 68U10; 92C50; 60G35

  10. arXiv:2503.11281  [pdf, other

    eess.IV cs.AI

    AI and Deep Learning for Automated Segmentation and Quantitative Measurement of Spinal Structures in MRI

    Authors: Praveen Shastry, Bhawana Sonawane, Kavya Mohan, Naveen Kumarasami, Raghotham Sripadraj, Anandakumar D, Keerthana R, Mounigasri M, Kaviya SP, Kishore Prasath Venkatesh, Bargava Subramanian, Kalyan Sivasailam

    Abstract: Background: Accurate spinal structure measurement is crucial for assessing spine health and diagnosing conditions like spondylosis, disc herniation, and stenosis. Manual methods for measuring intervertebral disc height and spinal canal diameter are subjective and time-consuming. Automated solutions are needed to improve accuracy, efficiency, and reproducibility in clinical practice. Purpose: Thi… ▽ More

    Submitted 19 March, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: 16 pages, 2 figures

    MSC Class: 92C55; 68T07; 68U10; 62P10; 65D18

  11. arXiv:2503.10717  [pdf, other

    eess.IV cs.AI cs.CV

    Deep Learning-Based Automated Workflow for Accurate Segmentation and Measurement of Abdominal Organs in CT Scans

    Authors: Praveen Shastry, Ashok Sharma, Kavya Mohan, Naveen Kumarasami, Anandakumar D, Mounigasri M, Keerthana R, Kishore Prasath Venkatesh, Bargava Subramanian, Kalyan Sivasailam

    Abstract: Background: Automated analysis of CT scans for abdominal organ measurement is crucial for improving diagnostic efficiency and reducing inter-observer variability. Manual segmentation and measurement of organs such as the kidneys, liver, spleen, and prostate are time-consuming and subject to inconsistency, underscoring the need for automated approaches. Purpose: The purpose of this study is to de… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 13 pages , 3 figures

    MSC Class: 68T99

  12. arXiv:2412.09614  [pdf, other

    cs.CV cs.CL

    Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

    Authors: Kavana Venkatesh, Yusuf Dalva, Ismini Lourentzou, Pinar Yanardag

    Abstract: We introduce a novel approach to enhance the capabilities of text-to-image models by incorporating a graph-based RAG. Our system dynamically retrieves detailed character information and relational data from the knowledge graph, enabling the generation of visually accurate and contextually rich images. This capability significantly improves upon the limitations of existing T2I models, which often s… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: Project Page: https://context-canvas.github.io/

  13. arXiv:2412.09611  [pdf, other

    cs.CV

    FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

    Authors: Yusuf Dalva, Kavana Venkatesh, Pinar Yanardag

    Abstract: Rectified flow models have emerged as a dominant approach in image generation, showcasing impressive capabilities in high-quality image synthesis. However, despite their effectiveness in visual generation, rectified flow models often struggle with disentangled editing of images. This limitation prevents the ability to perform precise, attribute-specific modifications without affecting unrelated as… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: Project Page: https://fluxspace.github.io

  14. arXiv:2410.07753  [pdf, other

    cs.CV cs.LG

    Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion Models

    Authors: Danush Kumar Venkatesh, Dominik Rivoir, Micha Pfeiffer, Fiona Kolbinger, Stefanie Speidel

    Abstract: In computer-assisted surgery, automatically recognizing anatomical organs is crucial for understanding the surgical scene and providing intraoperative assistance. While machine learning models can identify such structures, their deployment is hindered by the need for labeled, diverse surgical datasets with anatomical annotations. Labeling multiple classes (i.e., organs) in a surgical scene is time… ▽ More

    Submitted 21 November, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted at WACV 2025

  15. Fault Analysis And Predictive Maintenance Of Induction Motor Using Machine Learning

    Authors: Kavana Venkatesh, Neethi M

    Abstract: Induction motors are one of the most crucial electrical equipment and are extensively used in industries in a wide range of applications. This paper presents a machine learning model for the fault detection and classification of induction motor faults by using three phase voltages and currents as inputs. The aim of this work is to protect vital electrical components and to prevent abnormal event p… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

    Comments: Presented at ICEECCOT-2018, Published in IEEE Xplore, 6 pages, 3 figures

    Journal ref: ICEECCOT-2018, Mysuru, India, 2018, pp. 1-6

  16. arXiv:2408.09822  [pdf, other

    cs.CV

    SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models

    Authors: Danush Kumar Venkatesh, Dominik Rivoir, Micha Pfeiffer, Stefanie Speidel

    Abstract: Computer-assisted surgery (CAS) systems are designed to assist surgeons during procedures, thereby reducing complications and enhancing patient care. Training machine learning models for these systems requires a large corpus of annotated datasets, which is challenging to obtain in the surgical domain due to patient privacy concerns and the significant labeling effort required from doctors. Previou… ▽ More

    Submitted 11 October, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: Accepted at ECCV workshop on Synthetic Data for ComputerVision

  17. arXiv:2402.08088  [pdf, other

    cs.AI cs.LG eess.IV

    Out-of-Distribution Detection and Data Drift Monitoring using Statistical Process Control

    Authors: Ghada Zamzmi, Kesavan Venkatesh, Brandon Nelson, Smriti Prathapan, Paul H. Yi, Berkman Sahiner, Jana G. Delfino

    Abstract: Background: Machine learning (ML) methods often fail with data that deviates from their training distribution. This is a significant concern for ML-enabled devices in clinical settings, where data drift may cause unexpected performance that jeopardizes patient safety. Method: We propose a ML-enabled Statistical Process Control (SPC) framework for out-of-distribution (OOD) detection and drift mon… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  18. arXiv:2312.08621  [pdf, other

    cs.RO eess.SY

    Quadrupedal Locomotion Control On Inclined Surfaces Using Collocation Method

    Authors: Adarsh Salagame, Maria Gianello, Chenghao Wang, Kaushik Venkatesh, Shreyansh Pitroda, Rohit Rajput, Eric Sihite, Miriam Leeser, Alireza Ramezani

    Abstract: Inspired by Chukars wing-assisted incline running (WAIR), in this work, we employ a high-fidelity model of our Husky Carbon quadrupedal-legged robot to walk over steep slopes of up to 45 degrees. Chukars use the aerodynamic forces generated by their flapping wings to manipulate ground contact forces and traverse steep slopes and even overhangs. By exploiting the thrusters on Husky, we employed a c… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2306.00179

  19. arXiv:2311.14878  [pdf, other

    cs.RO eess.SY

    How Strong a Kick Should be to Topple Northeastern's Tumbling Robot?

    Authors: Adarsh Salagame, Neha Bhattachan, Andre Caetano, Ian McCarthy, Henry Noyes, Brandon Petersen, Alexander Qiu, Matthew Schroeter, Nolan Smithwick, Konrad Sroka, Jason Widjaja, Yash Bohra, Kaushik Venkatesh, Kruthika Gangaraju, Paul Ghanem, Ioannis Mandralis, Eric Sihite, Arash Kalantari, Alireza Ramezani

    Abstract: Rough terrain locomotion has remained one of the most challenging mobility questions. In 2022, NASA's Innovative Advanced Concepts (NIAC) Program invited US academic institutions to participate NASA's Breakthrough, Innovative \& Game-changing (BIG) Idea competition by proposing novel mobility systems that can negotiate extremely rough terrain, lunar bumpy craters. In this competition, Northeastern… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  20. arXiv:2309.03048  [pdf, other

    cs.CV

    Exploring Semantic Consistency in Unpaired Image Translation to Generate Data for Surgical Applications

    Authors: Danush Kumar Venkatesh, Dominik Rivoir, Micha Pfeiffer, Fiona Kolbinger, Marius Distler, Jürgen Weitz, Stefanie Speidel

    Abstract: In surgical computer vision applications, obtaining labeled training data is challenging due to data-privacy concerns and the need for expert annotation. Unpaired image-to-image translation techniques have been explored to automatically generate large annotated datasets by translating synthetic images to the realistic domain. However, preserving the structure and semantic consistency between the i… ▽ More

    Submitted 21 February, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at IPCAI 2024

  21. arXiv:2308.00183  [pdf, other

    cs.RO eess.SY

    Hovering Control of Flapping Wings in Tandem with Multi-Rotors

    Authors: Aniket Dhole, Bibek Gupta, Adarsh Salagame, Xuejian Niu, Yizhe Xu, Kaushik Venkatesh, Paul Ghanem, Ioannis Mandralis, Eric Sihite, Alireza Ramezani

    Abstract: This work briefly covers our efforts to stabilize the flight dynamics of Northeastern's tailless bat-inspired micro aerial vehicle, Aerobat. Flapping robots are not new. A plethora of examples is mainly dominated by insect-style design paradigms that are passively stable. However, Aerobat, in addition for being tailless, possesses morphing wings that add to the inherent complexity of flight contro… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

  22. arXiv:2206.08738  [pdf, other

    cs.LG

    Detecting Adversarial Examples in Batches -- a geometrical approach

    Authors: Danush Kumar Venkatesh, Peter Steinbach

    Abstract: Many deep learning methods have successfully solved complex tasks in computer vision and speech recognition applications. Nonetheless, the robustness of these models has been found to be vulnerable to perturbed inputs or adversarial examples, which are imperceptible to the human eye, but lead the model to erroneous output decisions. In this study, we adapt and introduce two geometric metrics, dens… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Submitted to AdvML workshop at ICML2022

  23. arXiv:2204.05591  [pdf

    cs.CV cs.AI

    Automatic detection of glaucoma via fundus imaging and artificial intelligence: A review

    Authors: Lauren Coan, Bryan Williams, Krishna Adithya Venkatesh, Swati Upadhyaya, Silvester Czanner, Rengaraj Venkatesh, Colin E. Willoughby, Srinivasan Kavitha, Gabriela Czanner

    Abstract: Glaucoma is a leading cause of irreversible vision impairment globally and cases are continuously rising worldwide. Early detection is crucial, allowing timely intervention which can prevent further visual field loss. To detect glaucoma, examination of the optic nerve head via fundus imaging can be performed, at the centre of which is the assessment of the optic cup and disc boundaries. Fundus ima… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

  24. arXiv:2112.14382  [pdf, other

    cs.CV cs.AI

    Self-Supervised Robustifying Guidance for Monocular 3D Face Reconstruction

    Authors: Hitika Tiwari, Min-Hung Chen, Yi-Min Tsai, Hsien-Kai Kuo, Hung-Jen Chen, Kevin Jou, K. S. Venkatesh, Yong-Sheng Chen

    Abstract: Despite the recent developments in 3D Face Reconstruction from occluded and noisy face images, the performance is still unsatisfactory. Moreover, most existing methods rely on additional dependencies, posing numerous constraints over the training procedure. Therefore, we propose a Self-Supervised RObustifying GUidancE (ROGUE) framework to obtain robustness against occlusions and noise in the face… ▽ More

    Submitted 21 October, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

    Comments: Accepted by The 33rd British Machine Vision Conference (BMVC) 2022. Evaluation code and datasets: https://github.com/ArcTrinity9/Datasets-ReaChOcc-and-SynChOcc

  25. arXiv:2111.08275  [pdf, other

    cs.LG

    Deep Distilling: automated code generation using explainable deep learning

    Authors: Paul J. Blazek, Kesavan Venkatesh, Milo M. Lin

    Abstract: Human reasoning can distill principles from observed patterns and generalize them to explain and solve novel problems. The most powerful artificial intelligence systems lack explainability and symbolic reasoning ability, and have therefore not achieved supremacy in domains requiring human understanding, such as science or common sense reasoning. Here we introduce deep distilling, a machine learnin… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    MSC Class: 68T05 (Primary); 68T07; 68T20; 68T37 (Secondary) ACM Class: I.2.2; I.2.6

  26. arXiv:2108.05287  [pdf, other

    eess.SP

    Semantic Mobile Base Station Placement

    Authors: Kritik Soman, K. S. Venkatesh

    Abstract: Location of Base Stations (BS) in mobile networks plays an important role in coverage and received signal strength. As Internet ofThings (IoT), autonomous vehicles and smart cities evolve, wireless net-work coverage will have an important role in ensuring seamless connectivity. Due to use of higher carrier frequencies, blockages cause communication to primarily be Line of Sight (LoS), increasing t… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: 12 pages

    MSC Class: 68T01

  27. arXiv:2104.02656  [pdf, other

    cs.CV cs.AI cs.GR cs.MM cs.SD eess.AS eess.IV

    Collaborative Learning to Generate Audio-Video Jointly

    Authors: Vinod K Kurmi, Vipul Bajaj, Badri N Patro, K S Venkatesh, Vinay P Namboodiri, Preethi Jyothi

    Abstract: There have been a number of techniques that have demonstrated the generation of multimedia data for one modality at a time using GANs, such as the ability to generate images, videos, and audio. However, so far, the task of multi-modal generation of data, specifically for audio and videos both, has not been sufficiently well-explored. Towards this, we propose a method that demonstrates that we are… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

    Comments: ICASSP 2021 (Accepted)

  28. arXiv:1705.07080  [pdf, other

    cs.CV

    Bitwise Operations of Cellular Automaton on Gray-scale Images

    Authors: Karttikeya Mangalam, K S Venkatesh

    Abstract: Cellular Automata (CA) theory is a discrete model that represents the state of each of its cells from a finite set of possible values which evolve in time according to a pre-defined set of transition rules. CA have been applied to a number of image processing tasks such as Convex Hull Detection, Image Denoising etc. but mostly under the limitation of restricting the input to binary images. In gene… ▽ More

    Submitted 19 May, 2017; originally announced May 2017.

    Comments: 5 Pages. The code is available at : https://github.com/karttikeya/Bitwise-CA-Opeartions/

  29. arXiv:1703.02340  [pdf, ps, other

    cs.RO

    Design and Development of an automated Robotic Pick & Stow System for an e-Commerce Warehouse

    Authors: Swagat Kumar, Anima Majumder, Samrat Dutta, Rekha Raja, Sharath Jotawar, Ashish Kumar, Manish Soni, Venkat Raju, Olyvia Kundu, Ehtesham Hassan Laxmidhar Behera, K. S. Venkatesh, Rajesh Sinha

    Abstract: In this paper, we provide details of a robotic system that can automate the task of picking and stowing objects from and to a rack in an e-commerce fulfillment warehouse. The system primarily comprises of four main modules: (1) Perception module responsible for recognizing query objects and localizing them in the 3-dimensional robot workspace; (2) Planning module generates necessary paths that the… ▽ More

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: 15 Pages, 25 Figures, 4 Tables, Journal Paper

  30. arXiv:1507.08445  [pdf, other

    cs.CV

    People Counting in High Density Crowds from Still Images

    Authors: Ankan Bansal, K. S. Venkatesh

    Abstract: We present a method of estimating the number of people in high density crowds from still images. The method estimates counts by fusing information from multiple sources. Most of the existing work on crowd counting deals with very small crowds (tens of individuals) and use temporal information from videos. Our method uses only still images to estimate the counts in high density images (hundreds to… ▽ More

    Submitted 30 July, 2015; originally announced July 2015.