
Showing 1–50 of 72 results for author: Chowdhury, P

Searching in archive cs.
  1. arXiv:2506.17409  [pdf, ps, other]

    cs.SD cs.LG eess.AS eess.SP

    Adaptive Control Attention Network for Underwater Acoustic Localization and Domain Adaptation

    Authors: Quoc Thinh Vo, Joe Woods, Priontu Chowdhury, David K. Han

    Abstract: Localizing acoustic sound sources in the ocean is a challenging task due to the complex and dynamic nature of the environment. Factors such as high background noise, irregular underwater geometries, and varying acoustic properties make accurate localization difficult. To address these obstacles, we propose a multi-branch network architecture designed to accurately predict the distance between a mo…

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: This paper has been accepted for the 33rd European Signal Processing Conference (EUSIPCO) 2025 in Palermo, Italy

  2. arXiv:2506.16892  [pdf, ps, other]

    cs.RO eess.SY

    Orbital Collision: An Indigenously Developed Web-based Space Situational Awareness Platform

    Authors: Partha Chowdhury, Harsha M, Ayush Gupta, Sanat K Biswas

    Abstract: This work presents an indigenous web-based platform, Orbital Collision (OrCo), created by the Space Systems Laboratory at IIIT Delhi, to enhance Space Situational Awareness (SSA) by predicting collision probabilities of space objects using Two Line Elements (TLE) data. The work highlights the growing challenges of congestion in the Earth's orbital environment, mainly due to space debris and defunct…

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: This work has been already submitted for STEP-IPSC 2025 Conference Proceedings

  3. arXiv:2506.09759  [pdf, ps, other]

    cs.SE

    Towards Bridging Formal Methods and Human Interpretability

    Authors: Abhijit Paul, Proma Chowdhury, Kazi Sakib

    Abstract: Labeled Transition Systems (LTS) are integral to model checking and design repair tools. System engineers frequently examine LTS designs during model checking or design repair to debug, identify inconsistencies, and validate system behavior. Despite LTS's significance, no prior research has examined human comprehension of these designs. To address this, we draw on traditional software engineering…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: Need to improve data annotation process in methodology section

  4. arXiv:2506.08702  [pdf, ps, other]

    cs.ET

    Educators' Perceptions of Large Language Models as Tutors: Comparing Human and AI Tutors in a Blind Text-only Setting

    Authors: Sankalan Pal Chowdhury, Terry Jingchen Zhang, Donya Rooein, Dirk Hovy, Tanja Käser, Mrinmaya Sachan

    Abstract: The rapid development of Large Language Models (LLMs) opens up the possibility of using them as personal tutors. This has led to the development of several intelligent tutoring systems and learning assistants that use LLMs as back-ends with various degrees of engineering. In this study, we seek to compare human tutors with LLM tutors in terms of engagement, empathy, scaffolding, and conciseness. W…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: Accepted to BEA@ACL 2025

  5. arXiv:2505.23763  [pdf, ps, other]

    cs.CV

    Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch

    Authors: Aneeshan Sain, Subhajit Maity, Pinaki Nath Chowdhury, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: As sketch research has collectively matured over time, its adaptation for at-mass commercialisation emerges on the immediate horizon. Despite an already mature research endeavour for photos, there is no research on the efficient inference specifically designed for sketch data. In this paper, we first demonstrate existing state-of-the-art efficient light-weight models designed for photos do not wor…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Accepted at CVPR 2025, Project Page: https://subhajitmaity.me/SketchDownTheFLOPs

  6. arXiv:2505.08585  [pdf, ps, other]

    cs.CV

    A Large-scale Benchmark on Geological Fault Delineation Models: Domain Shift, Training Dynamics, Generalizability, Evaluation and Inferential Behavior

    Authors: Jorge Quesada, Chen Zhou, Prithwijit Chowdhury, Mohammad Alotaibi, Ahmad Mustafa, Yusufjon Kumamnov, Mohit Prabhushankar, Ghassan AlRegib

    Abstract: Machine learning has taken a critical role in seismic interpretation workflows, especially in fault delineation tasks. However, despite the recent proliferation of pretrained models and synthetic datasets, the field still lacks a systematic understanding of the generalizability limits of these models across seismic data representing a variety of geologic, acquisition and processing settings. Distr…

    Submitted 13 May, 2025; originally announced May 2025.

  7. arXiv:2504.17720  [pdf, other]

    cs.CL cs.AI

    Multilingual Performance Biases of Large Language Models in Education

    Authors: Vansh Gupta, Sankalan Pal Chowdhury, Vilém Zouhar, Donya Rooein, Mrinmaya Sachan

    Abstract: Large language models (LLMs) are increasingly being adopted in educational settings. These applications expand beyond English, though current LLMs remain primarily English-centric. In this work, we ascertain if their use in education settings in non-English languages is warranted. We evaluated the performance of popular LLMs on four educational tasks: identifying student misconceptions, providing…

    Submitted 24 April, 2025; originally announced April 2025.

  8. arXiv:2503.14129  [pdf, other]

    cs.CV

    SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models

    Authors: Subhadeep Koley, Tapas Kumar Dutta, Aneeshan Sain, Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: While foundation models have revolutionised computer vision, their effectiveness for sketch understanding remains limited by the unique challenges of abstract, sparse visual inputs. Through systematic analysis, we uncover two fundamental limitations: Stable Diffusion (SD) struggles to extract meaningful features from abstract sketches (unlike its success with photos), and exhibits a pronounced fre…

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: Accepted in CVPR 2025. Project page available at https://subhadeepkoley.github.io/SketchFusion/

  9. arXiv:2501.16022  [pdf, other]

    cs.CV

    SketchYourSeg: Mask-Free Subjective Image Segmentation via Freehand Sketches

    Authors: Subhadeep Koley, Viswanatha Reddy Gajjala, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: We introduce SketchYourSeg, a novel framework that establishes freehand sketches as a powerful query modality for subjective image segmentation across entire galleries through a single exemplar sketch. Unlike text prompts that struggle with spatial specificity or interactive methods confined to single-image operations, sketches naturally combine semantic intent with structural precision. This uniq…

    Submitted 17 March, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

  10. arXiv:2412.08589  [pdf]

    astro-ph.SR astro-ph.IM cs.CV cs.LG

    SPACE-SUIT: An Artificial Intelligence based chromospheric feature extractor and classifier for SUIT

    Authors: Pranava Seth, Vishal Upendran, Megha Anand, Janmejoy Sarkar, Soumya Roy, Priyadarshan Chaki, Pratyay Chowdhury, Borishan Ghosh, Durgesh Tripathi

    Abstract: The Solar Ultraviolet Imaging Telescope (SUIT) onboard Aditya-L1 is an imager that observes the solar photosphere and chromosphere through observations in the wavelength range of 200-400 nm. A comprehensive understanding of the plasma and thermodynamic properties of chromospheric and photospheric morphological structures requires a large sample statistical study, necessitating the development of au…

    Submitted 11 December, 2024; originally announced December 2024.

  11. arXiv:2412.05180  [pdf, other]

    cs.CV

    DreamColour: Controllable Video Colour Editing without Training

    Authors: Chaitat Utintu, Pinaki Nath Chowdhury, Aneeshan Sain, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: Video colour editing is a crucial task for content creation, yet existing solutions either require painstaking frame-by-frame manipulation or produce unrealistic results with temporal artefacts. We present a practical, training-free framework that makes precise video colour editing accessible through an intuitive interface while maintaining professional-quality output. Our key insight is that by d…

    Submitted 6 December, 2024; originally announced December 2024.

    Comments: Project page available at https://chaitron.github.io/DreamColour-demo

  12. arXiv:2409.15981  [pdf, other]

    cs.CY

    GPT-4 as a Homework Tutor can Improve Student Engagement and Learning Outcomes

    Authors: Alessandro Vanzo, Sankalan Pal Chowdhury, Mrinmaya Sachan

    Abstract: This work contributes to the scarce empirical literature on LLM-based interactive homework in real-world educational settings and offers a practical, scalable solution for improving homework in schools. Homework is an important part of education in schools across the world, but in order to maximize benefit, it needs to be accompanied with feedback and followup questions. We developed a prompting s…

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: Submitted to LAK25

  13. arXiv:2407.18108  [pdf, other]

    cs.LG cs.CY cs.SI physics.soc-ph

    Graph Neural Ordinary Differential Equations for Coarse-Grained Socioeconomic Dynamics

    Authors: James Koch, Pranab Roy Chowdhury, Heng Wan, Parin Bhaduri, Jim Yoon, Vivek Srikrishnan, W. Brent Daniel

    Abstract: We present a data-driven machine-learning approach for modeling space-time socioeconomic dynamics. Through coarse-graining fine-scale observations, our modeling framework simplifies these complex systems to a set of tractable mechanistic relationships -- in the form of ordinary differential equations -- while preserving critical system behaviors. This approach allows for expedited 'what if' studie…

    Submitted 25 July, 2024; originally announced July 2024.

  14. arXiv:2407.03893  [pdf, other]

    cs.CV cs.AI

    Do Generalised Classifiers really work on Human Drawn Sketches?

    Authors: Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: This paper, for the first time, marries large foundation models with human sketch understanding. We demonstrate what this brings -- a paradigm shift in terms of generalised sketch representation learning (e.g., classification). This generalisation happens on two fronts: (i) generalisation across unknown categories (i.e., open-set), and (ii) generalisation traversing abstraction levels (i.e., good…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  15. arXiv:2407.01810  [pdf, other]

    cs.CV

    Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval

    Authors: Aneeshan Sain, Pinaki Nath Chowdhury, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: In this paper, we delve into the intricate dynamics of Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) by addressing a critical yet overlooked aspect -- the choice of viewpoint during sketch creation. Unlike photo systems that seamlessly handle diverse views through extensive datasets, sketch systems, with limited data collected from fixed perspectives, face challenges. Our pilot study, employ…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted in European Conference on Computer Vision (ECCV) 2024

  16. arXiv:2406.15210  [pdf, other]

    cs.CR

    Assessing Effectiveness of Cyber Essentials Technical Controls

    Authors: Priyanka Badva, Partha Das Chowdhury, Kopo M. Ramokapane, Barnaby Craggs, Awais Rashid

    Abstract: Cyber Essentials (CE) comprise a set of controls designed to protect organisations, irrespective of their size, against cyber attacks. The controls are firewalls, secure configuration, user access control, malware protection & security update management. In this work, we explore the extent to which CE remains robust against an ever-evolving threat landscape. To that end, we reconstruct 45 breaches…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: We have submitted this paper in ACM Digital Threats: Research and Practice (DTRAP) Journal. The paper is currently in the review process

  17. arXiv:2406.07820  [pdf, other]

    cs.CV cs.LG

    Are Objective Explanatory Evaluation metrics Trustworthy? An Adversarial Analysis

    Authors: Prithwijit Chowdhury, Mohit Prabhushankar, Ghassan AlRegib, Mohamed Deriche

    Abstract: Explainable AI (XAI) has revolutionized the field of deep learning by empowering users to have more trust in neural network models. The field of XAI allows users to probe the inner workings of these algorithms to elucidate their decision-making processes. The rise in popularity of XAI has led to the advent of different strategies to produce explanations, all of which only occasionally agree. Thus…

    Submitted 11 June, 2024; originally announced June 2024.

  18. arXiv:2405.18716  [pdf, other]

    cs.CV

    SketchDeco: Decorating B&W Sketches with Colour

    Authors: Chaitat Utintu, Pinaki Nath Chowdhury, Aneeshan Sain, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: This paper introduces a novel approach to sketch colourisation, inspired by the universal childhood activity of colouring and its professional applications in design and story-boarding. Striking a balance between precision and convenience, our method utilises region masks and colour palettes to allow intuitive user control, steering clear of the meticulousness of manual colour assignments or the l…

    Submitted 28 May, 2024; originally announced May 2024.

  19. arXiv:2403.09480  [pdf, other]

    cs.CV cs.AI

    What Sketch Explainability Really Means for Downstream Tasks

    Authors: Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we explore the unique modality of sketch for explainability, emphasising the profound impact of human strokes compared to conventional pixel-oriented studies. Beyond explanations of network behavior, we discern the genuine implications of explainability across diverse downstream sketch-related tasks. We propose a lightweight and portable explainability solution -- a seamless plugin…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  20. arXiv:2403.09344  [pdf, other]

    cs.CV cs.AI

    SketchINR: A First Look into Sketches as Implicit Neural Representations

    Authors: Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Tao Xiang, Timothy Hospedales, Yi-Zhe Song

    Abstract: We propose SketchINR, to advance the representation of vector sketches with implicit neural models. A variable length vector sketch is compressed into a latent space of fixed dimension that implicitly encodes the underlying shape as a function of time and strokes. The learned function predicts the $xy$ point coordinates in a sketch at each time and stroke. Despite its simplicity, SketchINR outperf…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  21. arXiv:2403.07234  [pdf, other]

    cs.CV

    It's All About Your Sketch: Democratising Sketch Control in Diffusion Models

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Deeptanshu Sekhri, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: This paper unravels the potential of sketches for diffusion models, addressing the deceptive promise of direct sketch control in generative AI. We importantly democratise the process, enabling amateur sketches to generate precise images, living up to the commitment of "what you sketch is what you get". A pilot study underscores the necessity, revealing that deformities in existing models stem from…

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024. Project page available at https://subhadeepkoley.github.io/StableSketching

  22. arXiv:2403.07222  [pdf, other]

    cs.CV

    You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Two primary input modalities prevail in image retrieval: sketch and text. While text is widely used for inter-category retrieval tasks, sketches have been established as the sole preferred modality for fine-grained image retrieval due to their ability to capture intricate visual details. In this paper, we question the reliance on sketches alone for fine-grained image retrieval by simultaneously ex…

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024. Project page available at https://subhadeepkoley.github.io/Sketch2Word

  23. arXiv:2403.07214  [pdf, other]

    cs.CV

    Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: This paper, for the first time, explores text-to-image diffusion models for Zero-Shot Sketch-based Image Retrieval (ZS-SBIR). We highlight a pivotal discovery: the capacity of text-to-image diffusion models to seamlessly bridge the gap between sketches and photos. This proficiency is underpinned by their robust cross-modal capabilities and shape bias, findings that are substantiated through our pi…

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024. Project page available at https://subhadeepkoley.github.io/DiffusionZSSBIR

  24. arXiv:2403.07203  [pdf, other]

    cs.CV

    How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we propose a novel abstraction-aware sketch-based image retrieval framework capable of handling sketch abstraction at varied levels. Prior works had mainly focused on tackling sub-factors such as drawing style and order, we instead attempt to model abstraction as a whole, and propose feature-level and retrieval granularity-level designs so that the system builds into its DNA the nec…

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024. Project page available at https://subhadeepkoley.github.io/AbstractAway

  25. arXiv:2403.03307  [pdf, other]

    cs.CL

    Book2Dial: Generating Teacher-Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots

    Authors: Junling Wang, Jakub Macina, Nico Daheim, Sankalan Pal Chowdhury, Mrinmaya Sachan

    Abstract: Educational chatbots are a promising tool for assisting student learning. However, the development of effective chatbots in education has been challenging, as high-quality data is seldom available in this domain. In this paper, we propose a framework for generating synthetic teacher-student interactions grounded in a set of textbooks. Our approaches capture one aspect of learning interactions wher…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 24 pages, 19 tables, 2 figures

  26. arXiv:2402.09216  [pdf, other]

    cs.CL cs.HC

    AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails

    Authors: Sankalan Pal Chowdhury, Vilém Zouhar, Mrinmaya Sachan

    Abstract: Large Language Models (LLMs) have found several use cases in education, ranging from automatic question generation to essay evaluation. In this paper, we explore the potential of using Large Language Models (LLMs) to author Intelligent Tutoring Systems. A common pitfall of LLMs is their straying from desired pedagogical strategies such as leaking the answer to the student, and in general, providin…

    Submitted 25 April, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: To be presented at Learning@Scale 2024

  27. arXiv:2312.04364  [pdf, other]

    cs.CV

    DemoCaricature: Democratising Caricature Generation with a Rough Sketch

    Authors: Dar-Yen Chen, Ayan Kumar Bhunia, Subhadeep Koley, Aneeshan Sain, Pinaki Nath Chowdhury, Yi-Zhe Song

    Abstract: In this paper, we democratise caricature generation, empowering individuals to effortlessly craft personalised caricatures with just a photo and a conceptual sketch. Our objective is to strike a delicate balance between abstraction and identity, while preserving the creativity and subjectivity inherent in a sketch. To achieve this, we present Explicit Rank-1 Model Editing alongside single-image pe…

    Submitted 24 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

  28. arXiv:2312.04043  [pdf, other]

    cs.CV cs.AI

    Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes

    Authors: Hmrishav Bandyopadhyay, Subhadeep Koley, Ayan Das, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we democratise 3D content creation, enabling precise generation of 3D shapes from abstract sketches while overcoming limitations tied to drawing skills. We introduce a novel part-level modelling and alignment framework that facilitates abstraction modelling and cross-modal correspondence. Leveraging the same part-level decoder, our approach seamlessly extends to sketch modelling by…

    Submitted 7 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: CVPR 2024, Project Page: https://hmrishavbandy.github.io/doodle23d/

  29. arXiv:2307.02332  [pdf, other]

    cs.HC cs.CY

    Co-creating a Transdisciplinary Map of Technology-mediated Harms, Risks and Vulnerabilities: Challenges, Ambivalences and Opportunities

    Authors: Andrés Domínguez Hernández, Kopo M. Ramokapane, Partha Das Chowdhury, Ola Michalec, Emily Johnstone, Emily Godwin, Alicia G Cork, Awais Rashid

    Abstract: The phrase "online harms" has emerged in recent years out of a growing political willingness to address the ethical and social issues associated with the use of the Internet and digital technology at large. The broad landscape that surrounds online harms gathers a multitude of disciplinary, sectoral and organizational efforts while raising myriad challenges and opportunities for the crossing entre…

    Submitted 19 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 21 pages, 8 figures, to appear in The 26th ACM Conference On Computer-Supported Cooperative Work And Social Computing. October 13-18, 2023. Minneapolis, MN USA

  30. arXiv:2306.10830  [pdf, other]

    cs.CV

    3D VR Sketch Guided 3D Shape Prototyping and Exploration

    Authors: Ling Luo, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song, Yulia Gryaditskaya

    Abstract: 3D shape modeling is labor-intensive, time-consuming, and requires years of expertise. To facilitate 3D shape modeling, we propose a 3D shape generation network that takes a 3D VR sketch as a condition. We assume that sketches are created by novices without art training and aim to reconstruct geometrically realistic 3D shapes of a given category. To handle potential sketch ambiguity, our method cr…

    Submitted 10 January, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted by ICCV 2023

  31. arXiv:2305.14536  [pdf, other]

    cs.CL

    MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems

    Authors: Jakub Macina, Nico Daheim, Sankalan Pal Chowdhury, Tanmay Sinha, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan

    Abstract: While automatic dialogue tutors hold great potential in making education personalized and more accessible, research on such systems has been hampered by a lack of sufficiently large and high-quality datasets. Collecting such datasets remains challenging, as recording tutoring sessions raises privacy concerns and crowdsourcing leads to insufficient data quality. To address this, we propose a framew…

    Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Jakub Macina, Nico Daheim, and Sankalan Pal Chowdhury contributed equally to this work. Accepted at EMNLP2023 Findings. Code and dataset available: https://github.com/eth-nlped/mathdial

  32. arXiv:2303.15149  [pdf, other]

    cs.CV

    What Can Human Sketches Do for Object Detection?

    Authors: Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

    Abstract: Sketches are highly expressive, inherently capturing subjective and fine-grained visual cues. The exploration of such innate properties of human sketches has, however, been limited to that of image retrieval. In this paper, for the first time, we cultivate the expressiveness of sketches but for the fundamental vision task of object detection. The end result is a sketch-enabled object detection fra…

    Submitted 28 October, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: Best Paper Finalist (Top 12 Best Papers). Presented in special single-track plenary sessions to all attendees in Computer Vision and Pattern Recognition (CVPR), 2023. Updated an error in Fig.3 (from Softmax to Cross Entropy). Thanks to the community for pointing it out

  33. arXiv:2303.13779  [pdf, other]

    cs.CV

    Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR

    Authors: Aneeshan Sain, Ayan Kumar Bhunia, Subhadeep Koley, Pinaki Nath Chowdhury, Soumitri Chattopadhyay, Tao Xiang, Yi-Zhe Song

    Abstract: This paper advances the fine-grained sketch-based image retrieval (FG-SBIR) literature by putting forward a strong baseline that overshoots prior state-of-the-arts by ~11%. This is not via complicated design though, but by addressing two critical issues facing the community (i) the gold standard triplet loss does not enforce holistic latent space geometry, and (ii) there are never enough sketches…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR 2023. Project page available at https://aneeshan95.github.io/Sketch_PVT/

  34. arXiv:2303.13440  [pdf, other]

    cs.CV

    CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not

    Authors: Aneeshan Sain, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we leverage CLIP for zero-shot sketch based image retrieval (ZS-SBIR). We are largely inspired by recent advances on foundation models and the unparalleled generalisation ability they seem to offer, but for the first time tailor it to benefit the sketch community. We put forward novel designs on how best to achieve this synergy, for both the category setting and the fine-grained set…

    Submitted 27 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR 2023. Project page available at https://aneeshan95.github.io/Sketch_LVM/

  35. arXiv:2303.11502  [pdf, other]

    cs.CV

    Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings

    Authors: Ayan Kumar Bhunia, Subhadeep Koley, Amandeep Kumar, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Human sketch has already proved its worth in various visual understanding tasks (e.g., retrieval, segmentation, image-captioning, etc). In this paper, we reveal a new trait of sketches - that they are also salient. This is intuitive as sketching is a natural attentive process at its core. More specifically, we aim to study how sketches can be used as a weak label to detect salient objects present…

    Submitted 30 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. Project page available at https://ayankumarbhunia.github.io/Sketch2Saliency/

  36. arXiv:2303.11162  [pdf, other]

    cs.CV

    Picture that Sketch: Photorealistic Image Generation from Abstract Sketches

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Given an abstract, deformed, ordinary sketch from untrained amateurs like you and me, this paper turns it into a photorealistic image - just like those shown in Fig. 1(a), all non-cherry-picked. We differ significantly from prior art in that we do not dictate an edgemap-like sketch to start with, but aim to work with abstract free-hand human sketches. In doing so, we essentially democratise the sk…

    Submitted 30 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR 2023. Project page available at https://subhadeepkoley.github.io/PictureThatSketch

  37. arXiv:2301.05653  [pdf, other]

    cs.CR

    Threat Models over Space and Time: A Case Study of E2EE Messaging Applications

    Authors: Partha Das Chowdhury, Maria Sameen, Jenny Blessing, Nicholas Boucher, Joseph Gardiner, Tom Burrows, Ross Anderson, Awais Rashid

    Abstract: Threat modelling is foundational to secure systems engineering and should be done in consideration of the context within which systems operate. On the other hand, the continuous evolution of both the technical sophistication of threats and the system attack surface is an inescapable reality. In this work, we explore the extent to which real-world systems engineering reflects the changing threat co…

    Submitted 28 May, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

  38. arXiv:2211.02341  [pdf, other]

    cs.SE

    Better Call Saltzer & Schroeder: A Retrospective Security Analysis of SolarWinds & Log4j

    Authors: Partha Das Chowdhury, Mohammad Tahaei, Awais Rashid

    Abstract: Saltzer & Schroeder's principles aim to bring security to the design of computer systems. We investigate SolarWinds Orion update and Log4j to unpack the intersections where observance of these principles could have mitigated the embedded vulnerabilities. The common principles that were not observed include fail safe defaults, economy of mechanism, complete mediation and …

    Submitted 4 November, 2022; originally announced November 2022.

  39. arXiv:2207.01723  [pdf, other]

    cs.CV

    Adaptive Fine-Grained Sketch-Based Image Retrieval

    Authors: Ayan Kumar Bhunia, Aneeshan Sain, Parth Shah, Animesh Gupta, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: The recent focus on Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) has shifted towards generalising a model to new categories without any training data from them. In real-world applications, however, a trained FG-SBIR model is often applied to both new categories and different human sketchers, i.e., different drawing styles. Although this complicates the generalisation problem, fortunately, a…

    Submitted 19 August, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted in ECCV 2022. Minor typos and Eq.4 corrected

  40. arXiv:2204.11964  [pdf, other]

    cs.CV

    SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text

    Authors: Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we extend scene understanding to include that of human sketch. The result is a complete trilogy of scene representation from three diverse and complementary modalities -- sketch, photo, and text. Instead of learning a rigid three-way embedding and be done with it, we focus on learning a flexible joint embedding that fully supports the "optionality" that this complementarity brings.…

    Submitted 26 March, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted in Computer Vision and Pattern Recognition (CVPR), 2023

  41. arXiv:2204.08029  [pdf]

    cs.CV

    Deep Learning based Automatic Detection of Dicentric Chromosome

    Authors: Angad Singh Wadhwa, Nikhil Tyagi, Pinaki Roy Chowdhury

    Abstract: Automatic detection of dicentric chromosomes is an essential step to estimate radiation exposure and development of end to end emergency bio dosimetry systems. During accidents, a large amount of data is required to be processed for extensive testing to formulate a medical treatment plan for the masses, which requires this process to be automated. Current approaches require human adjustments accor…

    Submitted 17 April, 2022; originally announced April 2022.

  42. arXiv:2203.14817  [pdf, other]

    cs.CV

    Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval

    Authors: Ayan Kumar Bhunia, Subhadeep Koley, Abdullah Faiz Ur Rahman Khilji, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Sketching enables many exciting applications, notably, image retrieval. The fear-to-sketch problem (i.e., "I can't sketch") has however proven to be fatal for its widespread adoption. This paper tackles this "fear" head on, and for the first time, proposes an auxiliary module for existing retrieval models that predominantly lets the users sketch without having to worry. We first conducted a pilot…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2022 Code: https://github.com/AyanKumarBhunia/Stroke_Subset_Selector-for-FGSBIR

  43. arXiv:2203.14804  [pdf, other]

    cs.CV

    Partially Does It: Towards Scene-Level FG-SBIR with Partial Input

    Authors: Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Viswanatha Reddy Gajjala, Aneeshan Sain, Tao Xiang, Yi-Zhe Song

    Abstract: We scrutinise an important observation plaguing scene-level sketch research -- that a significant portion of scene sketches are "partial". A quick pilot study reveals: (i) a scene sketch does not necessarily contain all objects in the corresponding photo, due to the subjective holistic interpretation of scenes, (ii) there exists significant empty (white) regions as a result of object-level abstrac…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted in CVPR 2022

  44. arXiv:2203.14691  [pdf, other]

    cs.CV

    Sketch3T: Test-Time Training for Zero-Shot SBIR

    Authors: Aneeshan Sain, Ayan Kumar Bhunia, Vaishnav Potlapalli, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Zero-shot sketch-based image retrieval typically asks for a trained model to be applied as is to unseen categories. In this paper, we question to argue that this setup by definition is not compatible with the inherent abstract and subjective nature of sketches, i.e., the model might transfer well to new categories, but will not understand sketches existing in different test-time distribution as a…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 10 pages, 5 figures. Accepted in CVPR 2022

  45. arXiv:2203.02113  [pdf, other]

    cs.CV

    FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context

    Authors: Pinaki Nath Chowdhury, Aneeshan Sain, Ayan Kumar Bhunia, Tao Xiang, Yulia Gryaditskaya, Yi-Zhe Song

    Abstract: We advance sketch research to scenes with the first dataset of freehand scene sketches, FS-COCO. With practical applications in mind, we collect sketches that convey scene content well but can be sketched within a few minutes by a person with any sketching skills. Our dataset comprises 10,000 freehand scene vector sketches with per point space-time information by 100 non-expert individuals, offeri…

    Submitted 20 July, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted in ECCV 2022. Project Page: https://fscoco.github.io

  46. arXiv:2202.08548  [pdf, other]

    cs.CY

    From Utility to Capability: A New Paradigm to Conceptualize and Develop Inclusive PETs

    Authors: Partha Das Chowdhury, Andres Dominguez, Kopo M. Ramokapane, Awais Rashid

    Abstract: The wider adoption of PETs has relied on usability studies, which focus mainly on an assessment of how a specified group of users interface, in particular contexts, with the technical properties of a system. While human centred efforts in usability aim to achieve important technical improvements and drive technology adoption, a focus on the usability of PETs alone is not enough. PETs development a…

    Submitted 29 September, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 16 Pages, 2 Figures

    MSC Class: NA ACM Class: K.4

  47. arXiv:2110.08323  [pdf, other]

    cs.LG cs.CL

    On Learning the Transformer Kernel

    Authors: Sankalan Pal Chowdhury, Adamos Solomou, Avinava Dubey, Mrinmaya Sachan

    Abstract: In this work we introduce KERNELIZED TRANSFORMER, a generic, scalable, data driven framework for learning the kernel function in Transformers. Our framework approximates the Transformer kernel as a dot product between spectral feature maps and learns the kernel by learning the spectral distribution. This not only helps in learning a generic kernel end-to-end, but also reduces the time and space co…

    Submitted 21 July, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted to TMLR

  48. arXiv:2110.04998  [pdf, other]

    stat.ML cs.LG math.FA math.PR math.ST stat.ME

    Nonparametric Functional Analysis of Generalized Linear Models Under Nonlinear Constraints

    Authors: K. P. Chowdhury

    Abstract: This article introduces a novel nonparametric methodology for Generalized Linear Models which combines the strengths of the binary regression and latent variable formulations for categorical data, while overcoming their disadvantages. Requiring minimal assumptions, it extends recently published parametric versions of the methodology and generalizes it. If the underlying data generating process is…

    Submitted 11 October, 2021; originally announced October 2021.

  49. arXiv:2108.11636  [pdf, other]

    cs.CV

    SketchLattice: Latticed Representation for Sketch Manipulation

    Authors: Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song

    Abstract: The key challenge in designing a sketch representation lies with handling the abstract and iconic nature of sketches. Existing work predominantly utilizes either, (i) a pixelative format that treats sketches as natural images employing off-the-shelf CNN-based networks, or (ii) an elaborately designed vector format that leverages the structural information of drawing orders using sequential RNN-bas…

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  50. arXiv:2108.11554  [pdf, other]

    cs.CV cs.AI

    XCI-Sketch: Extraction of Color Information from Images for Generation of Colored Outlines and Sketches

    Authors: V Manushree, Sameer Saxena, Parna Chowdhury, Manisimha Varma, Harsh Rathod, Ankita Ghosh, Sahil Khose

    Abstract: Sketches are a medium to convey a visual scene from an individual's creative perspective. The addition of color substantially enhances the overall expressivity of a sketch. This paper proposes two methods to mimic human-drawn colored sketches by utilizing the Contour Drawing Dataset. Our first approach renders colored outline sketches by applying image processing techniques aided by k-means color…

    Submitted 7 January, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: ML for Creativity and Design workshop at NeurIPS 2021