Skip to main content

Showing 1–14 of 14 results for author: RoyChowdhury, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.23926  [pdf, ps, other

    cs.CV

    Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts

    Authors: Xuweiyi Chen, Wentao Zhou, Aruni RoyChowdhury, Zezhou Cheng

    Abstract: While scaling laws have transformed natural language processing and computer vision, 3D point cloud understanding has yet to reach that stage. This can be attributed to both the comparatively smaller scale of 3D datasets, as well as the disparate sources of the data itself. Point clouds are captured by diverse sensors (e.g., depth cameras, LiDAR) across varied domains (e.g., indoor, outdoor), each… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Project page: https://uva-computer-vision-lab.github.io/point-moe/

  2. arXiv:2408.04870  [pdf, other

    cs.CR cs.AI

    ConfusedPilot: Confused Deputy Risks in RAG-based LLMs

    Authors: Ayush RoyChowdhury, Mulong Luo, Prateek Sahu, Sarbartha Banerjee, Mohit Tiwari

    Abstract: Retrieval augmented generation (RAG) is a process where a large language model (LLM) retrieves useful information from a database and then generates the responses. It is becoming popular in enterprise settings for daily business operations. For example, Copilot for Microsoft 365 has accumulated millions of businesses. However, the security implications of adopting such RAG-based systems are unclea… ▽ More

    Submitted 23 October, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  3. arXiv:2307.07929  [pdf, other

    cs.CV

    DocTr: Document Transformer for Structured Information Extraction in Documents

    Authors: Haofu Liao, Aruni RoyChowdhury, Weijian Li, Ankan Bansal, Yuting Zhang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan

    Abstract: We present a new formulation for structured information extraction (SIE) from visually rich documents. It aims to address the limitations of existing IOB tagging or graph-based formulations, which are either overly reliant on the correct ordering of input text or struggle with decoding a complex graph. Instead, motivated by anchor-based object detectors in vision, we represent an entity as an anch… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

  4. arXiv:2112.13942  [pdf, other

    cs.CV

    PriFit: Learning to Fit Primitives Improves Few Shot Point Cloud Segmentation

    Authors: Gopal Sharma, Bidya Dash, Aruni RoyChowdhury, Matheus Gadelha, Marios Loizou, Liangliang Cao, Rui Wang, Erik Learned-Miller, Subhransu Maji, Evangelos Kalogerakis

    Abstract: We present PriFit, a semi-supervised approach for label-efficient learning of 3D point cloud segmentation networks. PriFit combines geometric primitive fitting with point-based representation learning. Its key idea is to learn point representations whose clustering reveals shape regions that can be approximated well by basic geometric primitives, such as cuboids and ellipsoids. The learned point r… ▽ More

    Submitted 23 June, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

  5. arXiv:2007.06995  [pdf, other

    cs.CV

    Improving Face Recognition by Clustering Unlabeled Faces in the Wild

    Authors: Aruni RoyChowdhury, Xiang Yu, Kihyuk Sohn, Erik Learned-Miller, Manmohan Chandraker

    Abstract: While deep face recognition has benefited significantly from large-scale labeled data, current research is focused on leveraging unlabeled data to further boost performance, reducing the cost of human annotation. Prior work has mostly been in controlled settings, where the labeled and unlabeled data sets have no overlapping identities by construction. This is not realistic in large-scale face reco… ▽ More

    Submitted 15 July, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: ECCV 2020

  6. arXiv:2003.13834  [pdf, other

    cs.CV cs.GR cs.LG

    Label-Efficient Learning on Point Clouds using Approximate Convex Decompositions

    Authors: Matheus Gadelha, Aruni RoyChowdhury, Gopal Sharma, Evangelos Kalogerakis, Liangliang Cao, Erik Learned-Miller, Rui Wang, Subhransu Maji

    Abstract: The problems of shape classification and part segmentation from 3D point clouds have garnered increasing attention in the last few years. Both of these problems, however, suffer from relatively small training sets, creating the need for statistically efficient methods to learn 3D shape representations. In this paper, we investigate the use of Approximate Convex Decompositions (ACD) as a self-super… ▽ More

    Submitted 4 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: First two authors had equal contribution. ECCV'20 version. 19 pages, 5 figures

    Journal ref: 16th European Conference on Computer Vision (ECCV 2020)

  7. arXiv:1904.07305  [pdf, other

    cs.CV cs.LG

    Automatic adaptation of object detectors to new domains using self-training

    Authors: Aruni RoyChowdhury, Prithvijit Chakrabarty, Ashish Singh, SouYoung Jin, Huaizu Jiang, Liangliang Cao, Erik Learned-Miller

    Abstract: This work addresses the unsupervised adaptation of an existing object detector to a new target domain. We assume that a large number of unlabeled videos from this domain are readily available. We automatically obtain labels on the target data by using high-confidence detections from the existing detector, augmented with hard (misclassified) examples acquired by exploiting temporal cues using a tra… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Accepted at CVPR 2019

  8. arXiv:1902.01293  [pdf, other

    cs.CV cs.RO

    Real-time Prediction of Automotive Collision Risk from Monocular Video

    Authors: Derek J. Phillips, Juan Carlos Aragon, Anjali Roychowdhury, Regina Madigan, Sunil Chintakindi, Mykel J. Kochenderfer

    Abstract: Many automotive applications, such as Advanced Driver Assistance Systems (ADAS) for collision avoidance and warnings, require estimating the future automotive risk of a driving scene. We present a low-cost system that predicts the collision risk over an intermediate time horizon from a monocular video source, such as a dashboard-mounted camera. The modular system includes components for object det… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

    Comments: Submitted to IV2019. 7 pages, 4 figures, 3 tables

  9. arXiv:1808.04285  [pdf, other

    cs.CV

    Unsupervised Hard Example Mining from Videos for Improved Object Detection

    Authors: SouYoung Jin, Aruni RoyChowdhury, Huaizu Jiang, Ashish Singh, Aditya Prasad, Deep Chakraborty, Erik Learned-Miller

    Abstract: Important gains have recently been obtained in object detection by using training objectives that focus on {\em hard negative} examples, i.e., negative examples that are currently rated as positive or ambiguous by the detector. These examples can strongly influence parameters when the network is trained to correct them. Unfortunately, they are often sparse in the training data, and are expensive t… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

    Comments: 14 pages, 7 figures, accepted at ECCV 2018

  10. arXiv:1711.02159  [pdf, other

    cs.LG cs.AI stat.ML

    Adaptive Bayesian Sampling with Monte Carlo EM

    Authors: Anirban Roychowdhury, Srinivasan Parthasarathy

    Abstract: We present a novel technique for learning the mass matrices in samplers obtained from discretized dynamics that preserve some energy function. Existing adaptive samplers use Riemannian preconditioning techniques, where the mass matrices are functions of the parameters being sampled. This leads to significant complexities in the energy reformulations and resultant dynamics, often leading to implici… ▽ More

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: In Proc. 30th Advances in Neural Information Processing Systems (NIPS), 2017 (to appear)

  11. arXiv:1704.01700  [pdf, other

    math.OC cs.LG math.DG stat.ML

    Accelerated Stochastic Quasi-Newton Optimization on Riemann Manifolds

    Authors: Anirban Roychowdhury

    Abstract: We propose an L-BFGS optimization algorithm on Riemannian manifolds using minibatched stochastic variance reduction techniques for fast convergence with constant step sizes, without resorting to linesearch methods designed to satisfy Wolfe conditions. We provide a new convergence proof for strongly convex functions without using curvature conditions on the manifold, as well as a convergence discus… ▽ More

    Submitted 22 May, 2017; v1 submitted 5 April, 2017; originally announced April 2017.

  12. arXiv:1506.01342  [pdf, other

    cs.CV

    One-to-many face recognition with bilinear CNNs

    Authors: Aruni RoyChowdhury, Tsung-Yu Lin, Subhransu Maji, Erik Learned-Miller

    Abstract: The recent explosive growth in convolutional neural network (CNN) research has produced a variety of new architectures for deep learning. One intriguing new architecture is the bilinear CNN (B-CNN), which has shown dramatic performance gains on certain fine-grained recognition problems [15]. We apply this new CNN to the challenging new face recognition benchmark, the IARPA Janus Benchmark A (IJB-A… ▽ More

    Submitted 28 March, 2016; v1 submitted 3 June, 2015; originally announced June 2015.

    Comments: Published version at WACV 2016

  13. arXiv:1504.07889  [pdf, other

    cs.CV

    Bilinear CNNs for Fine-grained Visual Recognition

    Authors: Tsung-Yu Lin, Aruni RoyChowdhury, Subhransu Maji

    Abstract: We present a simple and effective architecture for fine-grained visual recognition called Bilinear Convolutional Neural Networks (B-CNNs). These networks represent an image as a pooled outer product of features derived from two CNNs and capture localized feature interactions in a translationally invariant manner. B-CNNs belong to the class of orderless texture representations but unlike prior work… ▽ More

    Submitted 1 June, 2017; v1 submitted 29 April, 2015; originally announced April 2015.

  14. arXiv:1410.1068  [pdf, other

    stat.ML cs.AI cs.LG

    Gamma Processes, Stick-Breaking, and Variational Inference

    Authors: Anirban Roychowdhury, Brian Kulis

    Abstract: While most Bayesian nonparametric models in machine learning have focused on the Dirichlet process, the beta process, or their variants, the gamma process has recently emerged as a useful nonparametric prior in its own right. Current inference schemes for models involving the gamma process are restricted to MCMC-based methods, which limits their scalability. In this paper, we present a variational… ▽ More

    Submitted 4 October, 2014; originally announced October 2014.