Skip to main content

Showing 1–10 of 10 results for author: Tawari, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.19910  [pdf, other

    cs.CV cs.IR

    CoLLM: A Large Language Model for Composed Image Retrieval

    Authors: Chuong Huynh, Jinyu Yang, Ashish Tawari, Mubarak Shah, Son Tran, Raffay Hamid, Trishul Chilimbi, Abhinav Shrivastava

    Abstract: Composed Image Retrieval (CIR) is a complex task that aims to retrieve images based on a multimodal query. Typical training data consists of triplets containing a reference image, a textual description of desired modifications, and the target image, which are expensive and time-consuming to acquire. The scarcity of CIR datasets has led to zero-shot approaches utilizing synthetic triplets or levera… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: CVPR 2025. Project page: https://collm-cvpr25.github.io/

  2. arXiv:2407.09073  [pdf, other

    cs.CV

    Open Vocabulary Multi-Label Video Classification

    Authors: Rohit Gupta, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan, Ashish Tawari, Son Tran, Mubarak Shah, Benjamin Yao, Trishul Chilimbi

    Abstract: Pre-trained vision-language models (VLMs) have enabled significant progress in open vocabulary computer vision tasks such as image classification, object detection and image segmentation. Some recent works have focused on extending VLMs to open vocabulary single label action classification in videos. However, previous methods fall short in holistic video understanding which requires the ability to… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  3. arXiv:2008.05728  [pdf, ps, other

    cs.CC

    Dynamic Complexity of Expansion

    Authors: Samir Datta, Anuj Tawari, Yadu Vasudev

    Abstract: Dynamic Complexity was introduced by Immerman and Patnaik \cite{PatnaikImmerman97} (see also \cite{DongST95}). It has seen a resurgence of interest in the recent past, see \cite{DattaHK14,ZeumeS15,MunozVZ16,BouyerJ17,Zeume17,DKMSZ18,DMVZ18,BarceloRZ18,DMSVZ19,SchmidtSVZK20,DKMTVZ20} for some representative examples. Use of linear algebra has been a notable feature of some of these papers. We exten… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 29 pages

  4. arXiv:2004.12739  [pdf, other

    cs.LO cs.CC

    Dynamic complexity of Reachability: How many changes can we handle?

    Authors: Samir Datta, Pankaj Kumar, Anish Mukherjee, Anuj Tawari, Nils Vortmeier, Thomas Zeume

    Abstract: In 2015, it was shown that reachability for arbitrary directed graphs can be updated by first-order formulas after inserting or deleting single edges. Later, in 2018, this was extended for changes of size $\frac{\log n}{\log \log n}$, where $n$ is the size of the graph. Changes of polylogarithmic size can be handled when also majority quantifiers may be used. In this paper we extend these result… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

  5. arXiv:2003.06045  [pdf, other

    cs.CV cs.RO

    Interaction Graphs for Object Importance Estimation in On-road Driving Videos

    Authors: Zehua Zhang, Ashish Tawari, Sujitha Martin, David Crandall

    Abstract: A vehicle driving along the road is surrounded by many objects, but only a small subset of them influence the driver's decisions and actions. Learning to estimate the importance of each object on the driver's real-time decision-making may help better understand human driving behavior and lead to more reliable autonomous driving systems. Solving this problem requires models that understand the inte… ▽ More

    Submitted 12 March, 2020; originally announced March 2020.

    Comments: Accepted by ICRA 2020

  6. arXiv:1911.06978  [pdf, other

    cs.CV

    Grounding Human-to-Vehicle Advice for Self-driving Vehicles

    Authors: Jinkyu Kim, Teruhisa Misu, Yi-Ting Chen, Ashish Tawari, John Canny

    Abstract: Recent success suggests that deep neural control networks are likely to be a key component of self-driving vehicles. These networks are trained on large datasets to imitate human actions, but they lack semantic understanding of image contents. This makes them brittle and potentially unsafe in situations that do not match training data. Here, we propose to address this issue by augmenting training… ▽ More

    Submitted 16 November, 2019; originally announced November 2019.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019

  7. arXiv:1909.05152  [pdf, other

    cs.CV cs.HC cs.LG eess.IV

    Context Aware Road-user Importance Estimation (iCARE)

    Authors: Alireza Rahimpour, Sujitha Martin, Ashish Tawari, Hairong Qi

    Abstract: Road-users are a critical part of decision-making for both self-driving cars and driver assistance systems. Some road-users, however, are more important for decision-making than others because of their respective intentions, ego vehicle's intention and their effects on each other. In this paper, we propose a novel architecture for road-user importance estimation which takes advantage of the local… ▽ More

    Submitted 30 August, 2019; originally announced September 2019.

    Comments: Published in: IEEE Intelligent Vehicles (IV), 2019

  8. arXiv:1905.02848  [pdf, other

    cs.CV

    Goal-oriented Object Importance Estimation in On-road Driving Videos

    Authors: Mingfei Gao, Ashish Tawari, Sujitha Martin

    Abstract: We formulate a new problem as Object Importance Estimation (OIE) in on-road driving videos, where the road users are considered as important objects if they have influence on the control decision of the ego-vehicle's driver. The importance of a road user depends on both its visual dynamics, e.g., appearance, motion and location, in the driving scene and the driving goal, \emph{e.g}., the planned p… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

  9. arXiv:1603.02605  [pdf, ps, other

    cs.CC

    Sums of read-once formulas: How many summands suffice?

    Authors: Meena Mahajan, Anuj Tawari

    Abstract: An arithmetic read-once formula (ROF) is a formula (circuit of fan-out 1) over $+,\times$ where each variable labels at most one leaf. Every multilinear polynomial can be expressed as the sum of ROFs. In this work, we prove, for certain multilinear polynomials, a tight lower bound on the number of summands in such an expression.

    Submitted 8 March, 2016; originally announced March 2016.

  10. arXiv:1512.04386  [pdf, ps, other

    cs.CC

    Read-once polynomials: How many summands suffice?

    Authors: Meena Mahajan, Anuj Tawari

    Abstract: An arithmetic read-once formula (ROF) is a formula (circuit of fan-out 1) over $+, \times$ where each variable labels at most one leaf. Every multilinear polynomial can be expressed as the sum of ROFs. In this work, we prove, for certain multilinear polynomials, a tight lower bound on the number of summands in such an expression.

    Submitted 14 December, 2015; originally announced December 2015.

    Comments: 16 pages