Skip to main content

Showing 1–10 of 10 results for author: Guillory, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.08008  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Hidden in plain sight: VLMs overlook their visual representations

    Authors: Stephanie Fu, Tyler Bonnen, Devin Guillory, Trevor Darrell

    Abstract: Language provides a natural interface to specify and evaluate performance on visual tasks. To realize this possibility, vision language models (VLMs) must successfully integrate visual and linguistic information. Our work compares VLMs to a direct readout of their visual encoders to understand their ability to integrate across these modalities. Across a series of vision-centric benchmarks (e.g., d… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Project page: https://hidden-plain-sight.github.io/

  2. arXiv:2306.11180  [pdf, other

    cs.CV cs.AI

    Hyperbolic Active Learning for Semantic Segmentation under Domain Shift

    Authors: Luca Franco, Paolo Mandica, Konstantinos Kallidromitis, Devin Guillory, Yu-Teng Li, Trevor Darrell, Fabio Galasso

    Abstract: We introduce a hyperbolic neural network approach to pixel-level active learning for semantic segmentation. Analysis of the data statistics leads to a novel interpretation of the hyperbolic radius as an indicator of data scarcity. In HALO (Hyperbolic Active Learning Optimization), for the first time, we propose the use of epistemic uncertainty as a data acquisition strategy, following the intuitio… ▽ More

    Submitted 4 June, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: ICML 2024. Project repository: https://github.com/paolomandica/HALO

  3. arXiv:2210.09520  [pdf, other

    cs.CV

    Using Language to Extend to Unseen Domains

    Authors: Lisa Dunlap, Clara Mohri, Devin Guillory, Han Zhang, Trevor Darrell, Joseph E. Gonzalez, Aditi Raghunathan, Anja Rohrbach

    Abstract: It is expensive to collect training data for every possible domain that a vision model may encounter when deployed. We instead consider how simply verbalizing the training domain (e.g. "photos of birds") as well as domains we want to extend to but do not have data for (e.g. "paintings of birds") can improve robustness. Using a multimodal model with a joint image and language embedding space, our m… ▽ More

    Submitted 29 April, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

  4. arXiv:2209.02836  [pdf, other

    cs.CV cs.LG

    Studying Bias in GANs through the Lens of Race

    Authors: Vongani H. Maluleke, Neerja Thakkar, Tim Brooks, Ethan Weber, Trevor Darrell, Alexei A. Efros, Angjoo Kanazawa, Devin Guillory

    Abstract: In this work, we study how the performance and evaluation of generative image models are impacted by the racial composition of their training datasets. By examining and controlling the racial distributions in various training datasets, we are able to observe the impacts of different training distributions on generated image quality and the racial distributions of the generated images. Our results… ▽ More

    Submitted 14 September, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: ECCV 2022. Project Page: https://neerja.me/bias-gans/

    ACM Class: I.4

  5. arXiv:2207.01708  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    Disentangled Action Recognition with Knowledge Bases

    Authors: Zhekun Luo, Shalini Ghosh, Devin Guillory, Keizo Kato, Trevor Darrell, Huijuan Xu

    Abstract: Action in video usually involves the interaction of human with objects. Action labels are typically composed of various combinations of verbs and nouns, but we may not have training data for all possible combinations. In this paper, we aim to improve the generalization ability of the compositional action recognition model to novel verbs or novel nouns that are unseen during training time, by lever… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: NAACL 2022

  6. arXiv:2107.03315  [pdf, other

    cs.LG cs.CV stat.ML

    Predicting with Confidence on Unseen Distributions

    Authors: Devin Guillory, Vaishaal Shankar, Sayna Ebrahimi, Trevor Darrell, Ludwig Schmidt

    Abstract: Recent work has shown that the performance of machine learning models can vary substantially when models are evaluated on data drawn from a distribution that is close to but different from the training distribution. As a result, predicting model performance on unseen distributions is an important challenge. Our work connects techniques from domain adaptation and predictive uncertainty literature,… ▽ More

    Submitted 19 August, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: ICCV Camera ready; new scatter plots in supplementary material

    ACM Class: I.2.10

  7. arXiv:2103.12718  [pdf, other

    cs.CV

    Self-Supervised Pretraining Improves Self-Supervised Pretraining

    Authors: Colorado J. Reed, Xiangyu Yue, Ani Nrusimha, Sayna Ebrahimi, Vivek Vijaykumar, Richard Mao, Bo Li, Shanghang Zhang, Devin Guillory, Sean Metzger, Kurt Keutzer, Trevor Darrell

    Abstract: While self-supervised pretraining has proven beneficial for many computer vision tasks, it requires expensive and lengthy computation, large amounts of data, and is sensitive to data augmentation. Prior work demonstrates that models pretrained on datasets dissimilar to their target data, such as chest X-ray models trained on ImageNet, underperform models trained from scratch. Users that lack the r… ▽ More

    Submitted 24 March, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  8. arXiv:2006.16879  [pdf, ps, other

    cs.CY

    Combating Anti-Blackness in the AI Community

    Authors: Devin Guillory

    Abstract: In response to a national and international awakening on the issues of anti-Blackness and systemic discrimination, we have penned this piece to serve as a resource for allies in the AI community who are wondering how they can more effectively engage with dismantling racist systems. This work aims to help elucidate areas where the AI community actively and passively contributes to anti-Blackness an… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    ACM Class: I.2.0

  9. arXiv:2004.00163  [pdf, other

    cs.CV cs.LG stat.ML

    Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning

    Authors: Zhekun Luo, Devin Guillory, Baifeng Shi, Wei Ke, Fang Wan, Trevor Darrell, Huijuan Xu

    Abstract: Weakly-supervised action localization requires training a model to localize the action segments in the video given only video level action label. It can be solved under the Multiple Instance Learning (MIL) framework, where a bag (video) contains multiple instances (action segments). Since only the bag's label is known, the main challenge is assigning which key instances within the bag to trigger t… ▽ More

    Submitted 25 August, 2020; v1 submitted 31 March, 2020; originally announced April 2020.

    Comments: Accepted at European Conference on Computer Vision (ECCV), 2020

  10. An Ensemble-based Approach to Click-Through Rate Prediction for Promoted Listings at Etsy

    Authors: Kamelia Aryafar, Devin Guillory, Liangjie Hong

    Abstract: Etsy is a global marketplace where people across the world connect to make, buy and sell unique goods. Sellers at Etsy can promote their product listings via advertising campaigns similar to traditional sponsored search ads. Click-Through Rate (CTR) prediction is an integral part of online search advertising systems where it is utilized as an input to auctions which determine the final ranking of… ▽ More

    Submitted 21 November, 2017; v1 submitted 3 November, 2017; originally announced November 2017.