
Showing 1–17 of 17 results for author: Pless, R

  1. arXiv:2505.21647  [pdf, ps, other]

    cs.CV cs.LG

    QuARI: Query Adaptive Retrieval Improvement

    Authors: Eric Xing, Abby Stylianou, Robert Pless, Nathan Jacobs

    Abstract: Massive-scale pretraining has made vision-language models increasingly popular for image-to-image and text-to-image retrieval across a broad collection of domains. However, these models do not perform well when used for challenging retrieval tasks, such as instance retrieval in very large-scale image collections. Recent work has shown that linear transformations of VLM features trained for instanc…

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 13 pages, 4 figures, 4 tables

  2. arXiv:2505.20764  [pdf, ps, other]

    cs.CV cs.LG

    ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval

    Authors: Eric Xing, Pranavi Kolouju, Robert Pless, Abby Stylianou, Nathan Jacobs

    Abstract: Composed image retrieval (CIR) is the task of retrieving a target image specified by a query image and a relative text that describes a semantic modification to the query image. Existing methods in CIR struggle to accurately represent the image and the text modification, resulting in subpar performance. To address this limitation, we introduce a CIR framework, ConText-CIR, trained with a Text Conc…

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 15 pages, 8 figures, 6 tables. CVPR 2025

  3. arXiv:2503.17871  [pdf, other]

    cs.CV cs.AI

    good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval

    Authors: Pranavi Kolouju, Eric Xing, Robert Pless, Nathan Jacobs, Abby Stylianou

    Abstract: Composed image retrieval (CIR) enables users to search images using a reference image combined with textual modifications. Recent advances in vision-language models have improved CIR, but dataset limitations remain a barrier. Existing datasets often rely on simplistic, ambiguous, or insufficient manual annotations, hindering fine-grained retrieval. We introduce good4cir, a structured pipeline leve…

    Submitted 22 March, 2025; originally announced March 2025.

  4. arXiv:2208.11693  [pdf, other]

    cs.CR cs.CY

    DP2-Pub: Differentially Private High-Dimensional Data Publication with Invariant Post Randomization

    Authors: Honglu Jiang, Haotian Yu, Xiuzhen Cheng, Jian Pei, Robert Pless, Jiguo Yu

    Abstract: A large amount of high-dimensional and heterogeneous data appear in practical applications, which are often published to third parties for data analysis, recommendations, targeted advertising, and reliable predictions. However, publishing these data may disclose personal sensitive information, resulting in an increasing concern on privacy violations. Privacy-preserving data publishing has received…

    Submitted 24 August, 2022; originally announced August 2022.

  5. arXiv:2201.11307  [pdf, other]

    cs.CV cs.AI cs.IR cs.LG

    Dissecting the impact of different loss functions with gradient surgery

    Authors: Hong Xuan, Robert Pless

    Abstract: Pair-wise loss is an approach to metric learning that learns a semantic embedding by optimizing a loss function that encourages images from the same semantic class to be mapped closer than images from different classes. The literature reports a large and growing set of variations of the pair-wise loss strategies. Here we decompose the gradient of these loss functions into components that relate to…

    Submitted 26 January, 2022; originally announced January 2022.

  6. arXiv:2108.04090  [pdf, other]

    q-bio.QM cs.LG

    Classification and Visualization of Genotype x Phenotype Interactions in Biomass Sorghum

    Authors: Abby Stylianou, Robert Pless, Nadia Shakoor, Todd Mockler

    Abstract: We introduce a simple approach to understanding the relationship between single nucleotide polymorphisms (SNPs), or groups of related SNPs, and the phenotypes they control. The pipeline involves training deep convolutional neural networks (CNNs) to differentiate between images of plants with reference and alternate versions of various SNPs, and then using visualization approaches to highlight what…

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: ICCV 2021 Workshop on Computer Vision Problems in Plant Phenotyping and Agriculture (CVPPA)

  7. arXiv:2105.08649  [pdf, other]

    cs.LG cs.AI cs.IR cs.MM

    DCAP: Deep Cross Attentional Product Network for User Response Prediction

    Authors: Zekai Chen, Fangtian Zhong, Zhumin Chen, Xiao Zhang, Robert Pless, Xiuzhen Cheng

    Abstract: User response prediction, which aims to predict the probability that a user will provide a predefined positive response in a given context such as clicking on an ad or purchasing an item, is crucial to many industrial applications such as online advertising, recommender systems, and search ranking. However, due to the high dimensionality and super sparsity of the data collected in these tasks, han…

    Submitted 22 August, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: 10 pages, 7 figures, Accepted by CIKM'21

  8. arXiv:2007.12749  [pdf, other]

    cs.CV cs.LG stat.ML

    Hard negative examples are hard, but useful

    Authors: Hong Xuan, Abby Stylianou, Xiaotong Liu, Robert Pless

    Abstract: Triplet loss is an extremely common approach to distance metric learning. Representations of images from the same class are optimized to be mapped closer together in an embedding space than representations of images from different classes. Much work on triplet losses focuses on selecting the most useful triplets of images to consider, with strategies that select dissimilar examples from the same c…

    Submitted 25 February, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: CV, Triplet loss, Image embedding, 14 pages, 9 figures, ECCV 2020

  9. arXiv:1910.03455  [pdf, other]

    cs.CV cs.AI cs.LG

    TraffickCam: Explainable Image Matching For Sex Trafficking Investigations

    Authors: Abby Stylianou, Richard Souvenir, Robert Pless

    Abstract: Investigations of sex trafficking sometimes have access to photographs of victims in hotel rooms. These images directly link victims to places, which can help verify where victims have been trafficked or where traffickers might operate in the future. Current machine learning approaches give promising results in image search to find the matching hotel. This paper explores approaches to make this en…

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: Presented at AAAI FSS-19: Artificial Intelligence in Government and Public Sector, Arlington, Virginia, USA

  10. arXiv:1909.07499  [pdf, other]

    cs.CV

    Learning Geo-Temporal Image Features

    Authors: Menghua Zhai, Tawfiq Salem, Connor Greenwell, Scott Workman, Robert Pless, Nathan Jacobs

    Abstract: We propose to implicitly learn to extract geo-temporal image features, which are mid-level features related to when and where an image was captured, by explicitly optimizing for a set of location and time estimation tasks. To train our method, we take advantage of a large image dataset, captured by outdoor webcams and cell phones. The only form of supervision we provide are the known capture time…

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: British Machine Vision Conference (BMVC) 2018

  11. arXiv:1909.07464  [pdf, other]

    cs.LG stat.ML

    Visualizing How Embeddings Generalize

    Authors: Xiaotong Liu, Hong Xuan, Zeyu Zhang, Abby Stylianou, Robert Pless

    Abstract: Deep metric learning is often used to learn an embedding function that captures the semantic differences within a dataset. A key factor in many problem domains is how this embedding generalizes to new classes of data. In observing many triplet selection strategies for Metric Learning, we find that the best performance consistently arises from approaches that focus on a few, well selected triplets.…

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: 8 pages, 4 figures, published in ICML workshop: Understanding and Improving Generalization in Deep Learning

  12. arXiv:1904.04370  [pdf, other]

    cs.CV cs.LG

    Improved Embeddings with Easy Positive Triplet Mining

    Authors: Hong Xuan, Abby Stylianou, Robert Pless

    Abstract: Deep metric learning seeks to define an embedding where semantically similar images are embedded to nearby locations, and semantically dissimilar images are embedded to distant locations. Substantial work has focused on loss functions and strategies to learn these embeddings by pushing images from the same class as close together in the embedding space as possible. In this paper, we propose an alt…

    Submitted 18 March, 2020; v1 submitted 8 April, 2019; originally announced April 2019.

  13. arXiv:1901.11397  [pdf, other]

    cs.CV cs.LG stat.ML

    Hotels-50K: A Global Hotel Recognition Dataset

    Authors: Abby Stylianou, Hong Xuan, Maya Shende, Jonathan Brandt, Richard Souvenir, Robert Pless

    Abstract: Recognizing a hotel from an image of a hotel room is important for human trafficking investigations. Images directly link victims to places and can help verify where victims have been trafficked, and where their traffickers might move them or others in the future. Recognizing the hotel from images is challenging because of low image quality, uncommon camera perspectives, large occlusions (often th…

    Submitted 26 January, 2019; originally announced January 2019.

  14. arXiv:1901.00536  [pdf, other]

    cs.CV cs.LG

    Visualizing Deep Similarity Networks

    Authors: Abby Stylianou, Richard Souvenir, Robert Pless

    Abstract: For convolutional neural network models that optimize an image embedding, we propose a method to highlight the regions of images that contribute most to pairwise similarity. This work is a corollary to the visualization tools developed for classification networks, but applicable to the problem domains better suited to similarity learning. The visualization shows how similarity networks that are fi…

    Submitted 2 January, 2019; originally announced January 2019.

  15. arXiv:1808.04469  [pdf, other]

    cs.CV

    Deep Randomized Ensembles for Metric Learning

    Authors: Hong Xuan, Richard Souvenir, Robert Pless

    Abstract: Learning embedding functions, which map semantically related inputs to nearby locations in a feature space supports a variety of classification and information retrieval tasks. In this work, we propose a novel, generalizable and fast method to define a family of embedding functions that can be used as an ensemble to give improved results. Each embedding function is learned by randomly bagging the…

    Submitted 4 September, 2018; v1 submitted 13 August, 2018; originally announced August 2018.

    Comments: ECCV 2018

  16. arXiv:1611.05507  [pdf, other]

    cs.CV

    Deep Feature Interpolation for Image Content Changes

    Authors: Paul Upchurch, Jacob Gardner, Geoff Pleiss, Robert Pless, Noah Snavely, Kavita Bala, Kilian Weinberger

    Abstract: We propose Deep Feature Interpolation (DFI), a new data-driven baseline for automatic high-resolution image transformation. As the name suggests, it relies only on simple linear interpolation of deep convolutional features from pre-trained convnets. We show that despite its simplicity, DFI can perform high-level semantic transformations like "make older/younger", "make bespectacled", "add smile",…

    Submitted 19 June, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

    Comments: First two authors contributed equally. Accepted by CVPR 2017. Code at https://github.com/paulu/deepfeatinterp

  17. arXiv:1304.4112  [pdf, other]

    cs.CV

    Shadow Estimation Method for "The Episolar Constraint: Monocular Shape from Shadow Correspondence"

    Authors: Austin Abrams, Chris Hawley, Kylia Miskell, Adina Stoica, Nathan Jacobs, Robert Pless

    Abstract: Recovering shadows is an important step for many vision algorithms. Current approaches that work with time-lapse sequences are limited to simple thresholding heuristics. We show these approaches only work with very careful tuning of parameters, and do not work well for long-term time-lapse sequences taken over the span of many months. We introduce a parameter-free expectation maximization approach…

    Submitted 15 April, 2013; originally announced April 2013.