Skip to main content

Showing 1–8 of 8 results for author: Kislyuk, D

.
  1. arXiv:2505.21454  [pdf, ps, other

    cs.CV

    Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations

    Authors: Yue Li Du, Ben Alexander, Mikhail Antonenka, Rohan Mahadev, Hao-yu Wu, Dmitry Kislyuk

    Abstract: Retrieving semantically similar but visually distinct contents has been a critical capability in visual search systems. In this work, we aim to tackle this problem with Visual Product Graph (VPG), leveraging high-performance infrastructure for storage and state-of-the-art computer vision models for image understanding. VPG is built to be an online real-time retrieval system that enables navigation… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 10 pages, 10 figures

  2. arXiv:2401.12244  [pdf, other

    cs.CV cs.AI cs.LG

    Large-scale Reinforcement Learning for Diffusion Models

    Authors: Yinan Zhang, Eric Tzeng, Yilun Du, Dmitry Kislyuk

    Abstract: Text-to-image diffusion models are a class of deep generative models that have demonstrated an impressive capacity for high-quality image generation. However, these models are susceptible to implicit biases that arise from web-scale text-image training pairs and may inaccurately model aspects of images we care about. This can result in suboptimal samples, model bias, and images that do not align w… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  3. arXiv:2108.05887  [pdf, other

    cs.CV cs.AI cs.LG

    Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual Representations

    Authors: Josh Beal, Hao-Yu Wu, Dong Huk Park, Andrew Zhai, Dmitry Kislyuk

    Abstract: Large-scale pretraining of visual representations has led to state-of-the-art performance on a range of benchmark computer vision tasks, yet the benefits of these techniques at extreme scale in complex production systems has been relatively unexplored. We consider the case of a popular visual discovery product, where these representations are trained with multi-task learning, from use-case specifi… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: Accepted by WACV 2022

  4. arXiv:2012.09958  [pdf, other

    cs.CV cs.AI cs.LG

    Toward Transformer-Based Object Detection

    Authors: Josh Beal, Eric Kim, Eric Tzeng, Dong Huk Park, Andrew Zhai, Dmitry Kislyuk

    Abstract: Transformers have become the dominant model in natural language processing, owing to their ability to pretrain on massive amounts of data, then transfer to smaller, more specific tasks via fine-tuning. The Vision Transformer was the first major attempt to apply a pure transformer model directly to images as input, demonstrating that as compared to convolutional networks, transformer-based architec… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

  5. arXiv:1702.07969  [pdf, other

    cs.IR

    Related Pins at Pinterest: The Evolution of a Real-World Recommender System

    Authors: David C. Liu, Stephanie Rogers, Raymond Shiau, Dmitry Kislyuk, Kevin C. Ma, Zhigang Zhong, Jenny Liu, Yushi Jing

    Abstract: Related Pins is the Web-scale recommender system that powers over 40% of user engagement on Pinterest. This paper is a longitudinal study of three years of its development, exploring the evolution of the system and its components from prototypes to present state. Each component was originally built with many constraints on engineering effort and computational resources, so we prioritized the simpl… ▽ More

    Submitted 25 February, 2017; originally announced February 2017.

  6. arXiv:1702.04680  [pdf, other

    cs.CV

    Visual Discovery at Pinterest

    Authors: Andrew Zhai, Dmitry Kislyuk, Yushi Jing, Michael Feng, Eric Tzeng, Jeff Donahue, Yue Li Du, Trevor Darrell

    Abstract: Over the past three years Pinterest has experimented with several visual search and recommendation services, including Related Pins (2014), Similar Looks (2015), Flashlight (2016) and Lens (2017). This paper presents an overview of our visual discovery engine powering these services, and shares the rationales behind our technical and product decisions such as the use of object detection and intera… ▽ More

    Submitted 25 March, 2017; v1 submitted 15 February, 2017; originally announced February 2017.

  7. arXiv:1511.04003  [pdf, other

    cs.CV

    Human Curation and Convnets: Powering Item-to-Item Recommendations on Pinterest

    Authors: Dmitry Kislyuk, Yuchen Liu, David Liu, Eric Tzeng, Yushi Jing

    Abstract: This paper presents Pinterest Related Pins, an item-to-item recommendation system that combines collaborative filtering with content-based ranking. We demonstrate that signals derived from user curation, the activity of users organizing content, are highly effective when used in conjunction with content-based ranking. This paper also demonstrates the effectiveness of visual features, such as image… ▽ More

    Submitted 12 November, 2015; originally announced November 2015.

  8. arXiv:1505.07647  [pdf, other

    cs.CV

    Visual Search at Pinterest

    Authors: Yushi Jing, David Liu, Dmitry Kislyuk, Andrew Zhai, Jiajing Xu, Jeff Donahue, Sarah Tavel

    Abstract: We demonstrate that, with the availability of distributed computation platforms such as Amazon Web Services and open-source tools, it is possible for a small engineering team to build, launch and maintain a cost-effective, large-scale visual search system with widely available tools. We also demonstrate, through a comprehensive set of live experiments at Pinterest, that content recommendation powe… ▽ More

    Submitted 8 March, 2017; v1 submitted 28 May, 2015; originally announced May 2015.

    Comments: in Proceedings of the 21th ACM SIGKDD International Conference on Knowledge and Discovery and Data Mining, 2015