Skip to main content

Showing 1–5 of 5 results for author: Ryou, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2212.07401  [pdf, other

    cs.CV cs.AI

    BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos

    Authors: Jennifer J. Sun, Lili Karashchuk, Amil Dravid, Serim Ryou, Sonia Fereidooni, John Tuthill, Aggelos Katsaggelos, Bingni W. Brunton, Georgia Gkioxari, Ann Kennedy, Yisong Yue, Pietro Perona

    Abstract: Quantifying motion in 3D is important for studying the behavior of humans and other animals, but manual pose annotations are expensive and time-consuming to obtain. Self-supervised keypoint discovery is a promising strategy for estimating 3D poses without annotations. However, current keypoint discovery approaches commonly process single 2D views and do not operate in the 3D space. We propose a ne… ▽ More

    Submitted 2 June, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: CVPR 2023. Project page: https://sites.google.com/view/b-kind/3d Code: https://github.com/neuroethology/BKinD-3D

  2. arXiv:2112.05121  [pdf, other

    cs.CV

    Self-Supervised Keypoint Discovery in Behavioral Videos

    Authors: Jennifer J. Sun, Serim Ryou, Roni Goldshmid, Brandon Weissbourd, John Dabiri, David J. Anderson, Ann Kennedy, Yisong Yue, Pietro Perona

    Abstract: We propose a method for learning the posture and structure of agents from unlabelled behavioral videos. Starting from the observation that behaving agents are generally the main sources of movement in behavioral videos, our method, Behavioral Keypoint Discovery (B-KinD), uses an encoder-decoder architecture with a geometric bottleneck to reconstruct the spatiotemporal difference between video fram… ▽ More

    Submitted 27 April, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: CVPR 2022. Code: https://github.com/neuroethology/BKinD Project page: https://sites.google.com/view/b-kind

  3. arXiv:2109.13423  [pdf, other

    cs.CV

    Weakly Supervised Keypoint Discovery

    Authors: Serim Ryou, Pietro Perona

    Abstract: In this paper, we propose a method for keypoint discovery from a 2D image using image-level supervision. Recent works on unsupervised keypoint discovery reliably discover keypoints of aligned instances. However, when the target instances have high viewpoint or appearance variation, the discovered keypoints do not match the semantic correspondences over different images. Our work aims to discover k… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  4. arXiv:2007.04275  [pdf, other

    cs.LG stat.ML

    Graph Neural Networks for the Prediction of Substrate-Specific Organic Reaction Conditions

    Authors: Serim Ryou, Michael R. Maser, Alexander Y. Cui, Travis J. DeLano, Yisong Yue, Sarah E. Reisman

    Abstract: We present a systematic investigation using graph neural networks (GNNs) to model organic chemical reactions. To do so, we prepared a dataset collection of four ubiquitous reactions from the organic chemistry literature. We evaluate seven different GNN architectures for classification tasks pertaining to the identification of experimental reagents and conditions. We find that models are able to id… ▽ More

    Submitted 9 July, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: 23 pages, 10 tables, 13 figures, to appear in the ICML 2020 Workshop on Graph Representation Learning and Beyond (GRLB)

  5. arXiv:1909.11155  [pdf, other

    cs.CV cs.LG

    Anchor Loss: Modulating Loss Scale based on Prediction Difficulty

    Authors: Serim Ryou, Seong-Gyun Jeong, Pietro Perona

    Abstract: We propose a novel loss function that dynamically rescales the cross entropy based on prediction difficulty regarding a sample. Deep neural network architectures in image classification tasks struggle to disambiguate visually similar objects. Likewise, in human pose estimation symmetric body parts often confuse the network with assigning indiscriminative scores to them. This is due to the output p… ▽ More

    Submitted 24 September, 2019; originally announced September 2019.

    Comments: To appear in Proceedings of IEEE International Conference on Computer Vision (ICCV), 2019. (oral)