Showing 1–38 of 38 results for author: Moon, G

  1. arXiv:2506.07460  [pdf, ps, other]

    cs.CV cs.CL

    GLOS: Sign Language Generation with Temporally Aligned Gloss-Level Conditioning

    Authors: Taeryung Lee, Hyeongjin Nam, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Sign language generation (SLG), or text-to-sign generation, bridges the gap between signers and non-signers. Despite recent progress in SLG, existing methods still often suffer from incorrect lexical ordering and low semantic accuracy. This is primarily due to sentence-level condition, which encodes the entire sentence of the input text into a single feature vector as a condition for SLG. This app…

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2407.21686  [pdf, other]

    cs.CV

    Expressive Whole-Body 3D Gaussian Avatar

    Authors: Gyeongsik Moon, Takaaki Shiratori, Shunsuke Saito

    Abstract: Facial expression and hand motions are necessary to express our emotions and interact with the world. Nevertheless, most of the 3D human avatars modeled from a casually captured video only support body motions without facial expressions and hand motions. In this work, we present ExAvatar, an expressive whole-body 3D human avatar learned from a short monocular video. We design ExAvatar as a combinat…

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024. Project page: https://mks0601.github.io/ExAvatar/

  3. arXiv:2405.07933  [pdf, other]

    cs.CV

    Authentic Hand Avatar from a Phone Scan via Universal Hand Model

    Authors: Gyeongsik Moon, Weipeng Xu, Rohan Joshi, Chenglei Wu, Takaaki Shiratori

    Abstract: The authentic 3D hand avatar with every identifiable information, such as hand shapes and textures, is necessary for immersive experiences in AR/VR. In this paper, we present a universal hand model (UHM), which 1) can universally represent high-fidelity 3D hand meshes of arbitrary identities (IDs) and 2) can be adapted to each person with a short phone scan for the authentic hand avatar. For effec…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2024

  4. arXiv:2404.04819  [pdf, other]

    cs.CV

    Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer

    Authors: Hyeongjin Nam, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Human-object contact serves as a strong cue to understand how humans physically interact with objects. Nevertheless, it is not widely explored to utilize human-object contact information for the joint reconstruction of 3D human and object from a single image. In this work, we present a novel joint 3D human-object reconstruction method (CONTHO) that effectively exploits contact information between…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Published at CVPR 2024, 19 pages including the supplementary material

  5. arXiv:2401.05334  [pdf, other]

    cs.CV cs.GR

    URHand: Universal Relightable Hands

    Authors: Zhaoxi Chen, Gyeongsik Moon, Kaiwen Guo, Chen Cao, Stanislav Pidhorskyi, Tomas Simon, Rohan Joshi, Yuan Dong, Yichen Xu, Bernardo Pires, He Wen, Lucas Evans, Bo Peng, Julia Buffalini, Autumn Trimble, Kevyn McPhail, Melissa Schoeller, Shoou-I Yu, Javier Romero, Michael Zollhöfer, Yaser Sheikh, Ziwei Liu, Shunsuke Saito

    Abstract: Existing photorealistic relightable hand models require extensive identity-specific observations in different views, poses, and illuminations, and face challenges in generalizing to natural illuminations and novel identities. To bridge this gap, we present URHand, the first universal relightable hand model that generalizes across viewpoints, poses, illuminations, and identities. Our model allows f…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Project Page https://frozenburning.github.io/projects/urhand/

  6. arXiv:2310.17768  [pdf, other]

    cs.CV

    A Dataset of Relighted 3D Interacting Hands

    Authors: Gyeongsik Moon, Shunsuke Saito, Weipeng Xu, Rohan Joshi, Julia Buffalini, Harley Bellan, Nicholas Rosen, Jesse Richardson, Mallorie Mize, Philippe de Bree, Tomas Simon, Bo Peng, Shubham Garg, Kevyn McPhail, Takaaki Shiratori

    Abstract: The two-hand interaction is one of the most challenging signals to analyze due to the self-similarity, complicated articulations, and occlusions of hands. Although several datasets have been proposed for the two-hand interaction analysis, all of them do not achieve 1) diverse and realistic image appearances and 2) diverse and large-scale groundtruth (GT) 3D poses at the same time. In this work, we…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023 (Datasets and Benchmarks Track)

  7. arXiv:2309.12578  [pdf, other]

    cs.LG cs.DC

    SPION: Layer-Wise Sparse Training of Transformer via Convolutional Flood Filling

    Authors: Bokyeong Yoon, Yoonsang Han, Gordon Euhyun Moon

    Abstract: Sparsifying the Transformer has garnered considerable interest, as training the Transformer is very computationally demanding. Prior efforts to sparsify the Transformer have either used a fixed pattern or data-driven approach to reduce the number of operations involving the computation of multi-head attention, which is the main bottleneck of the Transformer. However, existing methods suffer from i…

    Submitted 21 September, 2023; originally announced September 2023.

  8. arXiv:2309.01943  [pdf, other]

    cs.CV

    Extract-and-Adaptation Network for 3D Interacting Hand Mesh Recovery

    Authors: JoonKyu Park, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Understanding how two hands interact with each other is a key component of accurate 3D interacting hand mesh recovery. However, recent Transformer-based methods struggle to learn the interaction between two hands as they directly utilize two hand features as input tokens, which results in distant token problem. The distant token problem represents that input tokens are in heterogeneous spaces, lea…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted at ICCVW 2023

  9. arXiv:2308.15944  [pdf, other]

    cs.SE

    WUDI: A Human Involved Self-Adaptive Framework to Prevent Childhood Obesity in Internet of Things Environment

    Authors: Euijong Lee, Jaemin Jung, Gee-Myung Moon, Seong-Whan Lee, Ji-Hoon Jeong

    Abstract: The Internet of Things (IoT) connects people, devices, and information resources, in various domains to improve efficiency. The healthcare domain has been transformed by the integration of the IoT, leading to the development of digital healthcare solutions such as health monitoring, emergency detection, and remote operation. This integration has led to an increase in the health data collected from…

    Submitted 30 August, 2023; originally announced August 2023.

  10. Relaxed Local Correctability from Local Testing

    Authors: Vinayak M. Kumar, Geoffrey Mon

    Abstract: We construct the first asymptotically good relaxed locally correctable codes with polylogarithmic query complexity, bringing the upper bound polynomially close to the lower bound of Gur and Lachish (SICOMP 2021). Our result follows from showing that a high-rate locally testable code can boost the block length of a smaller relaxed locally correctable code, while preserving the correcting radius and…

    Submitted 1 April, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 15 pages. Improved exposition, changed notation; to appear in STOC 2024

  11. arXiv:2304.04875  [pdf, other]

    cs.CV

    Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild

    Authors: Gyeongsik Moon, Hongsuk Choi, Sanghyuk Chun, Jiyoung Lee, Sangdoo Yun

    Abstract: Recovering 3D human mesh in the wild is greatly challenging as in-the-wild (ITW) datasets provide only 2D pose ground truths (GTs). Recently, 3D pseudo-GTs have been widely used to train 3D human mesh estimation networks as the 3D pseudo-GTs enable 3D mesh supervision when training the networks on ITW datasets. However, despite the great potential of the 3D pseudo-GTs, there has been no extensive…

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Published at CVPRW 2023

  12. arXiv:2303.15417  [pdf, other]

    cs.CV

    Recovering 3D Hand Mesh Sequence from a Single Blurry Image: A New Dataset and Temporal Unfolding

    Authors: Yeonguk Oh, JoonKyu Park, Jaeha Kim, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Hands, one of the most dynamic parts of our body, suffer from blur due to their active movements. However, previous 3D hand mesh recovery methods have mainly focused on sharp hand images rather than considering blur due to the absence of datasets providing blurry hand images. We first present a novel dataset BlurHand, which contains blurry hand images with 3D groundtruths. The BlurHand is construc…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  13. arXiv:2303.13652  [pdf, other]

    cs.CV

    Bringing Inputs to Shared Domains for 3D Interacting Hands Recovery in the Wild

    Authors: Gyeongsik Moon

    Abstract: Despite recent achievements, existing 3D interacting hands recovery methods have shown results mainly on motion capture (MoCap) environments, not on in-the-wild (ITW) ones. This is because collecting 3D interacting hands data in the wild is extremely challenging, even for the 2D data. We present InterWild, which brings MoCap and ITW samples to shared domains for robust 3D interacting hands recover…

    Submitted 20 October, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Published at CVPR 2023

  14. arXiv:2303.05370  [pdf, other]

    cs.CV

    Rethinking Self-Supervised Visual Representation Learning in Pre-training for 3D Human Pose and Shape Estimation

    Authors: Hongsuk Choi, Hyeongjin Nam, Taeryung Lee, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Recently, a few self-supervised representation learning (SSL) methods have outperformed the ImageNet classification pre-training for vision tasks such as object detection. However, its effects on 3D human body pose and shape estimation (3DHPSE) are open to question, whose target is fixed to a unique class, the human, and has an inherent task gap with SSL. We empirically study and analyze the effec…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023, 18 pages including the appendix

  15. arXiv:2212.05897  [pdf, other]

    cs.CV

    MultiAct: Long-Term 3D Human Motion Generation from Multiple Action Labels

    Authors: Taeryung Lee, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: We tackle the problem of generating long-term 3D human motion from multiple action labels. Two main previous approaches, such as action- and motion-conditioned methods, have limitations to solve this problem. The action-conditioned methods generate a sequence of motion from a single action. Hence, it cannot generate long-term motions composed of multiple actions and transitions between actions. Me…

    Submitted 17 February, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: AAAI 2023 (Oral presentation)

  16. arXiv:2210.00627  [pdf, other]

    cs.CV

    MonoNHR: Monocular Neural Human Renderer

    Authors: Hongsuk Choi, Gyeongsik Moon, Matthieu Armando, Vincent Leroy, Kyoung Mu Lee, Gregory Rogez

    Abstract: Existing neural human rendering methods struggle with a single image input due to the lack of information in invisible areas and the depth ambiguity of pixels in visible areas. In this regard, we propose Monocular Neural Human Renderer (MonoNHR), a novel approach that renders robust free-viewpoint images of an arbitrary human given only a single image. MonoNHR is the first method that (i) renders…

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: Hongsuk Choi and Gyeongsik Moon contributed equally, 15 pages including the reference and supplementary material

  17. arXiv:2207.10053  [pdf, other]

    cs.CV

    3D Clothed Human Reconstruction in the Wild

    Authors: Gyeongsik Moon, Hyeongjin Nam, Takaaki Shiratori, Kyoung Mu Lee

    Abstract: Although much progress has been made in 3D clothed human reconstruction, most of the existing methods fail to produce robust results from in-the-wild images, which contain diverse human poses and appearances. This is mainly due to the large domain gap between training datasets and in-the-wild datasets. The training datasets are usually synthetic ones, which contain rendered images from GT 3D scans…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022, 25 pages including the supplementary material

  18. arXiv:2203.14564  [pdf, other]

    cs.CV

    HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network

    Authors: JoonKyu Park, Yeonguk Oh, Gyeongsik Moon, Hongsuk Choi, Kyoung Mu Lee

    Abstract: Hands are often severely occluded by objects, which makes 3D hand mesh estimation challenging. Previous works often have disregarded information at occluded regions. However, we argue that occluded regions have strong correlations with hands so that they can provide highly beneficial information for complete 3D hand mesh estimation. Thus, in this work, we propose a novel 3D hand mesh estimation ne…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: also attached the supplementary material

    Journal ref: Computer Vision and Pattern Recognition (CVPR), 2022

  19. arXiv:2203.04738  [pdf, other]

    cs.CV cs.DC cs.LG

    Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences

    Authors: Gordon Euhyun Moon, Eric C. Cyr

    Abstract: Parallelizing Gated Recurrent Unit (GRU) networks is a challenging task, as the training procedure of GRU is inherently sequential. Prior efforts to parallelize GRU have largely focused on conventional parallelization strategies such as data-parallel and model-parallel training algorithms. However, when the given sequences are very long, existing approaches are still inevitably performance limited…

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Accepted at ICLR 2022

  20. arXiv:2106.10499  [pdf, other]

    cs.DC cs.AI cs.AR

    Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication

    Authors: Gordon E. Moon, Hyoukjun Kwon, Geonhwa Jeong, Prasanth Chatarasi, Sivasankaran Rajamanickam, Tushar Krishna

    Abstract: There is a growing interest in custom spatial accelerators for machine learning applications. These accelerators employ a spatial array of processing elements (PEs) interacting via custom buffer hierarchies and networks-on-chip. The efficiency of these accelerators comes from employing optimized dataflow (i.e., spatial/temporal partitioning of data across the PEs and fine-grained scheduling) strat…

    Submitted 19 June, 2021; originally announced June 2021.

  21. arXiv:2104.07300  [pdf, other]

    cs.CV

    Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes

    Authors: Hongsuk Choi, Gyeongsik Moon, JoonKyu Park, Kyoung Mu Lee

    Abstract: We consider the problem of recovering a single person's 3D human mesh from in-the-wild crowded scenes. While much progress has been made in 3D human mesh estimation, existing methods struggle when test input has crowded scenes. The first reason for the failure is a domain gap between training and testing data. A motion capture dataset, which provides accurate 3D labels for training, lacks crowd data an…

    Submitted 18 September, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2022, 16 pages including the supplementary material

  22. arXiv:2103.10452  [pdf]

    cs.DC

    Extending Sparse Tensor Accelerators to Support Multiple Compression Formats

    Authors: Eric Qin, Geonhwa Jeong, William Won, Sheng-Chun Kao, Hyoukjun Kwon, Sudarshan Srinivasan, Dipankar Das, Gordon E. Moon, Sivasankaran Rajamanickam, Tushar Krishna

    Abstract: Sparsity, which occurs in both scientific applications and Deep Learning (DL) models, has been a key target of optimization within recent ASIC accelerators due to the potential memory and compute savings. These applications use data stored in a variety of compression formats. We demonstrate that both the compactness of different compression formats and the compute efficiency of the algorithms enab…

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: Accepted for publication at the 35th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2021)

  23. arXiv:2011.11534  [pdf, other]

    cs.CV

    Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation

    Authors: Gyeongsik Moon, Hongsuk Choi, Kyoung Mu Lee

    Abstract: Whole-body 3D human mesh estimation aims to reconstruct the 3D human body, hands, and face simultaneously. Although several methods have been proposed, accurate prediction of 3D hands, which consist of 3D wrist and fingers, still remains challenging due to two reasons. First, the human kinematic chain has not been carefully considered when predicting the 3D wrists. Second, previous works utilize b…

    Submitted 19 April, 2022; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Published at CVPRW 2022

  24. arXiv:2011.11232  [pdf, other]

    cs.CV

    NeuralAnnot: Neural Annotator for 3D Human Mesh Training Sets

    Authors: Gyeongsik Moon, Hongsuk Choi, Kyoung Mu Lee

    Abstract: Most 3D human mesh regressors are fully supervised with 3D pseudo-GT human model parameters and weakly supervised with GT 2D/3D joint coordinates as the 3D pseudo-GTs bring great performance gain. The 3D pseudo-GTs are obtained by annotators, systems that iteratively fit 3D human model parameters to GT 2D/3D joint coordinates of training sets in the pre-processing stage of the regressors. The fitt…

    Submitted 19 April, 2022; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Published at CVPRW 2022

  25. arXiv:2011.08627  [pdf, other]

    cs.CV

    Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video

    Authors: Hongsuk Choi, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee

    Abstract: Despite the recent success of single image-based 3D human pose and shape estimation methods, recovering temporally consistent and smooth 3D human motion from a video is still challenging. Several video-based methods have been proposed; however, they fail to resolve the single image-based methods' temporal inconsistency issue due to a strong dependency on a static feature of the current frame. In t…

    Submitted 27 April, 2021; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: Accepted to CVPR 2021, 10 pages

  26. arXiv:2008.09309  [pdf, other]

    cs.CV

    InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image

    Authors: Gyeongsik Moon, Shoou-i Yu, He Wen, Takaaki Shiratori, Kyoung Mu Lee

    Abstract: Analysis of hand-hand interactions is a crucial step towards better understanding human behavior. However, most researches in 3D hand pose estimation have focused on the isolated single hand case. Therefore, we firstly propose (1) a large-scale dataset, InterHand2.6M, and (2) a baseline network, InterNet, for 3D interacting hand pose estimation from a single RGB image. The proposed InterHand2.6M c…

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: Published at ECCV 2020

  27. arXiv:2008.09047  [pdf, other]

    cs.CV

    Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose

    Authors: Hongsuk Choi, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Most of the recent deep learning-based 3D human pose and mesh estimation methods regress the pose and shape parameters of human mesh models, such as SMPL and MANO, from an input image. The first weakness of these methods is an appearance domain gap problem, due to different image appearance between train data from controlled environments, such as a laboratory, and test data from in-the-wild enviro…

    Submitted 27 April, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: Accepted to ECCV 2020, 22 pages

  28. arXiv:2008.08213  [pdf, other]

    cs.CV

    DeepHandMesh: A Weakly-supervised Deep Encoder-Decoder Framework for High-fidelity Hand Mesh Modeling

    Authors: Gyeongsik Moon, Takaaki Shiratori, Kyoung Mu Lee

    Abstract: Human hands play a central role in interacting with other people and objects. For realistic replication of such hand motions, high-fidelity hand meshes have to be reconstructed. In this study, we firstly propose DeepHandMesh, a weakly-supervised deep encoder-decoder framework for high-fidelity hand mesh modeling. We design our system to be trained in an end-to-end and weakly-supervised manner; the…

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: Published at ECCV 2020 (Oral)

  29. arXiv:2008.03713  [pdf, other]

    cs.CV

    I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image

    Authors: Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Most of the previous image-based 3D human pose and mesh estimation methods estimate parameters of the human mesh model from an input image. However, directly regressing the parameters from the input image is a highly non-linear mapping because it breaks the spatial relationship between pixels in the input image. In addition, it cannot model the prediction uncertainty, which can make training harde…

    Submitted 1 November, 2020; v1 submitted 9 August, 2020; originally announced August 2020.

    Comments: Published at ECCV 2020

  30. arXiv:2007.06317  [pdf, other]

    cs.CV

    IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos

    Authors: Gyeongsik Moon, Heeseung Kwon, Kyoung Mu Lee, Minsu Cho

    Abstract: Most current action recognition methods heavily rely on appearance information by taking an RGB sequence of entire image regions as input. While being effective in exploiting contextual information around humans, e.g., human appearance and scene category, they are easily fooled by out-of-context action videos where the contexts do not exactly match with target actions. In contrast, pose-based meth…

    Submitted 15 April, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Published at CVPRW 2021

  31. arXiv:1910.12029  [pdf, other]

    cs.CV

    PoseLifter: Absolute 3D human pose lifting network from a single noisy 2D human pose

    Authors: Ju Yong Chang, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: This study presents a new network (i.e., PoseLifter) that can lift a 2D human pose to an absolute 3D pose in a camera coordinate system. The proposed network estimates the absolute 3D location of a target subject and generates an improved 3D relative pose estimation compared with existing pose-lifting methods. Using the PoseLifter with a 2D pose estimator in a cascade fashion can estimate a 3D hum…

    Submitted 13 March, 2020; v1 submitted 26 October, 2019; originally announced October 2019.

  32. arXiv:1907.11346  [pdf, other]

    cs.CV

    Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image

    Authors: Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee

    Abstract: Although significant improvement has been achieved recently in 3D human pose estimation, most of the previous methods only treat a single-person case. In this work, we firstly propose a fully learning-based, camera distance-aware top-down approach for 3D multi-person pose estimation from a single RGB image. The pipeline of the proposed system consists of human detection, absolute 3D human root loc…

    Submitted 17 August, 2019; v1 submitted 25 July, 2019; originally announced July 2019.

    Comments: Published at ICCV 2019

  33. arXiv:1905.03912  [pdf, other]

    cs.CV

    Multi-scale Aggregation R-CNN for 2D Multi-person Pose Estimation

    Authors: Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee

    Abstract: Multi-person pose estimation from a 2D image is challenging because it requires not only keypoint localization but also human detection. In state-of-the-art top-down methods, multi-scale information is a crucial factor for the accurate pose estimation because it contains both of local information around the keypoints and global information of the entire person. Although multi-scale information all…

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: Published at CVPRW 2019

  34. arXiv:1904.07935  [pdf, other]

    cs.LG cs.DC stat.ML

    PL-NMF: Parallel Locality-Optimized Non-negative Matrix Factorization

    Authors: Gordon E. Moon, Aravind Sukumaran-Rajam, Srinivasan Parthasarathy, P. Sadayappan

    Abstract: Non-negative Matrix Factorization (NMF) is a key kernel for unsupervised dimension reduction used in a wide range of applications, including topic modeling, recommender systems and bioinformatics. Due to the compute-intensive nature of applications that must perform repeated NMF, several parallel implementations have been developed in the past. However, existing parallel NMF algorithms have not ad…

    Submitted 16 April, 2019; originally announced April 2019.

    Comments: 11 pages, 5 tables, 9 figures

  35. arXiv:1812.03595  [pdf, other]

    cs.CV

    PoseFix: Model-agnostic General Human Pose Refinement Network

    Authors: Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee

    Abstract: Multi-person pose estimation from a 2D image is an essential technique for human behavior understanding. In this paper, we propose a human pose refinement network that estimates a refined pose from a tuple of an input image and input pose. The pose refinement was performed mainly through an end-to-end trainable multi-stage architecture in previous methods. However, they are highly dependent on pos…

    Submitted 10 March, 2019; v1 submitted 9 December, 2018; originally announced December 2018.

    Comments: Published at CVPR 2019

  36. arXiv:1712.03917  [pdf, other]

    cs.CV

    Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals

    Authors: Shanxin Yuan, Guillermo Garcia-Hernando, Bjorn Stenger, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee, Pavlo Molchanov, Jan Kautz, Sina Honari, Liuhao Ge, Junsong Yuan, Xinghao Chen, Guijin Wang, Fan Yang, Kai Akiyama, Yang Wu, Qingfu Wan, Meysam Madadi, Sergio Escalera, Shile Li, Dongheui Lee, Iason Oikonomidis, Antonis Argyros, Tae-Kyun Kim

    Abstract: In this paper, we strive to answer two questions: What is the current state of 3D hand pose estimation from depth images? And, what are the next challenges that need to be tackled? Following the successful Hands In the Million Challenge (HIM2017), we investigate the top 10 state-of-the-art methods on three tasks: single frame 3D pose estimation, 3D hand tracking, and hand pose estimation during ob…

    Submitted 29 March, 2018; v1 submitted 11 December, 2017; originally announced December 2017.

  37. arXiv:1711.07399  [pdf, other]

    cs.CV

    V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map

    Authors: Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee

    Abstract: Most of the existing deep learning-based methods for 3D hand and human pose estimation from a single depth map are based on a common framework that takes a 2D depth map and directly regresses the 3D coordinates of keypoints, such as hand or human body joints, via 2D convolutional neural networks (CNNs). The first weakness of this approach is the presence of perspective distortion in the 2D depth m…

    Submitted 16 August, 2018; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: HANDS 2017 Challenge Frame-based 3D Hand Pose Estimation Winner (ICCV 2017), Published at CVPR 2018

  38. arXiv:1706.04758  [pdf, other]

    cs.CV

    Holistic Planimetric prediction to Local Volumetric prediction for 3D Human Pose Estimation

    Authors: Gyeongsik Moon, Ju Yong Chang, Yumin Suh, Kyoung Mu Lee

    Abstract: We propose a novel approach to 3D human pose estimation from a single depth map. Recently, convolutional neural network (CNN) has become a powerful paradigm in computer vision. Many computer vision tasks have benefited from CNNs; however, the conventional approach to directly regress 3D body joint locations from an image does not yield a noticeably improved performance. In contrast, we formulat…

    Submitted 8 July, 2017; v1 submitted 15 June, 2017; originally announced June 2017.