Skip to main content

Showing 1–9 of 9 results for author: Tam, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.16848  [pdf, other

    cs.GR cs.CV

    HSM: Hierarchical Scene Motifs for Multi-Scale Indoor Scene Generation

    Authors: Hou In Derek Pun, Hou In Ivan Tam, Austin T. Wang, Xiaoliang Huo, Angel X. Chang, Manolis Savva

    Abstract: Despite advances in indoor 3D scene layout generation, synthesizing scenes with dense object arrangements remains challenging. Existing methods primarily focus on large furniture while neglecting smaller objects, resulting in unrealistically empty scenes. Those that place small objects typically do not honor arrangement specifications, resulting in largely random placement not following the text d… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: 23 pages, 7 figures

  2. arXiv:2503.14756  [pdf, ps, other

    cs.GR cs.CV

    SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis

    Authors: Hou In Ivan Tam, Hou In Derek Pun, Austin T. Wang, Angel X. Chang, Manolis Savva

    Abstract: Despite recent advances in text-conditioned 3D indoor scene generation, there remain gaps in the evaluation of these methods. Existing metrics primarily assess the realism of generated scenes by comparing them to a set of ground-truth scenes, often overlooking alignment with the input text - a critical factor in determining how effectively a method meets user requirements. We present SceneEval, an… ▽ More

    Submitted 11 June, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

    Comments: Expanded dataset to 500 annotated scene descriptions with new scene types; added validation via extended manual evaluation and a new user study; clarified distinctions from prior metrics; included results using an open-source VLM; stated intent to release code and data; corrected terminology and typos. 24 pages with 8 figures and 6 tables

  3. arXiv:2408.02211  [pdf, ps, other

    cs.GR

    SceneMotifCoder: Example-driven Visual Program Learning for Generating 3D Object Arrangements

    Authors: Hou In Ivan Tam, Hou In Derek Pun, Austin T. Wang, Angel X. Chang, Manolis Savva

    Abstract: Despite advances in text-to-3D generation methods, generation of multi-object arrangements remains challenging. Current methods exhibit failures in generating physically plausible arrangements that respect the provided text description. We present SceneMotifCoder (SMC), an example-driven framework for generating 3D object arrangements through visual program learning. SMC leverages large language m… ▽ More

    Submitted 3 June, 2025; v1 submitted 4 August, 2024; originally announced August 2024.

    Comments: Accepted at 3DV 2025 (Oral). Project page: https://3dlg-hcvc.github.io/smc/. Minor revisions for camera-ready version

  4. arXiv:2406.10180  [pdf, other

    cs.CV

    MeshPose: Unifying DensePose and 3D Body Mesh reconstruction

    Authors: Eric-Tuan Lê, Antonis Kakolyris, Petros Koutras, Himmy Tam, Efstratios Skordos, George Papandreou, Rıza Alp Güler, Iasonas Kokkinos

    Abstract: DensePose provides a pixel-accurate association of images with 3D mesh coordinates, but does not provide a 3D mesh, while Human Mesh Reconstruction (HMR) systems have high 2D reprojection error, as measured by DensePose localization metrics. In this work we introduce MeshPose to jointly tackle DensePose and HMR. For this we first introduce new losses that allow us to use weak DensePose supervision… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

    MSC Class: 68 ACM Class: I.2.10

    Journal ref: CVPR 2024

  5. arXiv:2312.09570  [pdf, other

    cs.CV

    CAGE: Controllable Articulation GEneration

    Authors: Jiayi Liu, Hou In Ivan Tam, Ali Mahdavi-Amiri, Manolis Savva

    Abstract: We address the challenge of generating 3D articulated objects in a controllable fashion. Currently, modeling articulated 3D objects is either achieved through laborious manual authoring, or using methods from prior work that are hard to scale and control directly. We leverage the interplay between part shape, connectivity, and motion using a denoising diffusion-based method with attention modules… ▽ More

    Submitted 20 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Project page: https://3dlg-hcvc.github.io/cage/

  6. arXiv:2009.05266  [pdf, other

    cs.LG stat.ML

    GTEA: Inductive Representation Learning on Temporal Interaction Graphs via Temporal Edge Aggregation

    Authors: Siyue Xie, Yiming Li, Da Sun Handason Tam, Xiaxin Liu, Qiu Fang Ying, Wing Cheong Lau, Dah Ming Chiu, Shou Zhi Chen

    Abstract: In this paper, we propose the Graph Temporal Edge Aggregation (GTEA) framework for inductive learning on Temporal Interaction Graphs (TIGs). Different from previous works, GTEA models the temporal dynamics of interaction sequences in the continuous-time space and simultaneously takes advantage of both rich node and edge/ interaction attributes in the graph. Concretely, we integrate a sequence mode… ▽ More

    Submitted 3 May, 2023; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: accepted by PAKDD2023

  7. arXiv:1906.05546  [pdf, ps, other

    cs.SI cs.LG

    Identifying Illicit Accounts in Large Scale E-payment Networks -- A Graph Representation Learning Approach

    Authors: Da Sun Handason Tam, Wing Cheong Lau, Bin Hu, Qiu Fang Ying, Dah Ming Chiu, Hong Liu

    Abstract: Rapid and massive adoption of mobile/ online payment services has brought new challenges to the service providers as well as regulators in safeguarding the proper uses such services/ systems. In this paper, we leverage recent advances in deep-neural-network-based graph representation learning to detect abnormal/ suspicious financial transactions in real-world e-payment networks. In particular, we… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  8. arXiv:1905.12957  [pdf, other

    cs.IT cs.LG

    Neural Entropic Estimation: A faster path to mutual information estimation

    Authors: Chung Chan, Ali Al-Bashabsheh, Hing Pang Huang, Michael Lim, Da Sun Handason Tam, Chao Zhao

    Abstract: We point out a limitation of the mutual information neural estimation (MINE) where the network fails to learn at the initial training phase, leading to slow convergence in the number of training iterations. To solve this problem, we propose a faster method called the mutual information neural entropic estimation (MI-NEE). Our solution first generalizes MINE to estimate the entropy using a custom r… ▽ More

    Submitted 30 May, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

  9. arXiv:1107.3194  [pdf

    cs.CV

    Fingerprint recognition using standardized fingerprint model

    Authors: Le Hoang Thai, Ha Nhat Tam

    Abstract: Fingerprint recognition is one of most popular and accuracy Biometric technologies. Nowadays, it is used in many real applications. However, recognizing fingerprints in poor quality images is still a very complex problem. In recent years, many algorithms, models...are given to improve the accuracy of recognition system. This paper discusses on the standardized fingerprint model which is used to sy… ▽ More

    Submitted 15 July, 2011; originally announced July 2011.

    Comments: 7 pages, 16 figures, 3 tables, IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 3, No 7, May 2010

    Journal ref: IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 3, No 7, May 2010, ISSN (Online): 1694-0784, ISSN (Print): 1694-0814