Skip to main content

Showing 1–16 of 16 results for author: Zhang, J J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.16722  [pdf, other

    cs.CV cs.AI

    PMG: Progressive Motion Generation via Sparse Anchor Postures Curriculum Learning

    Authors: Yingjie Xi, Jian Jun Zhang, Xiaosong Yang

    Abstract: In computer animation, game design, and human-computer interaction, synthesizing human motion that aligns with user intent remains a significant challenge. Existing methods have notable limitations: textual approaches offer high-level semantic guidance but struggle to describe complex actions accurately; trajectory-based techniques provide intuitive global motion direction yet often fall short in… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  2. arXiv:2411.00632  [pdf, other

    cs.CV cs.LG

    PCoTTA: Continual Test-Time Adaptation for Multi-Task Point Cloud Understanding

    Authors: Jincen Jiang, Qianyu Zhou, Yuhang Li, Xinkui Zhao, Meili Wang, Lizhuang Ma, Jian Chang, Jian Jun Zhang, Xuequan Lu

    Abstract: In this paper, we present PCoTTA, an innovative, pioneering framework for Continual Test-Time Adaptation (CoTTA) in multi-task point cloud understanding, enhancing the model's transferability towards the continually changing target domain. We introduce a multi-task setting for PCoTTA, which is practical and realistic, handling multiple tasks within one unified model during the continual adaptation… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: Accepted to NeurIPS 2024

  3. arXiv:2409.03238  [pdf, other

    cs.CL cs.LG

    Preserving Empirical Probabilities in BERT for Small-sample Clinical Entity Recognition

    Authors: Abdul Rehman, Jian Jun Zhang, Xiaosong Yang

    Abstract: Named Entity Recognition (NER) encounters the challenge of unbalanced labels, where certain entity types are overrepresented while others are underrepresented in real-world datasets. This imbalance can lead to biased models that perform poorly on minority entity classes, impeding accurate and equitable entity recognition. This paper explores the effects of unbalanced entity labels of the BERT-base… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 8 pages, 8 figures

    MSC Class: 68T50 ACM Class: I.2.7

  4. arXiv:2407.08801  [pdf, other

    cs.CV

    DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding

    Authors: Jincen Jiang, Qianyu Zhou, Yuhang Li, Xuequan Lu, Meili Wang, Lizhuang Ma, Jian Chang, Jian Jun Zhang

    Abstract: Recent point cloud understanding research suffers from performance drops on unseen data, due to the distribution shifts across different domains. While recent studies use Domain Generalization (DG) techniques to mitigate this by learning domain-invariant features, most are designed for a single task and neglect the potential of testing data. Despite In-Context Learning (ICL) showcasing multi-task… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  5. arXiv:2406.08673  [pdf, ps, other

    cs.CL cs.AI cs.LG

    HelpSteer2: Open-source dataset for training top-performing reward models

    Authors: Zhilin Wang, Yi Dong, Olivier Delalleau, Jiaqi Zeng, Gerald Shen, Daniel Egert, Jimmy J. Zhang, Makesh Narsimhan Sreedhar, Oleksii Kuchaiev

    Abstract: High-quality preference datasets are essential for training reward models that can effectively guide large language models (LLMs) in generating high-quality responses aligned with human preferences. As LLMs become stronger and better aligned, permissively licensed preference datasets, such as Open Assistant, HH-RLHF, and HelpSteer need to be updated to remain effective for reward modeling. Methods… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2404.09000  [pdf, other

    eess.IV cs.CV cs.LG

    MaSkel: A Model for Human Whole-body X-rays Generation from Human Masking Images

    Authors: Yingjie Xi, Boyuan Cheng, Jingyao Cai, Jian Jun Zhang, Xiaosong Yang

    Abstract: The human whole-body X-rays could offer a valuable reference for various applications, including medical diagnostics, digital animation modeling, and ergonomic design. The traditional method of obtaining X-ray information requires the use of CT (Computed Tomography) scan machines, which emit potentially harmful radiation. Thus it faces a significant limitation for realistic applications because it… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  7. arXiv:2310.11830  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    CLARA: Multilingual Contrastive Learning for Audio Representation Acquisition

    Authors: Kari A Noriy, Xiaosong Yang, Marcin Budka, Jian Jun Zhang

    Abstract: Multilingual speech processing requires understanding emotions, a task made difficult by limited labelled data. CLARA, minimizes reliance on labelled data, enhancing generalization across languages. It excels at fostering shared representations, aiding cross-lingual transfer of speech and emotions, even with little data. Our approach adeptly captures emotional nuances in speech, overcoming subject… ▽ More

    Submitted 1 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  8. arXiv:2305.13137  [pdf, other

    cs.CL cs.LG cs.MM

    EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels

    Authors: Kari Ali Noriy, Xiaosong Yang, Jian Jun Zhang

    Abstract: The increasing adoption of text-to-speech technologies has led to a growing demand for natural and emotive voices that adapt to a conversation's context and emotional tone. The Emotive Narrative Storytelling (EMNS) corpus is a unique speech dataset created to enhance conversations' expressiveness and emotive quality in interactive narrative-driven systems. The corpus consists of a 2.3-hour recordi… ▽ More

    Submitted 25 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Dataset download link: https://openslr.elda.org/136/

  9. arXiv:2109.13085  [pdf, other

    cs.LG cs.CV cs.GL cs.HC

    Towards the Classification of Error-Related Potentials using Riemannian Geometry

    Authors: Yichen Tang, Jerry J. Zhang, Paul M. Corballis, Luke E. Hallum

    Abstract: The error-related potential (ErrP) is an event-related potential (ERP) evoked by an experimental participant's recognition of an error during task performance. ErrPs, originally described by cognitive psychologists, have been adopted for use in brain-computer interfaces (BCIs) for the detection and correction of errors, and the online refinement of decoding algorithms. Riemannian geometry-based fe… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 4 pages, 3 figures, 1 table, submitted to and accepted by the 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), this is the accepted version

    ACM Class: I.5.4; J.3; J.m

  10. arXiv:2107.06964  [pdf, other

    cs.CV

    Surgical Instruction Generation with Transformers

    Authors: Jinglu Zhang, Yinyu Nie, Jian Chang, Jian Jun Zhang

    Abstract: Automatic surgical instruction generation is a prerequisite towards intra-operative context-aware surgical assistance. However, generating instructions from surgical scenes is challenging, as it requires jointly understanding the surgical activity of current view and modelling relationships between visual information and textual description. Inspired by the neural machine translation and imaging c… ▽ More

    Submitted 16 July, 2021; v1 submitted 14 July, 2021; originally announced July 2021.

    Comments: Accepted to MICCAI 2021

  11. arXiv:2010.07428  [pdf, other

    cs.CV

    Skeleton-bridged Point Completion: From Global Inference to Local Adjustment

    Authors: Yinyu Nie, Yiqun Lin, Xiaoguang Han, Shihui Guo, Jian Chang, Shuguang Cui, Jian Jun Zhang

    Abstract: Point completion refers to complete the missing geometries of objects from partial point clouds. Existing works usually estimate the missing shape by decoding a latent feature encoded from the input points. However, real-world objects are usually with diverse topologies and surface details, which a latent feature may fail to represent to recover a clean and complete surface. To this end, we propos… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: Accepted by NeurIPS 2020; Project Page: https://yinyunie.github.io/SKPCN-page/

  12. arXiv:2007.06373  [pdf, other

    eess.IV cs.CV

    Symmetric Dilated Convolution for Surgical Gesture Recognition

    Authors: Jinglu Zhang, Yinyu Nie, Yao Lyu, Hailin Li, Jian Chang, Xiaosong Yang, Jian Jun Zhang

    Abstract: Automatic surgical gesture recognition is a prerequisite of intra-operative computer assistance and objective surgical skill assessment. Prior works either require additional sensors to collect kinematics data or have limitations on capturing temporal information from long and untrimmed surgical videos. To tackle these challenges, we propose a novel temporal convolutional architecture to automatic… ▽ More

    Submitted 14 July, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Accepted to MICCAI 2020

  13. arXiv:2002.12212  [pdf, other

    cs.CV

    Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

    Authors: Yinyu Nie, Xiaoguang Han, Shihui Guo, Yujian Zheng, Jian Chang, Jian Jun Zhang

    Abstract: Semantic reconstruction of indoor scenes refers to both scene understanding and object reconstruction. Existing works either address one part of this problem or focus on independent objects. In this paper, we bridge the gap between understanding and reconstruction, and propose an end-to-end solution to jointly reconstruct room layout, object bounding boxes and meshes from a single image. Instead o… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: Accepted by CVPR 2020

  14. Shallow2Deep: Indoor Scene Modeling by Single Image Understanding

    Authors: Yinyu Nie, Shihui Guo, Jian Chang, Xiaoguang Han, Jiahui Huang, Shi-Min Hu, Jian Jun Zhang

    Abstract: Dense indoor scene modeling from 2D images has been bottlenecked due to the absence of depth information and cluttered occlusions. We present an automatic indoor scene modeling approach using deep features from neural networks. Given a single RGB image, our method simultaneously recovers semantic contents, 3D geometry and object relationship by reasoning indoor environment context. Particularly, w… ▽ More

    Submitted 22 February, 2020; originally announced February 2020.

    Comments: Accepted by Pattern Recognition

    MSC Class: 68T45 (primary); 65D19 (Secondary) ACM Class: I.2.10; I.4.8

    Journal ref: Pattern Recognition. 2020 Feb 12:107271

  15. arXiv:1803.05541  [pdf, other

    cs.CV

    Context-Aware Mixed Reality: A Framework for Ubiquitous Interaction

    Authors: Long Chen, Wen Tang, Nigel John, Tao Ruan Wan, Jian Jun Zhang

    Abstract: Mixed Reality (MR) is a powerful interactive technology that yields new types of user experience. We present a semantic based interactive MR framework that exceeds the current geometry level approaches, a step change in generating high-level context-aware interactions. Our key insight is to build semantic understanding in MR that not only can greatly enhance user experience through object-specific… ▽ More

    Submitted 14 March, 2018; originally announced March 2018.

  16. arXiv:1703.01243  [pdf, other

    cs.CV

    Augmented Reality for Depth Cues in Monocular Minimally Invasive Surgery

    Authors: Long Chen, Wen Tang, Nigel W. John, Tao Ruan Wan, Jian Jun Zhang

    Abstract: One of the major challenges in Minimally Invasive Surgery (MIS) such as laparoscopy is the lack of depth perception. In recent years, laparoscopic scene tracking and surface reconstruction has been a focus of investigation to provide rich additional information to aid the surgical process and compensate for the depth perception issue. However, robust 3D surface reconstruction and augmented reality… ▽ More

    Submitted 1 March, 2017; originally announced March 2017.

    Comments: 15 pages