Showing 1–20 of 20 results for author: Ning, G

Searching in archive cs.
  1. arXiv:2506.05935 [pdf, ps, other]

    cs.GR cs.CV

    SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction

    Authors: Yuchao Zheng, Jianing Zhang, Guochen Ning, Hongen Liao

    Abstract: Intraoperative navigation relies heavily on precise 3D reconstruction to ensure accuracy and safety during surgical procedures. However, endoscopic scenarios present unique challenges, including sparse features and inconsistent lighting, which render many existing Structure-from-Motion (SfM)-based methods inadequate and prone to reconstruction failure. To mitigate these constraints, we propose Sur…

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2506.03524 [pdf, ps, other]

    cs.CL cs.SE

    Seed-Coder: Let the Code Model Curate Data for Itself

    Authors: ByteDance Seed, Yuyu Zhang, Jing Su, Yifan Sun, Chenguang Xi, Xia Xiao, Shen Zheng, Anxiang Zhang, Kaibo Liu, Daoguang Zan, Tao Sun, Jinhua Zhu, Shulin Xin, Dong Huang, Yetao Bai, Lixin Dong, Chao Li, Jianchong Chen, Hanzhi Zhou, Yifan Huang, Guanghan Ning, Xierui Song, Jiaze Chen, Siyao Liu, Kai Shen, et al. (2 additional authors not shown)

    Abstract: Code data in large language model (LLM) pretraining is recognized crucial not only for code-related tasks but also for enhancing general intelligence of LLMs. Current open-source LLMs often heavily rely on human effort to produce their code pretraining data, such as employing hand-crafted filtering rules tailored to individual programming languages, or using human-annotated data to train quality f…

    Submitted 4 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

  3. arXiv:2504.13914 [pdf, other]

    cs.CL

    Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

    Authors: ByteDance Seed: Jiaze Chen, Tiantian Fan, Xin Liu, Lingjun Liu, Zhiqi Lin, Mingxuan Wang, Chengyi Wang, Xiangpeng Wei, Wenyuan Xu, Yufeng Yuan, Yu Yue, Lin Yan, Qiying Yu, Xiaochen Zuo, Chi Zhang, Ruofei Zhu, Zhecheng An, Zhihao Bai, Yu Bao, Xingyan Bin, Jiangjie Chen, Feng Chen, Hongmin Chen, et al. (249 additional authors not shown)

    Abstract: We introduce Seed1.5-Thinking, capable of reasoning through thinking before responding, resulting in improved performance on a wide range of benchmarks. Seed1.5-Thinking achieves 86.7 on AIME 2024, 55.0 on Codeforces and 77.3 on GPQA, demonstrating excellent reasoning abilities in STEM and coding. Beyond reasoning tasks, the method demonstrates notable generalization across diverse domains. For in…

    Submitted 29 April, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

  4. arXiv:2412.00535 [pdf, other]

    cs.AI cs.SE

    FullStack Bench: Evaluating LLMs as Full Stack Coders

    Authors: Bytedance-Seed-Foundation-Code-Team: Yao Cheng, Jianfeng Chen, Jie Chen, Li Chen, Liyu Chen, Wentao Chen, Zhengyu Chen, Shijie Geng, Aoyan Li, Bo Li, Bowen Li, Linyi Li, Boyi Liu, Jiaheng Liu, Kaibo Liu, Qi Liu, Shukai Liu, Siyao Liu, Tianyi Liu, Tingkai Liu, Yongfei Liu, Rui Long, Jing Mai, et al. (31 additional authors not shown)

    Abstract: As the capabilities of code large language models (LLMs) continue to expand, their applications across diverse code intelligence domains are rapidly increasing. However, most existing datasets only evaluate limited application domains. To address this gap, we have developed a comprehensive code evaluation dataset FullStack Bench focusing on full-stack programming, which encompasses a wide range of…

    Submitted 12 May, 2025; v1 submitted 30 November, 2024; originally announced December 2024.

    Comments: 26 pages

  5. arXiv:2410.13699 [pdf, other]

    cs.CL

    Unconstrained Model Merging for Enhanced LLM Reasoning

    Authors: Yiming Zhang, Baoyi He, Shengyu Zhang, Yuhao Fu, Qi Zhou, Zhijie Sang, Zijin Hong, Kejing Yang, Wenjun Wang, Jianbo Yuan, Guanghan Ning, Linyi Li, Chunlin Ji, Fei Wu, Hongxia Yang

    Abstract: Recent advancements in building domain-specific large language models (LLMs) have shown remarkable success, especially in tasks requiring reasoning abilities like logical inference over complex relationships and multi-step problem solving. However, creating a powerful all-in-one LLM remains challenging due to the need for proprietary data and vast computational resources. As a resource-friendly al…

    Submitted 21 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: Under review, correct typos

  6. arXiv:2407.03374 [pdf]

    cs.AI cs.SE eess.SP eess.SY

    An Outline of Prognostics and Health Management Large Model: Concepts, Paradigms, and Challenges

    Authors: Laifa Tao, Shangyu Li, Haifei Liu, Qixuan Huang, Liang Ma, Guoao Ning, Yiling Chen, Yunlong Wu, Bin Li, Weiwei Zhang, Zhengduo Zhao, Wenchao Zhan, Wenyan Cao, Chao Wang, Hongmei Liu, Jian Ma, Mingliang Suo, Yujie Cheng, Yu Ding, Dengwei Song, Chen Lu

    Abstract: Prognosis and Health Management (PHM), critical for ensuring task completion by complex systems and preventing unexpected failures, is widely adopted in aerospace, manufacturing, maritime, rail, energy, etc. However, PHM's development is constrained by bottlenecks like generalization, interpretation and verification abilities. Presently, generative artificial intelligence (AI), represented by Larg…

    Submitted 1 July, 2024; originally announced July 2024.

  7. arXiv:2404.07940 [pdf, other]

    cs.SE cs.LG

    InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models

    Authors: Linyi Li, Shijie Geng, Zhenwen Li, Yibo He, Hao Yu, Ziyue Hua, Guanghan Ning, Siwei Wang, Tao Xie, Hongxia Yang

    Abstract: Large Language Models for code (code LLMs) have witnessed tremendous progress in recent years. With the rapid development of code LLMs, many popular evaluation benchmarks, such as HumanEval, DS-1000, and MBPP, have emerged to measure the performance of code LLMs with a particular focus on code generation tasks. However, they are insufficient to cover the full range of expected capabilities of code…

    Submitted 14 November, 2024; v1 submitted 10 March, 2024; originally announced April 2024.

    Comments: 31 pages. Appear at NeurIPS 2024 Datasets and Benchmarks track. Project website: https://infi-coder.github.io/infibench

  8. arXiv:2310.06389 [pdf, other]

    cs.CV stat.ML

    Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling

    Authors: Huangjie Zheng, Zhendong Wang, Jianbo Yuan, Guanghan Ning, Pengcheng He, Quanzeng You, Hongxia Yang, Mingyuan Zhou

    Abstract: Diffusion models excel at generating photo-realistic images but come with significant computational costs in both training and sampling. While various techniques address these computational challenges, a less-explored issue is designing an efficient and adaptable network backbone for iterative refinement. Current options like U-Net and Vision Transformer often rely on resource-intensive deep netwo…

    Submitted 27 June, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  9. arXiv:2205.05675 [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, et al. (86 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of e…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

  10. Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

    Authors: Mingcheng Chen, Zhenghui Wang, Zhiyun Zhao, Weinan Zhang, Xiawei Guo, Jian Shen, Yanru Qu, Jieli Lu, Min Xu, Yu Xu, Tiange Wang, Mian Li, Wei-Wei Tu, Yong Yu, Yufang Bi, Weiqing Wang, Guang Ning

    Abstract: Diabetes prediction is an important data science application in the social healthcare domain. There exist two main challenges in the diabetes prediction task: data heterogeneity since demographic and metabolic data are of different types, data insufficiency since the number of diabetes cases in a single medical center is usually limited. To tackle the above challenges, we employ gradient boosting…

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: 11 pages (2 pages of supplementary), 10 figures, 7 tables. Accepted by ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)

  11. arXiv:2103.02852 [pdf, other]

    cs.CV

    Data Augmentation for Object Detection via Differentiable Neural Rendering

    Authors: Guanghan Ning, Guang Chen, Chaowei Tan, Si Luo, Liefeng Bo, Heng Huang

    Abstract: It is challenging to train a robust object detector under the supervised learning setting when the annotated data are scarce. Thus, previous approaches tackling this problem are in two categories: semi-supervised learning models that interpolate labeled data from unlabeled data, and self-supervised learning approaches that exploit signals within unlabeled data via pretext tasks. To seamlessly inte…

    Submitted 5 April, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 15 pages, 15 figures

  12. arXiv:2007.14625 [pdf]

    cs.CV

    Deep Multi-Scale Resemblance Network for the Sub-class Differentiation of Adrenal Masses on Computed Tomography Images

    Authors: Lei Bi, Jinman Kim, Tingwei Su, Michael Fulham, David Dagan Feng, Guang Ning

    Abstract: The accurate classification of mass lesions in the adrenal glands (adrenal masses), detected with computed tomography (CT), is important for diagnosis and patient management. Adrenal masses can be benign or malignant and benign masses have varying prevalence. Classification methods based on convolutional neural networks (CNNs) are the state-of-the-art in maximizing inter-class differences in large…

    Submitted 5 August, 2022; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: 22 pages

  13. arXiv:1905.02822 [pdf, other]

    cs.CV

    LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking

    Authors: Guanghan Ning, Heng Huang

    Abstract: In this paper, we propose a novel effective light-weight framework, called LightTrack, for online human pose tracking. The proposed framework is designed to be generic for top-down pose tracking and is faster than existing online and offline methods. Single-person Pose Tracking (SPT) and Visual Object Tracking (VOT) are incorporated into one unified functioning entity, easily implemented by a repl…

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: 9 pages, 6 figures, 6 tables

  14. arXiv:1901.07680 [pdf, other]

    cs.CV

    A Top-down Approach to Articulated Human Pose Estimation and Tracking

    Authors: Guanghan Ning, Ping Liu, Xiaochuan Fan, Chi Zhang

    Abstract: Both the tasks of multi-person human pose estimation and pose tracking in videos are quite challenging. Existing methods can be categorized into two groups: top-down and bottom-up approaches. In this paper, following the top-down approach, we aim to build a strong baseline system with three modules: human candidate detector, single-person pose estimator and human pose tracker. Firstly, we choose a…

    Submitted 22 January, 2019; originally announced January 2019.

    Comments: To appear in ECCVW (2018). Workshop: 2nd PoseTrack Challenge

  15. arXiv:1804.09803 [pdf, other]

    cs.CV

    Progressive Neural Networks for Image Classification

    Authors: Zhi Zhang, Guanghan Ning, Yigang Cen, Yang Li, Zhiqun Zhao, Hao Sun, Zhihai He

    Abstract: The inference structures and computational complexity of existing deep neural networks, once trained, are fixed and remain the same for all test images. However, in practice, it is highly desirable to establish a progressive structure for deep neural networks which is able to adapt its inference process and complexity for images with different visual recognition complexity. In this work, we develo…

    Submitted 25 April, 2018; originally announced April 2018.

  16. arXiv:1710.10192 [pdf, other]

    cs.CV

    Dual Path Networks for Multi-Person Human Pose Estimation

    Authors: Guanghan Ning, Zhihai He

    Abstract: The task of multi-person human pose estimation in natural scenes is quite challenging. Existing methods include both top-down and bottom-up approaches. The main advantage of bottom-up methods is its excellent tradeoff between estimation accuracy and computational cost. We follow this path and aim to design smaller, faster, and more accurate neural networks for the regression of keypoints and limb…

    Submitted 27 October, 2017; originally announced October 2017.

    Comments: ICCV 2017 Workshop on PoseTrack Challenge. Challenge results available at: https://posetrack.net/workshops/iccv2017/posetrack-challenge-results.html

  17. arXiv:1710.09505 [pdf, other]

    cs.CV

    Knowledge Projection for Deep Neural Networks

    Authors: Zhi Zhang, Guanghan Ning, Zhihai He

    Abstract: While deeper and wider neural networks are actively pushing the performance limits of various computer vision and machine learning tasks, they often require large sets of labeled data for effective training and suffer from extremely high computational complexity. In this paper, we will develop a new framework for training deep neural networks on datasets with limited labeled samples using cross-ne…

    Submitted 25 October, 2017; originally announced October 2017.

  18. arXiv:1705.02407 [pdf, other]

    cs.CV

    Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation

    Authors: Guanghan Ning, Zhi Zhang, Zhihai He

    Abstract: Human pose estimation using deep neural networks aims to map input images with large variations into multiple body keypoints which must satisfy a set of geometric constraints and inter-dependency imposed by the human body model. This is a very challenging nonlinear manifold learning process in a very high dimensional feature space. We believe that the deep neural network, which is inherently an al…

    Submitted 8 August, 2017; v1 submitted 5 May, 2017; originally announced May 2017.

    Comments: 13 pages, 12 figures. arXiv admin note: text overlap with arXiv:1609.01743, arXiv:1702.07432, arXiv:1602.00134 by other authors

  19. arXiv:1609.01331 [pdf, other]

    cs.IR

    Joint Audio-Video Fingerprint Media Retrieval Using Rate-Coverage Optimization

    Authors: Guanghan Ning, Zhi Zhang, Xiaobo Ren, Haohong Wang, Zhihai He

    Abstract: In this work, we propose a joint audio-video fingerprint Automatic Content Recognition (ACR) technology for media retrieval. The problem is focused on how to balance the query accuracy and the size of fingerprint, and how to allocate the bits of the fingerprint to video frames and audio frames to achieve the best query accuracy. By constructing a novel concept called Coverage, which is highly corr…

    Submitted 5 September, 2016; originally announced September 2016.

    Comments: 12 pages, 14 figures

  20. arXiv:1607.05781 [pdf, other]

    cs.CV

    Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

    Authors: Guanghan Ning, Zhi Zhang, Chen Huang, Zhihai He, Xiaobo Ren, Haohong Wang

    Abstract: In this paper, we develop a new approach of spatially supervised recurrent convolutional neural networks for visual object tracking. Our recurrent convolutional network exploits the history of locations as well as the distinctive visual features learned by the deep neural networks. Inspired by recent bounding box regression methods for object detection, we study the regression capability of Long S…

    Submitted 19 July, 2016; originally announced July 2016.

    Comments: 10 pages, 9 figures, conference