Showing 1–20 of 20 results for author: Che, C

  1. arXiv:2505.02395  [pdf, other]

    cs.RO eess.SY

    A Real-Time Control Barrier Function-Based Safety Filter for Motion Planning with Arbitrary Road Boundary Constraints

    Authors: Jianye Xu, Chang Che, Bassam Alrifaee

    Abstract: We present a real-time safety filter for motion planning, such as learning-based methods, using Control Barrier Functions (CBFs), which provides formal guarantees for collision avoidance with road boundaries. A key feature of our approach is its ability to directly incorporate road geometries of arbitrary shape without resorting to conservative overapproximations. We formulate the safety filter as…

    Submitted 5 May, 2025; originally announced May 2025.

  2. arXiv:2503.19740  [pdf, other]

    cs.CV

    Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings

    Authors: Chengan Che, Chao Wang, Tom Vercauteren, Sophia Tsoka, Luis C. Garcia-Peraza-Herrera

    Abstract: Advancements in computer-assisted surgical procedures heavily rely on accurate visual data interpretation from camera systems used during surgeries. Traditional open-access datasets focusing on surgical procedures are often limited by their small size, typically consisting of fewer than 100 videos with less than 100K images. To address these constraints, a new dataset called Surg-3M has been compi…

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: 15 pages

  3. arXiv:2411.13949  [pdf, other]

    cs.CV cs.AI

    Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning

    Authors: Ziqi Wang, Chang Che, Qi Wang, Yangyang Li, Zenglin Shi, Meng Wang

    Abstract: Visual instruction tuning (VIT) enables multimodal large language models (MLLMs) to effectively handle a wide range of vision tasks by framing them as language-based instructions. Building on this, continual visual instruction tuning (CVIT) extends the capability of MLLMs to incrementally learn new tasks, accommodating evolving functionalities. While prior work has advanced CVIT through the develo…

    Submitted 21 November, 2024; originally announced November 2024.

  4. arXiv:2410.18983  [pdf]

    cs.HC

    Smart Navigation System for Parking Assignment at Large Events: Incorporating Heterogeneous Driver Characteristics

    Authors: Xi Cheng, Tong Liu, Gaofeng Su, Chang Che, Chen Zhu, Ke Liu, Binze Cai, Xin Hu

    Abstract: Parking challenges escalate significantly during large events such as concerts and sports games, yet few studies address dynamic parking lot assignments on these occasions. This paper introduces a smart navigation system designed to optimize parking assignments efficiently during major events, employing a mixed search algorithm that considers diverse driver characteristics. We validated our syste…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: text overlap with arXiv:2406.05135

  5. arXiv:2408.16924  [pdf, other]

    cs.CV cs.ET

    Enhancing Autism Spectrum Disorder Early Detection with the Parent-Child Dyads Block-Play Protocol and an Attention-enhanced GCN-xLSTM Hybrid Deep Learning Framework

    Authors: Xiang Li, Lizhou Fan, Hanbo Wu, Kunping Chen, Xiaoxiao Yu, Chao Che, Zhifeng Cai, Xiuhong Niu, Aihua Cao, Xin Ma

    Abstract: Autism Spectrum Disorder (ASD) is a rapidly growing neurodevelopmental disorder. Performing a timely intervention is crucial for the growth of young children with ASD, but traditional clinical screening methods lack objectivity. This study introduces an innovative approach to early detection of ASD. The contributions are threefold. First, this work proposes a novel Parent-Child Dyads Block-Play (P…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 18 pages, 8 figures, and 4 tables

  6. arXiv:2407.16295  [pdf, other]

    cs.CR

    Manifoldchain: Maximizing Blockchain Throughput via Bandwidth-Clustered Sharding

    Authors: Chunjiang Che, Songze Li, Xuechao Wang

    Abstract: Bandwidth limitation is the major bottleneck that hinders scaling throughput of proof-of-work blockchains. To guarantee security, the mining rate of the blockchain is determined by the miners with the lowest bandwidth, resulting in an inefficient bandwidth utilization among fast miners. We propose Manifoldchain, an innovative blockchain sharding protocol that alleviates the impact of slow miners t…

    Submitted 23 July, 2024; originally announced July 2024.

  7. arXiv:2407.00362  [pdf, other]

    cs.CV cs.AI

    JSCDS: A Core Data Selection Method with Jason-Shannon Divergence for Caries RGB Images-Efficient Learning

    Authors: Peiliang Zhang, Yujia Tong, Chenghu Du, Chao Che, Yongjun Zhu

    Abstract: Deep learning-based RGB caries detection improves the efficiency of caries identification and is crucial for preventing oral diseases. The performance of deep learning models depends on high-quality data and requires substantial training resources, making efficient deployment challenging. Core data selection, by eliminating low-quality and confusing data, aims to enhance training efficiency withou…

    Submitted 6 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted in KDD 2024 Workshop AIDSH

  8. arXiv:2406.10744  [pdf, other]

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, Cong Li, Senyan Xu, et al. (75 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a…

    Submitted 12 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 PBDL Challenges: https://pbdl-ws.github.io/pbdl2024/challenge/index.html

  9. arXiv:2405.07479  [pdf, other]

    cs.RO

    Enhancing 3D Object Detection by Using Neural Network with Self-adaptive Thresholding

    Authors: Houze Liu, Chongqing Wang, Xiaoan Zhan, Haotian Zheng, Chang Che

    Abstract: Robust 3D object detection remains a pivotal concern in the domain of autonomous field robotics. Despite notable enhancements in detection accuracy across standard datasets, real-world urban environments, characterized by their unstructured and dynamic nature, frequently precipitate an elevated incidence of false positives, thereby undermining the reliability of existing detection paradigms. In th…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted by CONF-SEML 2024

  10. arXiv:2404.03523  [pdf]

    cs.CE

    Integrating Generative AI into Financial Market Prediction for Improved Decision Making

    Authors: Chang Che, Zengyi Huang, Chen Li, Haotian Zheng, Xinyu Tian

    Abstract: This study provides an in-depth analysis of the model architecture and key technologies of generative artificial intelligence, combined with specific application cases, and uses conditional generative adversarial networks (cGAN) and time series analysis methods to simulate and predict dynamic changes in financial markets. The research results show that the cGAN model can effectively capture the…

    Submitted 4 April, 2024; originally announced April 2024.

  11. arXiv:2404.01116  [pdf]

    cs.RO

    Intelligent Robotic Control System Based on Computer Vision Technology

    Authors: Chang Che, Haotian Zheng, Zengyi Huang, Wei Jiang, Bo Liu

    Abstract: The article explores the intersection of computer vision technology and robotic control, highlighting its importance in various fields such as industrial automation, healthcare, and environmental protection. Computer vision technology, which simulates human visual observation, plays a crucial role in enabling robots to perceive and understand their surroundings, leading to advancements in tasks li…

    Submitted 1 April, 2024; originally announced April 2024.

  12. arXiv:2401.06782  [pdf, other]

    cs.CL cs.AI

    Semantic Similarity Matching for Patent Documents Using Ensemble BERT-related Model and Novel Text Processing Method

    Authors: Liqiang Yu, Bo Liu, Qunwei Lin, Xinyu Zhao, Chang Che

    Abstract: In the realm of patent document analysis, assessing semantic similarity between phrases presents a significant challenge, notably amplifying the inherent complexities of Cooperative Patent Classification (CPC) research. Firstly, this study addresses these challenges, recognizing early CPC work while acknowledging past struggles with language barriers and document intricacy. Secondly, it underscore…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by the 6th International Conference on Machine Learning and Machine Intelligence (MLMI 2023)

  13. arXiv:2401.06167  [pdf, other]

    cs.CV cs.AI

    Enhancing Multimodal Understanding with CLIP-Based Image-to-Text Transformation

    Authors: Chang Che, Qunwei Lin, Xinyu Zhao, Jiaxin Huang, Liqiang Yu

    Abstract: The process of transforming input images into corresponding textual explanations stands as a crucial and complex endeavor within the domains of computer vision and natural language processing. In this paper, we propose an innovative ensemble approach that harnesses the capabilities of Contrastive Language-Image Pretraining models.

    Submitted 1 January, 2024; originally announced January 2024.

  14. arXiv:2401.05433  [pdf, other]

    cs.CL cs.AI

    Enhancing Essay Scoring with Adversarial Weights Perturbation and Metric-specific AttentionPooling

    Authors: Jiaxin Huang, Xinyu Zhao, Chang Che, Qunwei Lin, Bo Liu

    Abstract: The objective of this study is to improve automated feedback tools designed for English Language Learners (ELLs) through the utilization of data science techniques encompassing machine learning, natural language processing, and educational data analytics. Automated essay scoring (AES) research has made strides in evaluating written essays, but it often overlooks the specific needs of English Langu…

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: This article was accepted by the 2023 International Conference on Information Network and Computer Communications (INCC)

  15. arXiv:2312.12872  [pdf]

    cs.CV cs.AI

    Integration and Performance Analysis of Artificial Intelligence and Computer Vision Based on Deep Learning Algorithms

    Authors: Bo Liu, Liqiang Yu, Chang Che, Qunwei Lin, Hao Hu, Xinyu Zhao

    Abstract: This paper focuses on the analysis of the application effectiveness of the integration of deep learning and computer vision technologies. Deep learning achieves a historic breakthrough by constructing hierarchical neural networks, enabling end-to-end feature learning and semantic understanding of images. The successful experiences in the field of computer vision provide strong support for training…

    Submitted 20 December, 2023; originally announced December 2023.

  16. arXiv:2301.05639  [pdf]

    cs.LG physics.chem-ph

    Predictions of photophysical properties of phosphorescent platinum(II) complexes based on ensemble machine learning approach

    Authors: Shuai Wang, ChiYung Yam, Shuguang Chen, Lihong Hu, Liping Li, Faan-Fung Hung, Jiaqi Fan, Chi-Ming Che, GuanHua Chen

    Abstract: Phosphorescent metal complexes have been under intense investigation as emissive dopants for energy-efficient organic light-emitting diodes (OLEDs). Among them, cyclometalated Pt(II) complexes are widespread triplet emitters with color-tunable emissions. To make them practical as OLED emitters, there is a great need to develop Pt(II) complexes with high radiative decay rate constant…

    Submitted 7 January, 2023; originally announced January 2023.

  17. A Decentralized Federated Learning Framework via Committee Mechanism with Convergence Guarantee

    Authors: Chunjiang Che, Xiaoli Li, Chuan Chen, Xiaoyu He, Zibin Zheng

    Abstract: Federated learning allows multiple participants to collaboratively train an efficient model without exposing data privacy. However, this distributed machine learning training method is prone to attacks from Byzantine clients, which interfere with the training of the global model by modifying the model or uploading the false gradient. In this paper, we propose a novel serverless federated learning…

    Submitted 7 September, 2022; v1 submitted 1 August, 2021; originally announced August 2021.

  18. Nine Million Book Items and Eleven Million Citations: A Study of Book-Based Scholarly Communication Using OpenCitations

    Authors: Yongjun Zhu, Erjia Yan, Silvio Peroni, Chao Che

    Abstract: Books have been widely used to share information and contribute to human knowledge. However, the quantitative use of books as a method of scholarly communication is relatively unexamined compared to journal articles and conference papers. This study uses the COCI dataset (a comprehensive open citation dataset provided by OpenCitations) to explore books' roles in scholarly communication. The COCI d…

    Submitted 6 December, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  19. arXiv:1809.10820  [pdf, other]

    cs.CV

    Inverse Transport Networks

    Authors: Chengqian Che, Fujun Luan, Shuang Zhao, Kavita Bala, Ioannis Gkioulekas

    Abstract: We introduce inverse transport networks as a learning architecture for inverse rendering problems where, given input image measurements, we seek to infer physical scene parameters such as shape, material, and illumination. During training, these networks are evaluated not only in terms of how close they can predict groundtruth parameters, but also in terms of whether the parameters they produce ca…

    Submitted 27 September, 2018; originally announced September 2018.

  20. arXiv:1806.09737  [pdf, other]

    cs.LG stat.ML

    A Multi-View Ensemble Classification Model for Clinically Actionable Genetic Mutations

    Authors: Xi Sheryl Zhang, Dandi Chen, Yongjun Zhu, Chao Che, Chang Su, Sendong Zhao, Xu Min, Fei Wang

    Abstract: This paper presents details of our winning solutions to Task IV of the NIPS 2017 Competition Track, entitled Classifying Clinically Actionable Genetic Mutations. The machine learning task aims to classify genetic mutations based on text evidence from clinical literature with promising performance. We develop a novel multi-view machine learning framework with ensemble classification models to solve…

    Submitted 17 March, 2019; v1 submitted 25 June, 2018; originally announced June 2018.