Skip to main content

Showing 1–50 of 248 results for author: Hu, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.05081  [pdf, ps, other

    cs.AR

    ViPSN 2.0: A Reconfigurable Battery-free IoT Platform for Vibration Energy Harvesting

    Authors: Xin Li, Mianxin Xiao, Xi Shen, Jiaqing Chu, Weifeng Huang, Jiashun Li, Yaoyi Li, Mingjing Cai, Jiaming Chen, Xinming Zhang, Daxing Zhang, Congsi Wang, Hong Tang, Bao Zhao, Qitao Lu, Yilong Wang, Jianjun Wang, Minyi Xu, Shitong Fang, Xuanyu Huang. Chaoyang Zhao, Zicheng Liu, Yaowen Yang, Guobiao Hu, Junrui Liang, Wei-Hsin Liao

    Abstract: Vibration energy harvesting is a promising solution for powering battery-free IoT systems; however, the instability of ambient vibrations presents significant challenges, such as limited harvested energy, intermittent power supply, and poor adaptability to various applications. To address these challenges, this paper proposes ViPSN2.0, a modular and reconfigurable IoT platform that supports multip… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

  2. arXiv:2507.02008  [pdf, ps, other

    cs.LO

    SMT-Sweep: Word-Level Representation Unification for Hardware Verification

    Authors: Ziyi Yang, Guangyu Hu, Mingkai Miao, Changyuan Yu, Hongce Zhang

    Abstract: SAT sweeping has long been a cornerstone technique in logic simplification and equivalence checking at the bit level, leveraging structural hashing, simulation and SAT solving to prune redundant logic. However, with the growing adoption of word-level constructs in hardware verification, such as bit-vector operations, arithmetics and arrays, there lacks a counterpart of SAT sweeping at the word lev… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  3. arXiv:2506.19842  [pdf, ps, other

    cs.RO cs.AI

    ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model

    Authors: Tengbo Yu, Guanxing Lu, Zaijia Yang, Haoyuan Deng, Season Si Chen, Jiwen Lu, Wenbo Ding, Guoqiang Hu, Yansong Tang, Ziwei Wang

    Abstract: Multi-task robotic bimanual manipulation is becoming increasingly popular as it enables sophisticated tasks that require diverse dual-arm collaboration patterns. Compared to unimanual manipulation, bimanual tasks pose challenges to understanding the multi-body spatiotemporal dynamics. An existing method ManiGaussian pioneers encoding the spatiotemporal dynamics into the visual representation via G… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  4. arXiv:2506.08349  [pdf, ps, other

    cs.CL cs.AI

    Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving

    Authors: Yuxuan Zhou, Xien Liu, Chenwei Yan, Chen Ning, Xiao Zhang, Boxun Li, Xiangling Fu, Shijin Wang, Guoping Hu, Yu Wang, Ji Wu

    Abstract: Large language models (LLMs) have demonstrated remarkable performance on various medical benchmarks, but their capabilities across different cognitive levels remain underexplored. Inspired by Bloom's Taxonomy, we propose a multi-cognitive-level evaluation framework for assessing LLMs in the medical domain in this study. The framework integrates existing medical datasets and introduces tasks target… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 20 pages, 11 figures. Accepted by ICML 2025

  5. arXiv:2506.07184  [pdf, ps, other

    cs.AI cs.CL cs.CV

    Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images

    Authors: Liangliang You, Junchi Yao, Shu Yang, Guimin Hu, Lijie Hu, Di Wang

    Abstract: While multimodal large language models excel at various tasks, they still suffer from hallucinations, which limit their reliability and scalability for broader domain applications. To address this issue, recent research mainly focuses on objective hallucination. However, for sequential images, besides objective hallucination, there is also behavioral hallucination, which is less studied. This work… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

  6. arXiv:2506.02736  [pdf, ps, other

    cs.CV cs.RO

    GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal

    Authors: Shufan Qing, Anzhen Li, Qiandi Wang, Yuefeng Niu, Mingchen Feng, Guoliang Hu, Jinqiao Wu, Fengtao Nan, Yingchun Fan

    Abstract: Existing semantic SLAM in dynamic environments mainly identify dynamic regions through object detection or semantic segmentation methods. However, in certain highly dynamic scenarios, the detection boxes or segmentation masks cannot fully cover dynamic regions. Therefore, this paper proposes a robust and efficient GeneA-SLAM2 system that leverages depth variance constraints to handle dynamic scene… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  7. arXiv:2505.18579  [pdf, other

    cs.LG eess.SP

    Mechanical in-sensor computing: a programmable meta-sensor for structural damage classification without external electronic power

    Authors: Tingpeng Zhang, Xuzhang Peng, Mingyuan Zhou, Guobiao Hu, Zhilu Lai

    Abstract: Structural health monitoring (SHM) involves sensor deployment, data acquisition, and data interpretation, commonly implemented via a tedious wired system. The information processing in current practice majorly depends on electronic computers, albeit with universal applications, delivering challenges such as high energy consumption and low throughput due to the nature of digital units. In recent ye… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  8. arXiv:2505.12524  [pdf, ps, other

    cs.DB cs.LG

    HAKES: Scalable Vector Database for Embedding Search Service

    Authors: Guoyu Hu, Shaofeng Cai, Tien Tuan Anh Dinh, Zhongle Xie, Cong Yue, Gang Chen, Beng Chin Ooi

    Abstract: Modern deep learning models capture the semantics of complex data by transforming them into high-dimensional embedding vectors. Emerging applications, such as retrieval-augmented generation, use approximate nearest neighbor (ANN) search in the embedding vector space to find similar data. Existing vector databases provide indexes for efficient ANN searches, with graph-based indexes being the most p… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  9. arXiv:2505.02486  [pdf, ps, other

    cs.LG cs.AI

    SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning

    Authors: Jinpeng Chen, Runmin Cong, Yuzhi Zhao, Hongzheng Yang, Guangneng Hu, Horace Ho Shing Ip, Sam Kwong

    Abstract: Multimodal Continual Instruction Tuning (MCIT) aims to enable Multimodal Large Language Models (MLLMs) to incrementally learn new tasks without catastrophic forgetting. In this paper, we explore forgetting in this context, categorizing it into superficial forgetting and essential forgetting. Superficial forgetting refers to cases where the model's knowledge may not be genuinely lost, but its respo… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  10. AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings

    Authors: Guoqing Hu, An Zhang, Shuo Liu, Zhibo Cai, Xun Yang, Xiang Wang

    Abstract: Recent advancements in sequential recommendation have underscored the potential of Large Language Models (LLMs) for enhancing item embeddings. However, existing approaches face three key limitations: 1) the degradation of the semantic space when high-dimensional language embeddings are mapped to lower-dimensional ID embeddings, 2) the underutilization of language embeddings, and 3) the reliance on… ▽ More

    Submitted 29 April, 2025; v1 submitted 27 April, 2025; originally announced April 2025.

    Comments: Accepted by SIGIR'25

  11. arXiv:2504.18020  [pdf, other

    cs.CV

    Federated Client-tailored Adapter for Medical Image Segmentation

    Authors: Guyue Hu, Siyuan Song, Yukun Kang, Zhu Yin, Gangming Zhao, Chenglong Li, Jin Tang

    Abstract: Medical image segmentation in X-ray images is beneficial for computer-aided diagnosis and lesion localization. Existing methods mainly fall into a centralized learning paradigm, which is inapplicable in the practical medical scenario that only has access to distributed data islands. Federated Learning has the potential to offer a distributed solution but struggles with heavy training instability d… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  12. arXiv:2504.14891  [pdf, other

    cs.CL

    Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey

    Authors: Aoran Gan, Hao Yu, Kai Zhang, Qi Liu, Wenyu Yan, Zhenya Huang, Shiwei Tong, Guoping Hu

    Abstract: Recent advancements in Retrieval-Augmented Generation (RAG) have revolutionized natural language processing by integrating Large Language Models (LLMs) with external information retrieval, enabling accurate, up-to-date, and verifiable text generation across diverse applications. However, evaluating RAG systems presents unique challenges due to their hybrid architecture that combines retrieval and… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 18 pages, 5 figures

  13. arXiv:2504.13978  [pdf

    q-bio.QM cs.LG

    Association between nutritional factors, inflammatory biomarkers and cancer types: an analysis of NHANES data using machine learning

    Authors: Yuqing Liu, Meng Zhao, Guanlan Hu, Yuchen Zhang

    Abstract: Background. Diet and inflammation are critical factors influencing cancer risk. However, the combined impact of nutritional status and inflammatory biomarkers on cancer status and type, using machine learning (ML), remains underexplored. Objectives. This study investigates the association between nutritional factors, inflammatory biomarkers, and cancer status, and whether these relationships dif… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  14. arXiv:2504.11574  [pdf, other

    cs.AR

    E-morphic: Scalable Equality Saturation for Structural Exploration in Logic Synthesis

    Authors: Chen Chen, Guangyu HU, Cunxi Yu, Yuzhe Ma, Hongce Zhang

    Abstract: In technology mapping, the quality of the final implementation heavily relies on the circuit structure after technology-independent optimization. Recent studies have introduced equality saturation as a novel optimization approach. However, its efficiency remains a hurdle against its wide adoption in logic synthesis. This paper proposes a highly scalable and efficient framework named E-morphic. It… ▽ More

    Submitted 20 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

    Comments: Accepted by DAC 2025

  15. arXiv:2504.10655  [pdf

    cond-mat.mtrl-sci cs.AI cs.LG

    MatterTune: An Integrated, User-Friendly Platform for Fine-Tuning Atomistic Foundation Models to Accelerate Materials Simulation and Discovery

    Authors: Lingyu Kong, Nima Shoghi, Guoxiang Hu, Pan Li, Victor Fung

    Abstract: Geometric machine learning models such as graph neural networks have achieved remarkable success in recent years in chemical and materials science research for applications such as high-throughput virtual screening and atomistic simulations. The success of these models can be attributed to their ability to effectively learn latent representations of atomic structures directly from the training dat… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  16. arXiv:2504.09206  [pdf, other

    cs.LG stat.ML

    Rethinking Remaining Useful Life Prediction with Scarce Time Series Data: Regression under Indirect Supervision

    Authors: Jiaxiang Cheng, Yipeng Pang, Guoqiang Hu

    Abstract: Supervised time series prediction relies on directly measured target variables, but real-world use cases such as predicting remaining useful life (RUL) involve indirect supervision, where the target variable is labeled as a function of another dependent variable. Trending temporal regression techniques rely on sequential time series inputs to capture temporal patterns, requiring interpolation when… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  17. arXiv:2504.04353  [pdf, other

    cs.LG stat.ML

    Extending Cox Proportional Hazards Model with Symbolic Non-Linear Log-Risk Functions for Survival Analysis

    Authors: Jiaxiang Cheng, Guoqiang Hu

    Abstract: The Cox proportional hazards (CPH) model has been widely applied in survival analysis to estimate relative risks across different subjects given multiple covariates. Traditional CPH models rely on a linear combination of covariates weighted with coefficients as the log-risk function, which imposes a strong and restrictive assumption, limiting generalization. Recent deep learning methods enable non… ▽ More

    Submitted 24 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

  18. arXiv:2503.11164  [pdf, other

    cs.CL

    Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity

    Authors: Chi Xu, Gefei Zhang, Yantong Zhu, Luca Benini, Guosheng Hu, Yawei Li, Zhihong Zhang

    Abstract: N:M structured pruning is essential for large language models (LLMs) because it can remove less important network weights and reduce the memory and computation requirements. Existing pruning methods mainly focus on designing metrics to measure the importance of network components to guide pruning. Apart from the impact of these metrics, we observe that different layers have different sensitivities… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  19. arXiv:2503.09095  [pdf, other

    cs.CR cs.CV

    C^2 ATTACK: Towards Representation Backdoor on CLIP via Concept Confusion

    Authors: Lijie Hu, Junchi Liao, Weimin Lyu, Shaopeng Fu, Tianhao Huang, Shu Yang, Guimin Hu, Di Wang

    Abstract: Backdoor attacks pose a significant threat to deep learning models, enabling adversaries to embed hidden triggers that manipulate the behavior of the model during inference. Traditional backdoor attacks typically rely on inserting explicit triggers (e.g., external patches, or perturbations) into input data, but they often struggle to evade existing defense mechanisms. To address this limitation, w… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  20. arXiv:2503.01542  [pdf, other

    cs.CL cs.AI

    Revisiting Large Language Model Pruning using Neuron Semantic Attribution

    Authors: Yizhuo Ding, Xinwei Sun, Yanwei Fu, Guosheng Hu

    Abstract: Model pruning technique is vital for accelerating large language models by reducing their size and computational requirements. However, the generalizability of existing pruning methods across diverse datasets and tasks remains unclear. Thus, we conduct extensive evaluations on 24 datasets and 4 tasks using popular pruning methods. Based on these evaluations, we find and then investigate that calib… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  21. arXiv:2503.01424  [pdf, other

    cs.AI cs.CL

    From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems

    Authors: Zekun Zhou, Xiaocheng Feng, Lei Huang, Xiachong Feng, Ziyun Song, Ruihan Chen, Liang Zhao, Weitao Ma, Yuxuan Gu, Baoxin Wang, Dayong Wu, Guoping Hu, Ting Liu, Bing Qin

    Abstract: Research is a fundamental process driving the advancement of human civilization, yet it demands substantial time and effort from researchers. In recent years, the rapid development of artificial intelligence (AI) technologies has inspired researchers to explore how AI can accelerate and enhance research. To monitor relevant advancements, this paper presents a systematic review of the progress in t… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  22. arXiv:2502.19971  [pdf, ps, other

    quant-ph cs.AI

    Efficient and Universal Neural-Network Decoder for Stabilizer-Based Quantum Error Correction

    Authors: Gengyuan Hu, Wanli Ouyang, Chao-Yang Lu, Chen Lin, Han-Sen Zhong

    Abstract: Scaling quantum computing to practical applications necessitates reliable quantum error correction. Although numerous correction codes have been proposed, the overall correction efficiency critically limited by the decode algorithms. We introduce GraphQEC, a code-agnostic decoder leveraging machine-learning on the graph structure of stabilizer codes with linear time complexity. GraphQEC demonstrat… ▽ More

    Submitted 3 June, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

  23. arXiv:2502.11715  [pdf, other

    cs.LG cs.AI

    Proactive Depot Discovery: A Generative Framework for Flexible Location-Routing

    Authors: Site Qu, Guoqiang Hu

    Abstract: The Location-Routing Problem (LRP), which combines the challenges of facility (depot) locating and vehicle route planning, is critically constrained by the reliance on predefined depot candidates, limiting the solution space and potentially leading to suboptimal outcomes. Previous research on LRP without predefined depots is scant and predominantly relies on heuristic algorithms that iteratively a… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  24. arXiv:2502.11169  [pdf, ps, other

    cs.CL

    CMCTS: A Constrained Monte Carlo Tree Search Framework for Mathematical Reasoning in Large Language Model

    Authors: Qingwen Lin, Boyan Xu, Guimin Hu, Zijian Li, Zhifeng Hao, Keli Zhang, Ruichu Cai

    Abstract: This paper introduces the Constrained Monte Carlo Tree Search (CMCTS) framework to enhance the mathematical reasoning capabilities of Large Language Models (LLM). By incorporating a constrained action space, Process Reward Model (PRM), and partial order rules, CMCTS effectively addresses the limitations of existing MCTS methods in terms of state space diversity and action selection rationality. Sp… ▽ More

    Submitted 16 June, 2025; v1 submitted 16 February, 2025; originally announced February 2025.

  25. arXiv:2502.08904  [pdf, other

    cs.AI

    MIH-TCCT: Mitigating Inconsistent Hallucinations in LLMs via Event-Driven Text-Code Cyclic Training

    Authors: Xinxin You, Xien Liu, Qixin Sun, Huan Zhang, Kaiyin Zhou, Shaohui Liu, GuoPing Hu, ShiJin Wang, Si Liu, Ji Wu

    Abstract: Recent methodologies utilizing synthetic datasets have aimed to address inconsistent hallucinations in large language models (LLMs); however,these approaches are primarily tailored to specific tasks, limiting their generalizability. Inspired by the strong performance of code-trained models in logic-intensive domains, we propose a novel framework that leverages event-based text to generate correspo… ▽ More

    Submitted 26 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  26. arXiv:2501.13573  [pdf, other

    cs.CL

    Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization

    Authors: Lei Huang, Xiaocheng Feng, Weitao Ma, Yuchun Fan, Xiachong Feng, Yangfan Ye, Weihong Zhong, Yuxuan Gu, Baoxin Wang, Dayong Wu, Guoping Hu, Bing Qin

    Abstract: Ensuring contextual faithfulness in retrieval-augmented large language models (LLMs) is crucial for building trustworthy information-seeking systems, particularly in long-form question-answering (LFQA) scenarios. In this work, we identify a salient correlation between LFQA faithfulness and retrieval heads, a set of attention heads responsible for retrieving contextual information. Leveraging this… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: Submitted to ARR October 2024

  27. arXiv:2501.10054  [pdf, other

    cs.LG cs.AI

    Accelerating Large Language Models through Partially Linear Feed-Forward Network

    Authors: Gansen Hu, Zhaoguo Wang, Jinglin Wei, Wei Huang, Haibo Chen

    Abstract: Large language models (LLMs) demonstrate remarkable capabilities but face deployment challenges due to their massive parameter counts. While existing compression techniques like pruning can reduce model size, it leads to significant accuracy degradation under high compression ratios. We present a novel perspective inspired by constant folding in compiler optimization. Our approach enables paramete… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    ACM Class: D.4; I.2; D.3.4

  28. arXiv:2412.20417  [pdf, other

    cs.IT eess.SP

    Movable Antenna Array Aided Ultra Reliable Covert Communications

    Authors: Yida Wang, Guojie Hu, Xiaoling Hu, Xingbo Lu, Yuzhen Huang

    Abstract: In this paper, we construct a framework of the movable antenna (MA) aided covert communication shielded by the general noise uncertainty for the first time. According to the analysis performance on the derived closed-form expressions of the sum of the probabilities of the detection errors and the communication outage probability, the perfect covertness and the ultra reliability can be achieved by… ▽ More

    Submitted 29 December, 2024; originally announced December 2024.

    Comments: has been presented in IEEE GLOBECOM 2024

  29. arXiv:2412.18216  [pdf, other

    cs.CV cs.CL

    ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation

    Authors: Mengyang Wu, Yuzhi Zhao, Jialun Cao, Mingjie Xu, Zhongming Jiang, Xuehui Wang, Qinbin Li, Guangneng Hu, Shengchao Qin, Chi-Wing Fu

    Abstract: Controversial contents largely inundate the Internet, infringing various cultural norms and child protection standards. Traditional Image Content Moderation (ICM) models fall short in producing precise moderation decisions for diverse standards, while recent multimodal large language models (MLLMs), when adopted to general rule-based ICM, often produce classification and explanation results that a… ▽ More

    Submitted 20 January, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

    Comments: Accepted by the AAAI 2025

  30. arXiv:2412.11640  [pdf, other

    cs.CR cs.DC

    SeSeMI: Secure Serverless Model Inference on Sensitive Data

    Authors: Guoyu Hu, Yuncheng Wu, Gang Chen, Tien Tuan Anh Dinh, Beng Chin Ooi

    Abstract: Model inference systems are essential for implementing end-to-end data analytics pipelines that deliver the benefits of machine learning models to users. Existing cloud-based model inference systems are costly, not easy to scale, and must be trusted in handling the models and user request data. Serverless computing presents a new opportunity, as it provides elasticity and fine-grained pricing. Our… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  31. arXiv:2412.11448  [pdf, other

    cs.LG cs.AI cs.DC

    TRAIL: Trust-Aware Client Scheduling for Semi-Decentralized Federated Learning

    Authors: Gangqiang Hu, Jianfeng Lu, Jianmin Han, Shuqin Cao, Jing Liu, Hao Fu

    Abstract: Due to the sensitivity of data, Federated Learning (FL) is employed to enable distributed machine learning while safeguarding data privacy and accommodating the requirements of various devices. However, in the context of semi-decentralized FL, clients' communication and training states are dynamic. This variability arises from local training fluctuations, heterogeneous data distributions, and inte… ▽ More

    Submitted 19 December, 2024; v1 submitted 16 December, 2024; originally announced December 2024.

  32. arXiv:2412.07981  [pdf, other

    cs.AI

    Where Common Knowledge Cannot Be Formed, Common Belief Can -- Planning with Multi-Agent Belief Using Group Justified Perspectives

    Authors: Guang Hu, Tim Miller, Nir Lipovetzky

    Abstract: Epistemic planning is the sub-field of AI planning that focuses on changing knowledge and belief. It is important in both multi-agent domains where agents need to have knowledge/belief regarding the environment, but also the beliefs of other agents, including nested beliefs. When modeling knowledge in multi-agent settings, many models face an exponential growth challenge in terms of nested depth.… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 10 pages, including appendix and reference

  33. arXiv:2412.07941  [pdf, other

    cs.AI

    Beyond Static Assumptions: the Predictive Justified Perspective Model for Epistemic Planning

    Authors: Weijia Li, Guang Hu, Yangmengfei Xu

    Abstract: Epistemic Planning (EP) is an important research area dedicated to reasoning about the knowledge and beliefs of agents in multi-agent cooperative or adversarial settings. The Justified Perspective (JP) model is the state-of-the-art approach to solving EP problems with efficiency and expressiveness. However, all existing EP methods inherit the static environment assumption from classical planning.… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 8 pages

  34. arXiv:2412.06838  [pdf

    cs.LG cs.AR

    Hardware implementation of timely reliable Bayesian decision-making using memristors

    Authors: Lekai Song, Pengyu Liu, Yang Liu, Jingfang Pei, Wenyu Cui, Songwei Liu, Yingyi Wen, Teng Ma, Kong-Pang Pun, Leonard W. T. Ng, Guohua Hu

    Abstract: Brains perform decision-making by Bayes theorem. The theorem quantifies events as probabilities and, based on probability rules, renders the decisions. Learning from this, Bayes theorem can be applied to enable efficient user-scene interactions. However, given the probabilistic nature, implementing Bayes theorem in hardware using conventional deterministic computing can incur excessive computation… ▽ More

    Submitted 12 January, 2025; v1 submitted 7 December, 2024; originally announced December 2024.

  35. arXiv:2412.05554  [pdf, other

    eess.SP cs.IT quant-ph

    Rydberg Atomic Quantum Receivers for Classical Wireless Communications and Sensing: Their Models and Performance

    Authors: Tierui Gong, Jiaming Sun, Chau Yuen, Guangwei Hu, Yufei Zhao, Yong Liang Guan, Chong Meng Samson See, Mérouane Debbah, Lajos Hanzo

    Abstract: The significant progress of quantum sensing technologies offer numerous radical solutions for measuring a multitude of physical quantities at an unprecedented precision. Among them, Rydberg atomic quantum receivers (RAQRs) emerge as an eminent solution for detecting the electric field of radio frequency (RF) signals, exhibiting great potential in assisting classical wireless communications and sen… ▽ More

    Submitted 13 May, 2025; v1 submitted 7 December, 2024; originally announced December 2024.

    Comments: 16 pages, 8 figures

  36. arXiv:2411.13907  [pdf, other

    cs.LG cs.AI cs.DC cs.NE

    Split Federated Learning Over Heterogeneous Edge Devices: Algorithm and Optimization

    Authors: Yunrui Sun, Gang Hu, Yinglei Teng, Dunbo Cai

    Abstract: Split Learning (SL) is a promising collaborative machine learning approach, enabling resource-constrained devices to train models without sharing raw data, while reducing computational load and preserving privacy simultaneously. However, current SL algorithms face limitations in training efficiency and suffer from prolonged latency, particularly in sequential settings, where the slowest device can… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  37. arXiv:2411.13785  [pdf, ps, other

    cs.IT eess.SP

    Throughput Maximization for Movable Antenna Systems with Movement Delay Consideration

    Authors: Honghao Wang, Qingqing Wu, Ying Gao, Wen Chen, Weidong Mei, Guojie Hu, Lexi Xu

    Abstract: In this paper, we model the minimum achievable throughput within a transmission block of restricted duration and aim to maximize it in movable antenna (MA)-enabled multiuser downlink communications. Particularly, we account for the antenna moving delay caused by mechanical movement, which has not been fully considered in previous studies, and reveal the trade-off between the delay and signal-to-in… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

  38. arXiv:2411.11214  [pdf, other

    cs.CV

    DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery

    Authors: Jaewoo Heo, George Hu, Zeyu Wang, Serena Yeung-Levy

    Abstract: Human Mesh Recovery (HMR) is an important yet challenging problem with applications across various domains including motion capture, augmented reality, and biomechanics. Accurately predicting human pose parameters from a single image remains a challenging 3D computer vision task. In this work, we introduce DeforHMR, a novel regression-based monocular HMR framework designed to enhance the predictio… ▽ More

    Submitted 17 November, 2024; originally announced November 2024.

    Comments: 11 pages, 5 figures, 3DV2025

  39. arXiv:2411.09946  [pdf

    cs.SI

    Assessing Response Disparities in California Wildland-Urban-Interface (WUI) Cities Using the Compartmental Model

    Authors: Zihui Ma, Guangxiao Hu, Ting-Syuan Lin, Lingyao Li, Songhua Hu, Loni Hagen, Gregory B. Baecher

    Abstract: The increasing frequency and severity of wildfires pose significant risks to communities, infrastructure, and the environment, especially in Wildland-Urban Interface (WUI) areas. Effective disaster management requires understanding how the public perceives and responds to wildfire threats in real-time. This study uses social media data to assess public responses and explores how these responses ar… ▽ More

    Submitted 21 November, 2024; v1 submitted 14 November, 2024; originally announced November 2024.

  40. arXiv:2411.02118  [pdf, other

    cs.HC cs.CL

    Grounding Emotional Descriptions to Electrovibration Haptic Signals

    Authors: Guimin Hu, Zirui Zhao, Lukas Heilmann, Yasemin Vardar, Hasti Seifi

    Abstract: Designing and displaying haptic signals with sensory and emotional attributes can improve the user experience in various applications. Free-form user language provides rich sensory and emotional information for haptic design (e.g., ``This signal feels smooth and exciting''), but little work exists on linking user descriptions to haptic signals (i.e., language grounding). To address this gap, we co… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  41. arXiv:2410.19128  [pdf, other

    cs.CL

    Retrieving Implicit and Explicit Emotional Events Using Large Language Models

    Authors: Guimin Hu, Hasti Seifi

    Abstract: Large language models (LLMs) have garnered significant attention in recent years due to their impressive performance. While considerable research has evaluated these models from various perspectives, the extent to which LLMs can perform implicit and explicit emotion retrieval remains largely unexplored. To address this gap, this study investigates LLMs' emotion retrieval capabilities in commonsens… ▽ More

    Submitted 1 December, 2024; v1 submitted 24 October, 2024; originally announced October 2024.

  42. arXiv:2410.13428  [pdf, other

    cs.IR

    Generate and Instantiate What You Prefer: Text-Guided Diffusion for Sequential Recommendation

    Authors: Guoqing Hu, Zhengyi Yang, Zhibo Cai, An Zhang, Xiang Wang

    Abstract: Recent advancements in generative recommendation systems, particularly in the realm of sequential recommendation tasks, have shown promise in enhancing generalization to new items. Among these approaches, diffusion-based generative recommendation has emerged as an effective tool, leveraging its ability to capture data distributions and generate high-quality samples. Despite effectiveness, two prim… ▽ More

    Submitted 22 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

  43. arXiv:2410.13117  [pdf, other

    cs.IR cs.AI

    Preference Diffusion for Recommendation

    Authors: Shuo Liu, An Zhang, Guoqing Hu, Hong Qian, Tat-seng Chua

    Abstract: Recommender systems predict personalized item rankings based on user preference distributions derived from historical behavior data. Recently, diffusion models (DMs) have gained attention in recommendation for their ability to model complex distributions, yet current DM-based recommenders often rely on traditional objectives like mean squared error (MSE) or recommendation objectives, which are not… ▽ More

    Submitted 20 April, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Accepted by ICLR 2025

    Journal ref: ICLR 2025

  44. arXiv:2409.16626  [pdf, other

    cs.LG cs.AI cs.AR

    Ascend HiFloat8 Format for Deep Learning

    Authors: Yuanyong Luo, Zhongxing Zhang, Richard Wu, Hu Liu, Ying Jin, Kai Zheng, Minmin Wang, Zhanying He, Guipeng Hu, Luyao Chen, Tianchi Hu, Junsong Wang, Minqi Chen, Mikhaylov Dmitry, Korviakov Vladimir, Bobrin Maxim, Yuhao Hu, Guanfu Chen, Zeyi Huang

    Abstract: This preliminary white paper proposes a novel 8-bit floating-point data format HiFloat8 (abbreviated as HiF8) for deep learning. HiF8 features tapered precision. For normal value encoding, it provides 7 exponent values with 3-bit mantissa, 8 exponent values with 2-bit mantissa, and 16 exponent values with 1-bit mantissa. For denormal value encoding, it extends the dynamic range by 7 extra powers o… ▽ More

    Submitted 26 September, 2024; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: 13 Pages, 4 Figures, 9 Tables

  45. arXiv:2409.13989  [pdf, other

    cs.CL cs.AI cs.LG physics.chem-ph q-bio.BM

    ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models

    Authors: Yuqing Huang, Rongyang Zhang, Xuesong He, Xuyang Zhi, Hao Wang, Xin Li, Feiyang Xu, Deguang Liu, Huadong Liang, Yi Li, Jian Cui, Zimu Liu, Shijin Wang, Guoping Hu, Guiquan Liu, Qi Liu, Defu Lian, Enhong Chen

    Abstract: There is a growing interest in the role that LLMs play in chemistry which lead to an increased focus on the development of LLMs benchmarks tailored to chemical domains to assess the performance of LLMs across a spectrum of chemical tasks varying in type and complexity. However, existing benchmarks in this domain fail to adequately meet the specific requirements of chemical research professionals.… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  46. arXiv:2409.08444  [pdf, other

    cs.CV

    Towards Unified Facial Action Unit Recognition Framework by Large Language Models

    Authors: Guohong Hu, Xing Lan, Hanyu Jiang, Jiayi Lyu, Jian Xue

    Abstract: Facial Action Units (AUs) are of great significance in the realm of affective computing. In this paper, we propose AU-LLaVA, the first unified AU recognition framework based on the Large Language Model (LLM). AU-LLaVA consists of a visual encoder, a linear projector layer, and a pre-trained LLM. We meticulously craft the text descriptions and fine-tune the model on various AU datasets, allowing it… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  47. arXiv:2409.07388  [pdf, other

    cs.CL

    Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective

    Authors: Guimin Hu, Yi Xin, Weimin Lyu, Haojian Huang, Chang Sun, Zhihong Zhu, Lin Gui, Ruichu Cai, Erik Cambria, Hasti Seifi

    Abstract: Multimodal affective computing (MAC) has garnered increasing attention due to its broad applications in analyzing human behaviors and intentions, especially in text-dominated multimodal affective computing field. This survey presents the recent trends of multimodal affective computing from NLP perspective through four hot tasks: multimodal sentiment analysis, multimodal emotion recognition in conv… ▽ More

    Submitted 30 October, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

  48. arXiv:2409.07129  [pdf, other

    cs.CV

    MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis

    Authors: Hanyu Jiang, Jian Xue, Xing Lan, Guohong Hu, Ke Lu

    Abstract: This paper introduces MVLLaVA, an intelligent agent designed for novel view synthesis tasks. MVLLaVA integrates multiple multi-view diffusion models with a large multimodal model, LLaVA, enabling it to handle a wide range of tasks efficiently. MVLLaVA represents a versatile and unified platform that adapts to diverse input types, including a single image, a descriptive caption, or a specific chang… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: project page: https://jamesjg.github.io/MVLLaVA_homepage/

  49. arXiv:2409.01580  [pdf, other

    cs.OS

    Foreactor: Exploiting Storage I/O Parallelism with Explicit Speculation

    Authors: Guanzhou Hu, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau

    Abstract: We introduce explicit speculation, a variant of I/O speculation technique where I/O system calls can be parallelized under the guidance of explicit application code knowledge. We propose a formal abstraction -- the foreaction graph -- which describes the exact pattern of I/O system calls in an application function as well as any necessary computation associated to produce their argument values. I/… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 12 pages, 10 figures

  50. arXiv:2409.01576  [pdf, other

    cs.DC

    A Unified, Practical, and Understandable Model of Non-transactional Consistency Levels in Distributed Replication

    Authors: Guanzhou Hu, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau

    Abstract: We present a practical model of non-transactional consistency levels in the context of distributed data replication. Unlike prior work, our simple Shared Object Pool (SOP) model defines common consistency levels in a unified framework centered around the single concept of ordering. This naturally reflects modern cloud object storage services and is thus easy to understand. We show that a consisten… ▽ More

    Submitted 26 May, 2025; v1 submitted 2 September, 2024; originally announced September 2024.

    Comments: 12 pages, 4 figures, 3 tables