Skip to main content

Showing 1–13 of 13 results for author: Zan, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.12527  [pdf, ps, other

    cs.CL

    Detection, Classification, and Mitigation of Gender Bias in Large Language Models

    Authors: Xiaoqing Cheng, Hongying Zan, Lulu Kong, Jinwang Song, Min Peng

    Abstract: With the rapid development of large language models (LLMs), they have significantly improved efficiency across a wide range of domains. However, recent studies have revealed that LLMs often exhibit gender bias, leading to serious social implications. Detecting, classifying, and mitigating gender bias in LLMs has therefore become a critical research focus. In the NLPCC 2025 Shared Task 7: Chinese C… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  2. arXiv:2505.23829  [pdf, other

    cs.CL

    BiasFilter: An Inference-Time Debiasing Framework for Large Language Models

    Authors: Xiaoqing Cheng, Ruizhe Chen, Hongying Zan, Yuxiang Jia, Min Peng

    Abstract: Mitigating social bias in large language models (LLMs) has become an increasingly important research objective. However, existing debiasing methods often incur high human and computational costs, exhibit limited effectiveness, and struggle to scale to larger models and open-ended generation tasks. To address these limitations, this paper proposes BiasFilter, a model-agnostic, inference-time debias… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  3. arXiv:2505.18744  [pdf, ps, other

    cs.CL

    LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Multi-Domain Reasoning Challenges

    Authors: Tao Liu, Hongying Zan, Yifan Li, Dixuan Zhang, Lulu Kong, Haixin Liu, Jiaming Hou, Aoze Zheng, Rui Li, Yiming Qiao, Zewei Luo, Qi Wang, Zhiqiang Zhang, Jiaxi Li, Supeng Liu, Kunli Zhang, Min Peng

    Abstract: Text-to-SQL is a fundamental task in natural language processing that seeks to translate natural language questions into meaningful and executable SQL queries. While existing datasets are extensive and primarily focus on business scenarios and operational logic, they frequently lack coverage of domain-specific knowledge and complex mathematical reasoning. To address this gap, we present a novel da… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: 22 pages, 10 figures

  4. arXiv:2505.14305  [pdf, ps, other

    cs.CL

    JOLT-SQL: Joint Loss Tuning of Text-to-SQL with Confusion-aware Noisy Schema Sampling

    Authors: Jinwang Song, Hongying Zan, Kunli Zhang, Lingling Mu, Yingjie Han, Haobo Hua, Min Peng

    Abstract: Text-to-SQL, which maps natural language to SQL queries, has benefited greatly from recent advances in Large Language Models (LLMs). While LLMs offer various paradigms for this task, including prompting and supervised fine-tuning (SFT), SFT approaches still face challenges such as complex multi-stage pipelines and poor robustness to noisy schema information. To address these limitations, we presen… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Work in progress. 13 pages, 6 figures

  5. arXiv:2412.19140  [pdf, other

    cs.CL cs.AI cs.CE

    SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis

    Authors: Senbin Zhu, Chenyuan He, Hongde Liu, Pengcheng Dong, Hanjie Zhao, Yuchen Yan, Yuxiang Jia, Hongying Zan, Min Peng

    Abstract: In recent years, fine-grained sentiment analysis in finance has gained significant attention, but the scarcity of entity-level datasets remains a key challenge. To address this, we have constructed the largest English and Chinese financial entity-level sentiment analysis datasets to date. Building on this foundation, we propose a novel two-stage sentiment analysis approach called Self-aware In-con… ▽ More

    Submitted 26 December, 2024; originally announced December 2024.

    Comments: This paper is to be published in the Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025)

  6. arXiv:2407.15341  [pdf, other

    cs.CL

    ZZU-NLP at SIGHAN-2024 dimABSA Task: Aspect-Based Sentiment Analysis with Coarse-to-Fine In-context Learning

    Authors: Senbin Zhu, Hanjie Zhao, Xingren Wang, Shanhong Liu, Yuxiang Jia, Hongying Zan

    Abstract: The DimABSA task requires fine-grained sentiment intensity prediction for restaurant reviews, including scores for Valence and Arousal dimensions for each Aspect Term. In this study, we propose a Coarse-to-Fine In-context Learning(CFICL) method based on the Baichuan2-7B model for the DimABSA task in the SIGHAN 2024 workshop. Our method improves prediction accuracy through a two-stage optimization… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Journal ref: https://aclanthology.org/2024.sighan-1.13

  7. arXiv:2403.15800  [pdf, other

    cs.CL

    MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-training

    Authors: Xiaojing Du, Hanjie Zhao, Danyan Xing, Yuxiang Jia, Hongying Zan

    Abstract: In medical information extraction, medical Named Entity Recognition (NER) is indispensable, playing a crucial role in developing medical knowledge graphs, enhancing medical question-answering systems, and analyzing electronic medical records. The challenge in medical NER arises from the complex nested structures and sophisticated medical terminologies, distinguishing it from its counterparts in tr… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  8. arXiv:2403.12316  [pdf, other

    cs.CL

    OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety

    Authors: Chuang Liu, Linhao Yu, Jiaxuan Li, Renren Jin, Yufei Huang, Ling Shi, Junhui Zhang, Xinmeng Ji, Tingting Cui, Tao Liu, Jinwang Song, Hongying Zan, Sun Li, Deyi Xiong

    Abstract: The rapid development of Chinese large language models (LLMs) poses big challenges for efficient LLM evaluation. While current initiatives have introduced new benchmarks or evaluation platforms for assessing Chinese LLMs, many of these focus primarily on capabilities, usually overlooking potential alignment and safety issues. To address this gap, we introduce OpenEval, an evaluation testbed that b… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  9. arXiv:2311.15509  [pdf, other

    cs.CL

    A Corpus for Named Entity Recognition in Chinese Novels with Multi-genres

    Authors: Hanjie Zhao, Jinge Xie, Yuchen Yan, Yuxiang Jia, Yawen Ye, Hongying Zan

    Abstract: Entities like person, location, organization are important for literary text analysis. The lack of annotated data hinders the progress of named entity recognition (NER) in literary domain. To promote the research of literary NER, we build the largest multi-genre literary NER corpus containing 263,135 entities in 105,851 sentences from 260 online Chinese novels spanning 13 different genres. Based o… ▽ More

    Submitted 15 October, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

  10. arXiv:2308.03549  [pdf, other

    cs.CL

    Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue

    Authors: Songhua Yang, Hanjie Zhao, Senbin Zhu, Guangyu Zhou, Hongfei Xu, Yuxiang Jia, Hongying Zan

    Abstract: Recent advances in Large Language Models (LLMs) have achieved remarkable breakthroughs in understanding and responding to user intents. However, their performance lag behind general use cases in some expertise domains, such as Chinese medicine. Existing efforts to incorporate Chinese medicine into LLMs rely on Supervised Fine-Tuning (SFT) with single-turn and distilled dialogue data. These models… ▽ More

    Submitted 28 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  11. Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation

    Authors: Wenjie Hao, Hongfei Xu, Lingling Mu, Hongying Zan

    Abstract: In this paper, we study the use of deep Transformer translation model for the CCMT 2022 Chinese-Thai low-resource machine translation task. We first explore the experiment settings (including the number of BPE merge operations, dropout probability, embedding size, etc.) for the low-resource scenario with the 6-layer Transformer. Considering that increasing the number of layers also increases the r… ▽ More

    Submitted 24 December, 2022; originally announced December 2022.

    Journal ref: In CCMT 2022. Communications in Computer and Information Science, vol 1671. Springer, Singapore (2022)

  12. arXiv:2211.03462  [pdf, other

    cs.CL

    NAPG: Non-Autoregressive Program Generation for Hybrid Tabular-Textual Question Answering

    Authors: Tengxun Zhang, Hongfei Xu, Josef van Genabith, Deyi Xiong, Hongying Zan

    Abstract: Hybrid tabular-textual question answering (QA) requires reasoning from heterogeneous information, and the types of reasoning are mainly divided into numerical reasoning and span extraction. Current numerical reasoning methods autoregressively decode program sequences, and each decoding step produces either an operator or an operand. However, the step-by-step decoding suffers from exposure bias, an… ▽ More

    Submitted 13 October, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

  13. arXiv:2106.08087  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

    Authors: Ningyu Zhang, Mosha Chen, Zhen Bi, Xiaozhuan Liang, Lei Li, Xin Shang, Kangping Yin, Chuanqi Tan, Jian Xu, Fei Huang, Luo Si, Yuan Ni, Guotong Xie, Zhifang Sui, Baobao Chang, Hui Zong, Zheng Yuan, Linfeng Li, Jun Yan, Hongying Zan, Kunli Zhang, Buzhou Tang, Qingcai Chen

    Abstract: Artificial Intelligence (AI), along with the recent progress in biomedical language understanding, is gradually changing medical practice. With the development of biomedical language understanding benchmarks, AI applications are widely used in the medical field. However, most benchmarks are limited to English, which makes it challenging to replicate many of the successes in English for other langu… ▽ More

    Submitted 7 March, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted by ACL 2022