-
Resource-Friendly Dynamic Enhancement Chain for Multi-Hop Question Answering
Authors:
Binquan Ji,
Haibo Luo,
Yifei Lu,
Lei Hei,
Jiaqi Wang,
Tingjing Liao,
Lingyu Wang,
Shichao Wang,
Feiliang Ren
Abstract:
Knowledge-intensive multi-hop question answering (QA) tasks, which require integrating evidence from multiple sources to address complex queries, often necessitate multiple rounds of retrieval and iterative generation by large language models (LLMs). However, incorporating many documents and extended contexts poses challenges -such as hallucinations and semantic drift-for lightweight LLMs with few…
▽ More
Knowledge-intensive multi-hop question answering (QA) tasks, which require integrating evidence from multiple sources to address complex queries, often necessitate multiple rounds of retrieval and iterative generation by large language models (LLMs). However, incorporating many documents and extended contexts poses challenges -such as hallucinations and semantic drift-for lightweight LLMs with fewer parameters. This work proposes a novel framework called DEC (Dynamic Enhancement Chain). DEC first decomposes complex questions into logically coherent subquestions to form a hallucination-free reasoning chain. It then iteratively refines these subquestions through context-aware rewriting to generate effective query formulations. For retrieval, we introduce a lightweight discriminative keyword extraction module that leverages extracted keywords to achieve targeted, precise document recall with relatively low computational overhead. Extensive experiments on three multi-hop QA datasets demonstrate that DEC performs on par with or surpasses state-of-the-art benchmarks while significantly reducing token consumption. Notably, our approach attains state-of-the-art results on models with 8B parameters, showcasing its effectiveness in various scenarios, particularly in resource-constrained environments.
△ Less
Submitted 21 June, 2025;
originally announced June 2025.
-
Imaging neutron radiation-induced defects in single-crystal chemical vapor deposition diamond at the atomic level
Authors:
Jialiang Zhang,
Futao Huang,
Shuo Li,
Guojun Yu,
Zifeng Xu,
Lifu Hei,
Fanxiu Lv,
Aidan Horne,
Peng Wang,
Ming Qi
Abstract:
Diamond's exceptional properties make it highly suited for applications in challenging radiation environments. Understanding radiation-induced damage in diamond is crucial for enabling its practical applications and advancing materials science. However, direct imaging of radiation-induced crystal defects at the atomic scale remains rare due to diamond's compact lattice structure. Here, we report t…
▽ More
Diamond's exceptional properties make it highly suited for applications in challenging radiation environments. Understanding radiation-induced damage in diamond is crucial for enabling its practical applications and advancing materials science. However, direct imaging of radiation-induced crystal defects at the atomic scale remains rare due to diamond's compact lattice structure. Here, we report the atomic-level characterization of crystal defects induced by high-flux fast neutron radiation (up to $3 \times10^{17}$ n/$cm^2$) in single-crystal chemical vapor deposition diamonds. Through Raman spectroscopy, the phase transition from carbon $sp^3$ to $sp^2$ hybridization was identified, primarily associated with the formation of dumbbell-shaped interstitial defects. Using electron energy loss spectroscopy and aberration-corrected transmission electron microscopy, we observed a clustering trend in defect distribution, where $sp^2$ rich clusters manifested as dislocation structures with a density up to $10^{14}$ $cm^{-2}$. Lomer-Cottrell junctions were identified, offering a possible explanation for defect cluster formation. Radiation-induced point defects were found to be dispersed throughout the diamond lattice, highlighting the widespread nature of primary defect formation. Vacancy defects, along with $\langle 111 \rangle$ and $\langle 100 \rangle$ oriented dumbbell-shaped interstitial defects induced by high-dose neutron irradiation, were directly imaged, providing microscopic structural evidence that complements spectroscopic studies of point defects. Dynamical simulations combined with an adiabatic recombination-based damage model provided insights into the correlation between irradiation dose and resulting crystal damage. These findings advance our understanding of neutron-induced damage mechanisms in diamond and contribute to the development of radiation-resistant diamond materials.
△ Less
Submitted 12 March, 2025;
originally announced March 2025.
-
Multimodal Relational Triple Extraction with Query-based Entity Object Transformer
Authors:
Lei Hei,
Ning An,
Tingjing Liao,
Qi Ma,
Jiaqi Wang,
Feiliang Ren
Abstract:
Multimodal Relation Extraction is crucial for constructing flexible and realistic knowledge graphs. Recent studies focus on extracting the relation type with entity pairs present in different modalities, such as one entity in the text and another in the image. However, existing approaches require entities and objects given beforehand, which is costly and impractical. To address the limitation, we…
▽ More
Multimodal Relation Extraction is crucial for constructing flexible and realistic knowledge graphs. Recent studies focus on extracting the relation type with entity pairs present in different modalities, such as one entity in the text and another in the image. However, existing approaches require entities and objects given beforehand, which is costly and impractical. To address the limitation, we propose a novel task, Multimodal Entity-Object Relational Triple Extraction, which aims to extract all triples (entity span, relation, object region) from image-text pairs. To facilitate this study, we modified a multimodal relation extraction dataset MORE, which includes 21 relation types, to create a new dataset containing 20,264 triples, averaging 5.75 triples per image-text pair. Moreover, we propose QEOT, a query-based model with a selective attention mechanism, to dynamically explore the interaction and fusion of textual and visual information. In particular, the proposed method can simultaneously accomplish entity extraction, relation classification, and object detection with a set of queries. Our method is suitable for downstream applications and reduces error accumulation due to the pipeline-style approaches. Extensive experimental results demonstrate that our proposed method outperforms the existing baselines by 8.06% and achieves state-of-the-art performance.
△ Less
Submitted 16 August, 2024;
originally announced August 2024.
-
RTF: Region-based Table Filling Method for Relational Triple Extraction
Authors:
Ning An,
Lei Hei,
Yong Jiang,
Weiping Meng,
Jingjing Hu,
Boran Huang,
Feiliang Ren
Abstract:
Relational triple extraction is crucial work for the automatic construction of knowledge graphs. Existing methods only construct shallow representations from a token or token pair-level. However, previous works ignore local spatial dependencies of relational triples, resulting in a weakness of entity pair boundary detection. To tackle this problem, we propose a novel Region-based Table Filling met…
▽ More
Relational triple extraction is crucial work for the automatic construction of knowledge graphs. Existing methods only construct shallow representations from a token or token pair-level. However, previous works ignore local spatial dependencies of relational triples, resulting in a weakness of entity pair boundary detection. To tackle this problem, we propose a novel Region-based Table Filling method (RTF). We devise a novel region-based tagging scheme and bi-directional decoding strategy, which regard each relational triple as a region on the relation-specific table, and identifies triples by determining two endpoints of each region. We also introduce convolution to construct region-level table representations from a spatial perspective which makes triples easier to be captured. In addition, we share partial tagging scores among different relations to improve learning efficiency of relation classifier. Experimental results show that our method achieves state-of-the-art with better generalization capability on three variants of two widely used benchmark datasets.
△ Less
Submitted 13 June, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
TechGPT-2.0: A large language model project to solve the task of knowledge graph construction
Authors:
Jiaqi Wang,
Yuying Chang,
Zhong Li,
Ning An,
Qi Ma,
Lei Hei,
Haibo Luo,
Yifei Lu,
Feiliang Ren
Abstract:
Large language models have exhibited robust performance across diverse natural language processing tasks. This report introduces TechGPT-2.0, a project designed to enhance the capabilities of large language models specifically in knowledge graph construction tasks, including named entity recognition (NER) and relationship triple extraction (RTE) tasks in NLP applications. Additionally, it serves a…
▽ More
Large language models have exhibited robust performance across diverse natural language processing tasks. This report introduces TechGPT-2.0, a project designed to enhance the capabilities of large language models specifically in knowledge graph construction tasks, including named entity recognition (NER) and relationship triple extraction (RTE) tasks in NLP applications. Additionally, it serves as a LLM accessible for research within the Chinese open-source model community. We offer two 7B large language model weights and a QLoRA weight specialized for processing lengthy texts.Notably, TechGPT-2.0 is trained on Huawei's Ascend server. Inheriting all functionalities from TechGPT-1.0, it exhibits robust text processing capabilities, particularly in the domains of medicine and law. Furthermore, we introduce new capabilities to the model, enabling it to process texts in various domains such as geographical areas, transportation, organizations, literary works, biology, natural sciences, astronomical objects, and architecture. These enhancements also fortified the model's adeptness in handling hallucinations, unanswerable queries, and lengthy texts. This report provides a comprehensive and detailed introduction to the full fine-tuning process on Huawei's Ascend servers, encompassing experiences in Ascend server debugging, instruction fine-tuning data processing, and model training. Our code is available at https://github.com/neukg/TechGPT-2.0
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
Analytical Solutions to the Navier-Stokes-Poisson Equations with Density-dependent Viscosity and with Pressure
Authors:
Yeung Ling Hei,
Yuen Manwai
Abstract:
We study some particular solutions to the Navier-Stokes-Poisson equations with density-dependent viscosity and with pressure, in radial symmetry. With extension of the previous known blowup solutions for the Euler-Poisson equations / pressureless Navier-Stokes-Poisson with density-dependent viscosity, we constructed the corresponding analytical blowup solutions for the Navier-Stokes-Poisson Equati…
▽ More
We study some particular solutions to the Navier-Stokes-Poisson equations with density-dependent viscosity and with pressure, in radial symmetry. With extension of the previous known blowup solutions for the Euler-Poisson equations / pressureless Navier-Stokes-Poisson with density-dependent viscosity, we constructed the corresponding analytical blowup solutions for the Navier-Stokes-Poisson Equations with density-dependent viscosity and with pressure.
△ Less
Submitted 5 May, 2010;
originally announced May 2010.