Search | arXiv e-print repository

doi 10.1016/j.knosys.2022.110168

Data and Knowledge Co-driving for Cancer Subtype Classification on Multi-Scale Histopathological Slides

Authors: Bo Yu, Hechang Chen, Yunke Zhang, Lele Cong, Shuchao Pang, Hongren Zhou, Ziye Wang, Xianling Cong

Abstract: Artificial intelligence-enabled histopathological data analysis has become a valuable assistant to the pathologist. However, existing models lack representation and inference abilities compared with those of pathologists, especially in cancer subtype diagnosis, which is unconvincing in clinical practice. For instance, pathologists typically observe the lesions of a slide from global to local, and… ▽ More Artificial intelligence-enabled histopathological data analysis has become a valuable assistant to the pathologist. However, existing models lack representation and inference abilities compared with those of pathologists, especially in cancer subtype diagnosis, which is unconvincing in clinical practice. For instance, pathologists typically observe the lesions of a slide from global to local, and then can give a diagnosis based on their knowledge and experience. In this paper, we propose a Data and Knowledge Co-driving (D&K) model to replicate the process of cancer subtype classification on a histopathological slide like a pathologist. Specifically, in the data-driven module, the bagging mechanism in ensemble learning is leveraged to integrate the histological features from various bags extracted by the embedding representation unit. Furthermore, a knowledge-driven module is established based on the Gestalt principle in psychology to build the three-dimensional (3D) expert knowledge space and map histological features into this space for metric. Then, the diagnosis can be made according to the Euclidean distance between them. Extensive experimental results on both public and in-house datasets demonstrate that the D&K model has a high performance and credible results compared with the state-of-the-art methods for diagnosing histopathological subtypes. Code: https://github.com/Dennis-YB/Data-and-Knowledge-Co-driving-for-Cancer-Subtypes-Classification △ Less

Submitted 18 April, 2023; originally announced April 2023.

Journal ref: [J]. Knowledge-Based Systems, 2023, 260: 110168

arXiv:2304.08354 [pdf, other]

Tool Learning with Foundation Models

Authors: Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu , et al. (16 additional authors not shown)

Abstract: Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced a… ▽ More Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced accuracy, efficiency, and automation in problem-solving. Despite its immense potential, there is still a lack of a comprehensive understanding of key challenges, opportunities, and future endeavors in this field. To this end, we present a systematic investigation of tool learning in this paper. We first introduce the background of tool learning, including its cognitive origins, the paradigm shift of foundation models, and the complementary roles of tools and models. Then we recapitulate existing tool learning research into tool-augmented and tool-oriented learning. We formulate a general tool learning framework: starting from understanding the user instruction, models should learn to decompose a complex task into several subtasks, dynamically adjust their plan through reasoning, and effectively conquer each sub-task by selecting appropriate tools. We also discuss how to train models for improved tool-use capabilities and facilitate the generalization in tool learning. Considering the lack of a systematic tool learning evaluation in prior works, we experiment with 18 representative tools and show the potential of current foundation models in skillfully utilizing tools. Finally, we discuss several open problems that require further investigation for tool learning. In general, we hope this paper could inspire future research in integrating tools with foundation models. △ Less

Submitted 6 August, 2024; v1 submitted 17 April, 2023; originally announced April 2023.

arXiv:2304.03891 [pdf, other]

doi 10.1145/3511808.3557262

Contrastive Cross-Domain Sequential Recommendation

Authors: Jiangxia Cao, Xin Cong, Jiawei Sheng, Tingwen Liu, Bin Wang

Abstract: Cross-Domain Sequential Recommendation (CDSR) aims to predict future interactions based on user's historical sequential interactions from multiple domains. Generally, a key challenge of CDSR is how to mine precise cross-domain user preference based on the intra-sequence and inter-sequence item interactions. Existing works first learn single-domain user preference only with intra-sequence item inte… ▽ More Cross-Domain Sequential Recommendation (CDSR) aims to predict future interactions based on user's historical sequential interactions from multiple domains. Generally, a key challenge of CDSR is how to mine precise cross-domain user preference based on the intra-sequence and inter-sequence item interactions. Existing works first learn single-domain user preference only with intra-sequence item interactions, and then build a transferring module to obtain cross-domain user preference. However, such a pipeline and implicit solution can be severely limited by the bottleneck of the designed transferring module, and ignores to consider inter-sequence item relationships. In this paper, we propose C^2DSR to tackle the above problems to capture precise user preferences. The main idea is to simultaneously leverage the intra- and inter- sequence item relationships, and jointly learn the single- and cross- domain user preferences. Specifically, we first utilize a graph neural network to mine inter-sequence item collaborative relationship, and then exploit sequential attentive encoder to capture intra-sequence item sequential relationship. Based on them, we devise two different sequential training objectives to obtain user single-domain and cross-domain representations. Furthermore, we present a novel contrastive cross-domain infomax objective to enhance the correlation between single- and cross- domain user representations by maximizing their mutual information. To validate the effectiveness of C^2DSR, we first re-split four e-comerce datasets, and then conduct extensive experiments to demonstrate the effectiveness of our approach C^2DSR. △ Less

Submitted 7 April, 2023; originally announced April 2023.

Comments: This paper has been accepted by CIKM 2022

arXiv:2304.02328 [pdf, other]

doi 10.1109/TASLP.2023.3345146

Enhancing Multimodal Entity and Relation Extraction with Variational Information Bottleneck

Authors: Shiyao Cui, Jiangxia Cao, Xin Cong, Jiawei Sheng, Quangang Li, Tingwen Liu, Jinqiao Shi

Abstract: This paper studies the multimodal named entity recognition (MNER) and multimodal relation extraction (MRE), which are important for multimedia social platform analysis. The core of MNER and MRE lies in incorporating evident visual information to enhance textual semantics, where two issues inherently demand investigations. The first issue is modality-noise, where the task-irrelevant information in… ▽ More This paper studies the multimodal named entity recognition (MNER) and multimodal relation extraction (MRE), which are important for multimedia social platform analysis. The core of MNER and MRE lies in incorporating evident visual information to enhance textual semantics, where two issues inherently demand investigations. The first issue is modality-noise, where the task-irrelevant information in each modality may be noises misleading the task prediction. The second issue is modality-gap, where representations from different modalities are inconsistent, preventing from building the semantic alignment between the text and image. To address these issues, we propose a novel method for MNER and MRE by Multi-Modal representation learning with Information Bottleneck (MMIB). For the first issue, a refinement-regularizer probes the information-bottleneck principle to balance the predictive evidence and noisy information, yielding expressive representations for prediction. For the second issue, an alignment-regularizer is proposed, where a mutual information-based item works in a contrastive manner to regularize the consistent text-image representations. To our best knowledge, we are the first to explore variational IB estimation for MNER and MRE. Experiments show that MMIB achieves the state-of-the-art performances on three public benchmarks. △ Less

Submitted 5 April, 2023; originally announced April 2023.

Journal ref: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2024

arXiv:2303.17255 [pdf, other]

Fooling the Image Dehazing Models by First Order Gradient

Authors: Jie Gui, Xiaofeng Cong, Chengwei Peng, Yuan Yan Tang, James Tin-Yau Kwok

Abstract: The research on the single image dehazing task has been widely explored. However, as far as we know, no comprehensive study has been conducted on the robustness of the well-trained dehazing models. Therefore, there is no evidence that the dehazing networks can resist malicious attacks. In this paper, we focus on designing a group of attack methods based on first order gradient to verify the robust… ▽ More The research on the single image dehazing task has been widely explored. However, as far as we know, no comprehensive study has been conducted on the robustness of the well-trained dehazing models. Therefore, there is no evidence that the dehazing networks can resist malicious attacks. In this paper, we focus on designing a group of attack methods based on first order gradient to verify the robustness of the existing dehazing algorithms. By analyzing the general purpose of image dehazing task, four attack methods are proposed, which are predicted dehazed image attack, hazy layer mask attack, haze-free image attack and haze-preserved attack. The corresponding experiments are conducted on six datasets with different scales. Further, the defense strategy based on adversarial training is adopted for reducing the negative effects caused by malicious attacks. In summary, this paper defines a new challenging problem for the image dehazing area, which can be called as adversarial attack on dehazing networks (AADN). Code and Supplementary Material are available at https://github.com/Xiaofeng-life/AADN Dehazing. △ Less

Submitted 15 February, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

Comments: This paper is accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

arXiv:2301.11621 [pdf, other]

Event Causality Extraction with Event Argument Correlations

Authors: Shiyao Cui, Jiawei Sheng, Xin Cong, QuanGang Li, Tingwen Liu, Jinqiao Shi

Abstract: Event Causality Identification (ECI), which aims to detect whether a causality relation exists between two given textual events, is an important task for event causality understanding. However, the ECI task ignores crucial event structure and cause-effect causality component information, making it struggle for downstream applications. In this paper, we explore a novel task, namely Event Causality… ▽ More Event Causality Identification (ECI), which aims to detect whether a causality relation exists between two given textual events, is an important task for event causality understanding. However, the ECI task ignores crucial event structure and cause-effect causality component information, making it struggle for downstream applications. In this paper, we explore a novel task, namely Event Causality Extraction (ECE), aiming to extract the cause-effect event causality pairs with their structured event information from plain texts. The ECE task is more challenging since each event can contain multiple event arguments, posing fine-grained correlations between events to decide the causeeffect event pair. Hence, we propose a method with a dual grid tagging scheme to capture the intra- and inter-event argument correlations for ECE. Further, we devise a event type-enhanced model architecture to realize the dual grid tagging scheme. Experiments demonstrate the effectiveness of our method, and extensive analyses point out several future directions for ECE. △ Less

Submitted 27 January, 2023; originally announced January 2023.

Comments: Accepted to COLING2022

arXiv:2203.16863 [pdf, other]

Cross-Domain Recommendation to Cold-Start Users via Variational Information Bottleneck

Authors: Jiangxia Cao, Jiawei Sheng, Xin Cong, Tingwen Liu, Bin Wang

Abstract: Recommender systems have been widely deployed in many real-world applications, but usually suffer from the long-standing user cold-start problem. As a promising way, Cross-Domain Recommendation (CDR) has attracted a surge of interest, which aims to transfer the user preferences observed in the source domain to make recommendations in the target domain. Previous CDR approaches mostly achieve the go… ▽ More Recommender systems have been widely deployed in many real-world applications, but usually suffer from the long-standing user cold-start problem. As a promising way, Cross-Domain Recommendation (CDR) has attracted a surge of interest, which aims to transfer the user preferences observed in the source domain to make recommendations in the target domain. Previous CDR approaches mostly achieve the goal by following the Embedding and Mapping (EMCDR) idea which attempts to learn a mapping function to transfer the pre-trained user representations (embeddings) from the source domain into the target domain. However, they pre-train the user/item representations independently for each domain, ignoring to consider both domain interactions simultaneously. Therefore, the biased pre-trained representations inevitably involve the domain-specific information which may lead to negative impact to transfer information across domains. In this work, we consider a key point of the CDR task: what information needs to be shared across domains? To achieve the above idea, this paper utilizes the information bottleneck (IB) principle, and proposes a novel approach termed as CDRIB to enforce the representations encoding the domain-shared information. To derive the unbiased representations, we devise two IB regularizers to model the cross-domain/in-domain user-item interactions simultaneously and thereby CDRIB could consider both domain interactions jointly for de-biasing. △ Less

Submitted 31 March, 2022; originally announced March 2022.

Comments: This paper has been accepted by ICDE 2022

arXiv:2202.03092 [pdf, other]

Document-Level Event Extraction via Human-Like Reading Process

Authors: Shiyao Cui, Xin Cong, Bowen Yu, Tingwen Liu, Yucheng Wang, Jinqiao Shi

Abstract: Document-level Event Extraction (DEE) is particularly tricky due to the two challenges it poses: scattering-arguments and multi-events. The first challenge means that arguments of one event record could reside in different sentences in the document, while the second one reflects one document may simultaneously contain multiple such event records. Motivated by humans' reading cognitive to extract i… ▽ More Document-level Event Extraction (DEE) is particularly tricky due to the two challenges it poses: scattering-arguments and multi-events. The first challenge means that arguments of one event record could reside in different sentences in the document, while the second one reflects one document may simultaneously contain multiple such event records. Motivated by humans' reading cognitive to extract information of interests, in this paper, we propose a method called HRE (Human Reading inspired Extractor for Document Events), where DEE is decomposed into these two iterative stages, rough reading and elaborate reading. Specifically, the first stage browses the document to detect the occurrence of events, and the second stage serves to extract specific event arguments. For each concrete event role, elaborate reading hierarchically works from sentences to characters to locate arguments across sentences, thus the scattering-arguments problem is tackled. Meanwhile, rough reading is explored in a multi-round manner to discover undetected events, thus the multi-events problem is handled. Experiment results show the superiority of HRE over prior competitive methods. △ Less

Submitted 7 February, 2022; originally announced February 2022.

Comments: To apper in ICASSP2022

arXiv:2111.00884 [pdf, other]

Enhanced Language Representation with Label Knowledge for Span Extraction

Authors: Pan Yang, Xin Cong, Zhenyun Sun, Xingwu Liu

Abstract: Span extraction, aiming to extract text spans (such as words or phrases) from plain texts, is a fundamental process in Information Extraction. Recent works introduce the label knowledge to enhance the text representation by formalizing the span extraction task into a question answering problem (QA Formalization), which achieves state-of-the-art performance. However, QA Formalization does not fully… ▽ More Span extraction, aiming to extract text spans (such as words or phrases) from plain texts, is a fundamental process in Information Extraction. Recent works introduce the label knowledge to enhance the text representation by formalizing the span extraction task into a question answering problem (QA Formalization), which achieves state-of-the-art performance. However, QA Formalization does not fully exploit the label knowledge and suffers from low efficiency in training/inference. To address those problems, we introduce a new paradigm to integrate label knowledge and further propose a novel model to explicitly and efficiently integrate label knowledge into text representations. Specifically, it encodes texts and label annotations independently and then integrates label knowledge into text representation with an elaborate-designed semantics fusion module. We conduct extensive experiments on three typical span extraction tasks: flat NER, nested NER, and event detection. The empirical results show that 1) our method achieves state-of-the-art performance on four benchmarks, and 2) reduces training time and inference time by 76% and 77% on average, respectively, compared with the QA Formalization paradigm. Our code and data are available at https://github.com/Akeepers/LEAR. △ Less

Submitted 1 November, 2021; originally announced November 2021.

Comments: Accepted to the main conference of EMNLP 2021 (long paper)

arXiv:2107.03573 [pdf, other]

Deep Structural Point Process for Learning Temporal Interaction Networks

Authors: Jiangxia Cao, Xixun Lin, Xin Cong, Shu Guo, Hengzhu Tang, Tingwen Liu, Bin Wang

Abstract: This work investigates the problem of learning temporal interaction networks. A temporal interaction network consists of a series of chronological interactions between users and items. Previous methods tackle this problem by using different variants of recurrent neural networks to model sequential interactions, which fail to consider the structural information of temporal interaction networks and… ▽ More This work investigates the problem of learning temporal interaction networks. A temporal interaction network consists of a series of chronological interactions between users and items. Previous methods tackle this problem by using different variants of recurrent neural networks to model sequential interactions, which fail to consider the structural information of temporal interaction networks and inevitably lead to sub-optimal results. To this end, we propose a novel Deep Structural Point Process termed as DSPP for learning temporal interaction networks. DSPP simultaneously incorporates the topological structure and long-range dependency structure into our intensity function to enhance model expressiveness. To be specific, by using the topological structure as a strong prior, we first design a topological fusion encoder to obtain node embeddings. An attentive shift encoder is then developed to learn the long-range dependency structure between users and items in continuous time. The proposed two modules enable our model to capture the user-item correlation and dynamic influence in temporal interaction networks. DSPP is evaluated on three real-world datasets for both tasks of item prediction and time prediction. Extensive experiments demonstrate that our model achieves consistent and significant improvements over state-of-the-art baselines. △ Less

Submitted 7 July, 2021; originally announced July 2021.

Comments: Accepted by ECML/PKDD 2021, 16 pages, 2 figures

arXiv:2106.03323 [pdf, other]

A Comprehensive Survey and Taxonomy on Single Image Dehazing Based on Deep Learning

Authors: Jie Gui, Xiaofeng Cong, Yuan Cao, Wenqi Ren, Jun Zhang, Jing Zhang, Jiuxin Cao, Dacheng Tao

Abstract: With the development of convolutional neural networks, hundreds of deep learning based dehazing methods have been proposed. In this paper, we provide a comprehensive survey on supervised, semi-supervised, and unsupervised single image dehazing. We first discuss the physical model, datasets, network modules, loss functions, and evaluation metrics that are commonly used. Then, the main contributions… ▽ More With the development of convolutional neural networks, hundreds of deep learning based dehazing methods have been proposed. In this paper, we provide a comprehensive survey on supervised, semi-supervised, and unsupervised single image dehazing. We first discuss the physical model, datasets, network modules, loss functions, and evaluation metrics that are commonly used. Then, the main contributions of various dehazing algorithms are categorized and summarized. Further, quantitative and qualitative experiments of various baseline methods are carried out. Finally, the unsolved issues and challenges that can inspire the future research are pointed out. A collection of useful dehazing materials is available at \url{https://github.com/Xiaofeng-life/AwesomeDehazing}. △ Less

Submitted 20 December, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

Comments: This paper is accepted by ACM Computing Surveys

arXiv:2104.03132 [pdf]

Phonon-related monochromatic THz radiation and its magneto-modulation in 2D ferromagnetic Cr2Ge2Te6

Authors: Long Cheng, Huiping Li, Gaoting Lin, Jian Yan, Lei Zhang, Cheng Yang, Wei Tong, Zhuang Ren, Wang Zhu, Xin Cong, Jingjing Gao, Pingheng Tan, Xuan Luo, Yuping sun, Wenguang Zhu, Zhigao Sheng

Abstract: Searching multiple types of terahertz (THz) irradiation source is crucial for the THz technology. Here, by utilizing a two-dimensional (2D) ferromagnetic Cr2Ge2Te6 crystal, we firstly demonstrate a magneto-tunable monochromatic THz irradiation source. With a low-photonic-energy broadband THz pump, a strong THz irradiation with frequency ~0.9 THz and bandwidth ~0.25 THz can be generated and its con… ▽ More Searching multiple types of terahertz (THz) irradiation source is crucial for the THz technology. Here, by utilizing a two-dimensional (2D) ferromagnetic Cr2Ge2Te6 crystal, we firstly demonstrate a magneto-tunable monochromatic THz irradiation source. With a low-photonic-energy broadband THz pump, a strong THz irradiation with frequency ~0.9 THz and bandwidth ~0.25 THz can be generated and its conversion efficiency could even reach 2.1% at 160 K. Moreover, it is intriguing to find that such monochromatic THz irradiation can be efficiently modulated by the magnetic field below 160 K. According to both experimental and theoretical analyses, the emergent THz irradiation is identified as the emission from the phonon-polariton and its temperature and magnetic field dependent behaviors confirmed the large spin-lattice coupling in this 2D ferromagnetic crystal. These observations provide a new route for the creation of tunable monochromatic THz source which may have great practical interests in future applications in photonic and spintronic devices. △ Less

Submitted 12 May, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

Comments: 35 pages,11 figures

MSC Class: 78-05

arXiv:2012.02353 [pdf, other]

Few-Shot Event Detection with Prototypical Amortized Conditional Random Field

Authors: Xin Cong, Shiyao Cui, Bowen Yu, Tingwen Liu, Yubin Wang, Bin Wang

Abstract: Event detection tends to struggle when it needs to recognize novel event types with a few samples. The previous work attempts to solve this problem in the identify-then-classify manner but ignores the trigger discrepancy between event types, thus suffering from the error propagation. In this paper, we present a novel unified model which converts the task to a few-shot tagging problem with a double… ▽ More Event detection tends to struggle when it needs to recognize novel event types with a few samples. The previous work attempts to solve this problem in the identify-then-classify manner but ignores the trigger discrepancy between event types, thus suffering from the error propagation. In this paper, we present a novel unified model which converts the task to a few-shot tagging problem with a double-part tagging scheme. To this end, we first propose the Prototypical Amortized Conditional Random Field (PA-CRF) to model the label dependency in the few-shot scenario, which approximates the transition scores between labels based on the label prototypes. Then Gaussian distribution is introduced for modeling of the transition scores to alleviate the uncertain estimation resulting from insufficient data. Experimental results show that the unified models work better than existing identify-then-classify models and our PA-CRF further achieves the best results on the benchmark dataset FewEvent. Our code and data are available at http://github.com/congxin95/PA-CRF. △ Less

Submitted 24 May, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

Comments: Accepted at ACL 2021

arXiv:2012.01878 [pdf, other]

doi 10.1007/s11390-023-1541-6

Label Enhanced Event Detection with Heterogeneous Graph Attention Networks

Authors: Shiyao Cui, Bowen Yu, Xin Cong, Tingwen Liu, Quangang Li, Jinqiao Shi

Abstract: Event Detection (ED) aims to recognize instances of specified types of event triggers in text. Different from English ED, Chinese ED suffers from the problem of word-trigger mismatch due to the uncertain word boundaries. Existing approaches injecting word information into character-level models have achieved promising progress to alleviate this problem, but they are limited by two issues. First, t… ▽ More Event Detection (ED) aims to recognize instances of specified types of event triggers in text. Different from English ED, Chinese ED suffers from the problem of word-trigger mismatch due to the uncertain word boundaries. Existing approaches injecting word information into character-level models have achieved promising progress to alleviate this problem, but they are limited by two issues. First, the interaction between characters and lexicon words is not fully exploited. Second, they ignore the semantic information provided by event labels. We thus propose a novel architecture named Label enhanced Heterogeneous Graph Attention Networks (L-HGAT). Specifically, we transform each sentence into a graph, where character nodes and word nodes are connected with different types of edges, so that the interaction between words and characters is fully reserved. A heterogeneous graph attention networks is then introduced to propagate relational message and enrich information interaction. Furthermore, we convert each label into a trigger-prototype-based embedding, and design a margin loss to guide the model distinguish confusing event labels. Experiments on two benchmark datasets show that our model achieves significant improvement over a range of competitive baseline methods. △ Less

Submitted 3 December, 2020; originally announced December 2020.

Journal ref: Journal of Computer Science and Technology 2023

arXiv:2009.12822 [pdf]

doi 10.1038/s41563-020-00806-2

Towards Chirality Control of Graphene Nanoribbons Embedded in Hexagonal Boron Nitride

Authors: Hui Shan Wang, Lingxiu Chen, Kenan Elibol, Li He, Haomin Wang, Chen Chen, Chengxin Jiang, Chen Li, Tianru Wu, Chun Xiao Cong, Timothy J. Pennycook, Giacomo Argentero, Daoli Zhang, Kenji Watanabe, Takashi Taniguchi, Wenya Wei, Qinghong Yuan, Jannik C. Meyer, Xiaoming Xie

Abstract: The integrated inplane growth of two dimensional materials with similar lattices, but distinct electrical properties, could provide a promising route to achieve integrated circuitry of atomic thickness. However, fabrication of edge specific GNR in the lattice of hBN still remains an enormous challenge for present approaches. Here we developed a two step growth method and successfully achieved sub… ▽ More The integrated inplane growth of two dimensional materials with similar lattices, but distinct electrical properties, could provide a promising route to achieve integrated circuitry of atomic thickness. However, fabrication of edge specific GNR in the lattice of hBN still remains an enormous challenge for present approaches. Here we developed a two step growth method and successfully achieved sub 5 nm wide zigzag and armchair GNRs embedded in hBN, respectively. Further transport measurements reveal that the sub 7 nm wide zigzag GNRs exhibit openings of the band gap inversely proportional to their width, while narrow armchair GNRs exhibit some fluctuation in the bandgap width relationship.This integrated lateral growth of edge specific GNRs in hBN brings semiconducting building blocks to atomically thin layer, and will provide a promising route to achieve intricate nanoscale electrical circuits on high quality insulating hBN substrates. △ Less

Submitted 27 September, 2020; originally announced September 2020.

Comments: 84 pages,46 figures

Journal ref: Nature Materials, 2020

arXiv:2009.12072 [pdf, other]

AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results

Authors: Pengxu Wei, Hannan Lu, Radu Timofte, Liang Lin, Wangmeng Zuo, Zhihong Pan, Baopu Li, Teng Xi, Yanwen Fan, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Tangxin Xie, Liang Cao, Yan Zou, Yi Shen, Jialiang Zhang, Yu Jia, Kaihua Cheng, Chenhuan Wu, Yue Lin, Cen Liu, Yunbo Peng, Xueyi Zou , et al. (51 additional authors not shown)

Abstract: This paper introduces the real image Super-Resolution (SR) challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2020. This challenge involves three tracks to super-resolve an input image for $\times$2, $\times$3 and $\times$4 scaling factors, respectively. The goal is to attract more attention to realistic image degradation for the SR task, wh… ▽ More This paper introduces the real image Super-Resolution (SR) challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2020. This challenge involves three tracks to super-resolve an input image for $\times$2, $\times$3 and $\times$4 scaling factors, respectively. The goal is to attract more attention to realistic image degradation for the SR task, which is much more complicated and challenging, and contributes to real-world image super-resolution applications. 452 participants were registered for three tracks in total, and 24 teams submitted their results. They gauge the state-of-the-art approaches for real image SR in terms of PSNR and SSIM. △ Less

Submitted 25 September, 2020; originally announced September 2020.

Journal ref: European Conference on Computer Vision Workshops, 2020

arXiv:2009.00409 [pdf]

doi 10.1021/acs.jpclett.0c03320

Electronic Raman Scattering in Suspended Semiconducting Carbon Nanotubes

Authors: Yuecong Hu, Shaochuang Chen, Daqi Zhang, Xin Cong, Sida Sun, Jiangbin Wu, Feng Yang, Juan Yang, Ping-heng Tan, Yan Li

Abstract: The electronic Raman scattering (ERS) features of single-walled carbon nanotubes (SWNTs) can reveal a wealth of information about their electronic structures, but have previously been thought to appear exclusively in metallic (M-) but not in semiconducting (S-) SWNTs. We report the experimental observation of the ERS features with an accuracy of 1 meV in suspended S-SWNTs, the processes of which a… ▽ More The electronic Raman scattering (ERS) features of single-walled carbon nanotubes (SWNTs) can reveal a wealth of information about their electronic structures, but have previously been thought to appear exclusively in metallic (M-) but not in semiconducting (S-) SWNTs. We report the experimental observation of the ERS features with an accuracy of 1 meV in suspended S-SWNTs, the processes of which are accomplished via the available high-energy electron-hole pairs. The ERS features can facilitate further systematic studies on the properties of SWNT, both metallic and semiconducting, with defined chirality. △ Less

Submitted 3 September, 2020; v1 submitted 1 September, 2020; originally announced September 2020.

Comments: 12 pages, 4 figures and 1 table

arXiv:2008.03851 [pdf, ps, other]

doi 10.1016/j.scib.2020.08.008

Understanding angle-resolved polarized Raman scattering from black phosphorus at normal and oblique laser incidences

Authors: Miao-Ling Lin, Yu-Chen Leng, Xin Cong, Da Meng, Jiahong Wang, Xiao-Li Li, Binlu Yu, Xue-Lu Liu, Xue-Feng Yu, Ping-Heng Tan

Abstract: The selection rule for angle-resolved polarized Raman (ARPR) intensity of phonons from standard group-theoretical method in isotropic materials would break down in anisotropic layered materials (ALMs) due to birefringence and linear dichroism effects. The two effects result in depth-dependent polarization and intensity of incident laser and scattered signal inside ALMs and thus make a challenge to… ▽ More The selection rule for angle-resolved polarized Raman (ARPR) intensity of phonons from standard group-theoretical method in isotropic materials would break down in anisotropic layered materials (ALMs) due to birefringence and linear dichroism effects. The two effects result in depth-dependent polarization and intensity of incident laser and scattered signal inside ALMs and thus make a challenge to predict ARPR intensity at any laser incidence direction. Herein, taking in-plane anisotropic black phosphorus as a prototype, we developed a so-called birefringence-linear-dichroism (BLD) model to quantitatively understand its ARPR intensity at both normal and oblique laser incidences by the same set of real Raman tensors for certain laser excitation. No fitting parameter is needed, once the birefringence and linear dichroism effects are considered with the complex refractive indexes. An approach was proposed to experimentally determine real Raman tensor and complex refractive indexes, respectively, from the relative Raman intensity along its principle axes and incident-angle resolved reflectivity by Fresnel$'$s law. The results suggest that the previously reported ARPR intensity of ultrathin ALM flakes deposited on a multilayered substrate at normal laser incidence can be also understood based on the BLD model by considering the depth-dependent polarization and intensity of incident laser and scattered Raman signal induced by both birefringence and linear dichroism effects within ALM flakes and the interference effects in the multilayered structures, which are dependent on the excitation wavelength, thickness of ALM flakes and dielectric layers of the substrate. This work can be generally applicable to any opaque anisotropic crystals, offering a promising route to predict and manipulate the polarized behaviors of related phonons. △ Less

Submitted 22 August, 2020; v1 submitted 9 August, 2020; originally announced August 2020.

Comments: Accepted by Science Bulletin

Journal ref: Science Bulletin, 2020, 65(22):1894-1900

arXiv:2006.12816 [pdf, other]

Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering

Authors: Xin Cong, Bowen Yu, Tingwen Liu, Shiyao Cui, Hengzhu Tang, Bin Wang

Abstract: Few-shot classification tends to struggle when it needs to adapt to diverse domains. Due to the non-overlapping label space between domains, the performance of conventional domain adaptation is limited. Previous work tackles the problem in a transductive manner, by assuming access to the full set of test data, which is too restrictive for many real-world applications. In this paper, we set out to… ▽ More Few-shot classification tends to struggle when it needs to adapt to diverse domains. Due to the non-overlapping label space between domains, the performance of conventional domain adaptation is limited. Previous work tackles the problem in a transductive manner, by assuming access to the full set of test data, which is too restrictive for many real-world applications. In this paper, we set out to tackle this issue by introducing a inductive framework, DaFeC, to improve Domain adaptation performance for Few-shot classification via Clustering. We first build a representation extractor to derive features for unlabeled data from the target domain (no test data is necessary) and then group them with a cluster miner. The generated pseudo-labeled data and the labeled source-domain data are used as supervision to update the parameters of the few-shot classifier. In order to derive high-quality pseudo labels, we propose a Clustering Promotion Mechanism, to learn better features for the target domain via Similarity Entropy Minimization and Adversarial Distribution Alignment, which are combined with a Cosine Annealing Strategy. Experiments are performed on the FewRel 2.0 dataset. Our approach outperforms previous work with absolute gains (in classification accuracy) of 4.95%, 9.55%, 3.99% and 11.62%, respectively, under four few-shot settings. △ Less

Submitted 23 June, 2020; originally announced June 2020.

Comments: Accepted by ECML-PKDD 2020

arXiv:2006.00487 [pdf, other]

Multivariate Log-Contrast Regression with Sub-Compositional Predictors: Testing the Association Between Preterm Infants' Gut Microbiome and Neurobehavioral Outcomes

Authors: Xiaokang Liu, Xiaomei Cong, Gen Li, Kendra Maas, Kun Chen

Abstract: The so-called gut-brain axis has stimulated extensive research on microbiomes. One focus is to assess the association between certain clinical outcomes and the relative abundances of gut microbes, which can be presented as sub-compositional data in conformity with the taxonomic hierarchy of bacteria. Motivated by a study for identifying the microbes in the gut microbiome of preterm infants that im… ▽ More The so-called gut-brain axis has stimulated extensive research on microbiomes. One focus is to assess the association between certain clinical outcomes and the relative abundances of gut microbes, which can be presented as sub-compositional data in conformity with the taxonomic hierarchy of bacteria. Motivated by a study for identifying the microbes in the gut microbiome of preterm infants that impact their later neurobehavioral outcomes, we formulate a constrained integrative multi-view regression, where the neurobehavioral scores form multivariate response, the sub-compositional microbiome data form multi-view feature matrices, and a set of linear constraints on their corresponding sub-coefficient matrices ensures the conformity to the simplex geometry. To enable joint selection and inference of sub-compositions/views, we assume all the sub-coefficient matrices are possibly of low-rank, i.e., the outcomes are associated with the microbiome through different sets of latent sub-compositional factors from different taxa. We propose a scaled composite nuclear norm penalization approach for model estimation and develop a hypothesis testing procedure through de-biasing to assess the significance of different views. Simulation studies confirm the effectiveness of the proposed procedure. In the preterm infant study, the identified microbes are mostly consistent with existing studies and biological understandings. Our approach supports that stressful early life experiences imprint gut microbiome through the regulation of the gut-brain axis. △ Less

Submitted 31 May, 2020; originally announced June 2020.

arXiv:1912.12459 [pdf]

doi 10.1002/smll.201905902

Edge-Epitaxial Growth of InSe Nanowires toward High-Performance Photodetectors

Authors: Song Hao, Shengnan Yan, Yang Wang, Tao Xu, Hui Zhang, Xin Cong, Lingfei Li, Xiaowei Liu, Tianjun Cao, Anyuan Gao, Lili Zhang, Lanxin Jia, Mingsheng Long, Weida Hu, Xiaomu Wang, Pingheng Tan, Litao Sun, Xinyi Cui, Shi-Jun Liang, Feng Miao

Abstract: Semiconducting nanowires offer many opportunities for electronic and optoelectronic device applications due to their special geometries and unique physical properties. However, it has been challenging to synthesize semiconducting nanowires directly on SiO2/Si substrate due to lattice mismatch. Here, we developed a catalysis-free approach to achieve direct synthesis of long and straight InSe nanowi… ▽ More Semiconducting nanowires offer many opportunities for electronic and optoelectronic device applications due to their special geometries and unique physical properties. However, it has been challenging to synthesize semiconducting nanowires directly on SiO2/Si substrate due to lattice mismatch. Here, we developed a catalysis-free approach to achieve direct synthesis of long and straight InSe nanowires on SiO2/Si substrate through edge-homoepitaxial growth. We further achieved parallel InSe nanowires on SiO2/Si substrate through controlling growth conditions. We attributed the underlying growth mechanism to selenium self-driven vapor-liquid-solid process, which is distinct from conventional metal-catalytic vapor-liquid-solid method widely used for growing Si and III-V nanowires. Furthermore, we demonstrated that the as-grown InSe nanowire-based visible light photodetector simultaneously possesses an extraordinary photoresponsivity of 271 A/W, ultrahigh detectivity of 1.57*10^14 Jones and a fast response speed of microsecond scale. The excellent performance of the photodetector indicates that as-grown InSe nanowires are promising in future optoelectronic applications. More importantly, the proposed edge-homoepitaxial approach may open up a novel avenue for direct synthesis of semiconducting nanowire arrays on SiO2/Si substrate. △ Less

Submitted 28 December, 2019; originally announced December 2019.

Comments: 19 pages, 4 figures, published in Small

arXiv:1905.09936 [pdf]

Linear Dichroism Conversion in Quasi One-Dimensional Perovskite Chalcogenide

Authors: Jiangbin Wu, Xin Cong, Shanyuan Niu, Fanxin Liu, Huan Zhao, Zhonghao Du, Jayakanth Ravichandran, Ping-Heng Tan, Han Wang

Abstract: Anisotropic photonic materials with linear dichroism are crucial components in many sensing, imaging and communication applications. Such materials play an important role as polarizers, filters and wave-plates in photonic devices and circuits. Conventional crystalline materials with optical anisotropy typically show unidirectional linear dichroism over a broad wavelength range. The linear dichrois… ▽ More Anisotropic photonic materials with linear dichroism are crucial components in many sensing, imaging and communication applications. Such materials play an important role as polarizers, filters and wave-plates in photonic devices and circuits. Conventional crystalline materials with optical anisotropy typically show unidirectional linear dichroism over a broad wavelength range. The linear dichroism conversion phenomenon has not been observed in crystalline materials. Here, we report the investigation of the unique linear dichroism conversion phenomenon in quasi-one-dimensional (quasi-1D) hexagonal perovskite chalcogenide BaTiS3. The material shows record level of optical anisotropy within the visible wavelength range. In contrast to conventional anisotropic optical materials, the linear dichroism polarity in BaTiS3 makes an orthogonal change at an optical wavelength corresponding to the photon energy of 1.78 eV. First principle calculations reveal that this anomalous linear dichroism conversion behavior originates from different selection rules of the optical transitions from the parallel bands in the BaTiS3 material. Wavelength dependent polarized Raman spectroscopy further confirms this phenomenon. Such material with linear dichroism conversion property can facilitate new ability to control and sense the energy and polarization of light, and lead to novel photonic devices such as polarization-wavelength selective detectors and lasers for multispectral imaging, sensing and optical communication applications. △ Less

Submitted 27 May, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

arXiv:1808.02403 [pdf, other]

Log-Contrast Regression with Functional Compositional Predictors: Linking Preterm Infant's Gut Microbiome Trajectories to Neurobehavioral Outcome

Authors: Zhe Sun, Wanli Xu, Xiaomei Cong, Gen Li, Kun Chen

Abstract: The neonatal intensive care unit (NICU) experience is known to be one of the most crucial factors that drive preterm infant's neurodevelopmental and health outcomes. It is hypothesized that stressful early life experience of very preterm neonate is imprinting gut microbiome by the regulation of the so-called brain-gut axis, and consequently, certain microbiome markers are predictive of later infan… ▽ More The neonatal intensive care unit (NICU) experience is known to be one of the most crucial factors that drive preterm infant's neurodevelopmental and health outcomes. It is hypothesized that stressful early life experience of very preterm neonate is imprinting gut microbiome by the regulation of the so-called brain-gut axis, and consequently, certain microbiome markers are predictive of later infant neurodevelopment. To investigate, a preterm infant study was conducted; infant fecal samples were collected during the infants' first month of postnatal age, resulting in functional compositional microbiome data, and neurobehavioral outcomes were measured when infants reached 36-38 weeks of post-menstrual age. To identify potential microbiome markers and estimate how the trajectories of gut microbiome compositions during early postnatal stage impact later neurobehavioral outcomes of the preterm infants, we innovate a sparse log-contrast regression with functional compositional predictors. The functional simplex structure is strictly preserved, and the functional compositional predictors are allowed to have sparse, smoothly varying, and accumulating effects on the outcome through time. Through a pragmatic basis expansion step, the problem boils down to a linearly constrained sparse group regression, for which we develop an efficient algorithm and obtain theoretical performance guarantees. Our approach yields insightful results in the preterm infant study. The identified microbiome markers and the estimated time dynamics of their impact on the neurobehavioral outcome shed light on the linkage between stress accumulation in early postnatal stage and neurodevelopmental process of infants. △ Less

Submitted 8 October, 2019; v1 submitted 7 August, 2018; originally announced August 2018.

MSC Class: 62J02

arXiv:1808.02283 [pdf]

doi 10.1002/adfm.201803746

Low-Temperature Eutectic Synthesis of PtTe2 with Weak Antilocalization and Controlled Layer Thinning

Authors: Song Hao, Junwen Zeng, Tao Xu, Xin Cong, Chenyu Wang, Chenchen Wu, Yaojia Wang, Xiaowei Liu, Tianjun Cao, Guangxu Su, Lanxin Jia, Zhangting Wu, Qian Lin, Lili Zhang, Shengnan Yan, Mengfan Guo, Zhenlin Wang, Pingheng Tan, Litao Sun, Zhenhua Ni, Shi-Jun Liang, Xinyi Cui, Feng Miao

Abstract: Metallic transition metal dichalcogenides (TMDs) have exhibited various exotic physical properties and hold the promise of novel optoelectronic and topological devices applications. However, the synthesis of metallic TMDs is based on gas-phase methods and requires high temperature condition. As an alternative to the gas-phase synthetic approach, lower temperature eutectic liquid-phase synthesis pr… ▽ More Metallic transition metal dichalcogenides (TMDs) have exhibited various exotic physical properties and hold the promise of novel optoelectronic and topological devices applications. However, the synthesis of metallic TMDs is based on gas-phase methods and requires high temperature condition. As an alternative to the gas-phase synthetic approach, lower temperature eutectic liquid-phase synthesis presents a very promising approach with the potential for larger-scale and controllable growth of high-quality thin metallic TMDs single crystals. Herein, we report the first realization of low-temperature eutectic liquid-phase synthesis of type-II Dirac semimetal PtTe2 single crystals with thickness ranging from 2 to 200 nm. The electrical measurement of synthesized PtTe2 reveals a record-high conductivity of as high as 3.3*106 S/m at room temperature. Besides, we experimentally identify the weak antilocalization behavior in the type-II Dirac semimetal PtTe2 for the first time. Furthermore, we develop a simple and general strategy to obtain atomically-thin PtTe2 crystal by thinning as-synthesized bulk samples, which can still retain highly crystalline and exhibits excellent electric conductivity. Our results of controllable and scalable low-temperature eutectic liquid-phase synthesis and layer-by-layer thinning of high-quality thin PtTe2 single crystals offer a simple and general approach for obtaining different thickness metallic TMDs with high-melting point transition metal. △ Less

Submitted 7 August, 2018; originally announced August 2018.

Journal ref: Advanced Functional Materials 2018, 1803746

arXiv:1711.09322 [pdf]

Direct evidences of pentagonal silicon chains and magic clusters

Authors: Shaoxiang Sheng, Runze Ma, Jiangbin Wu, Wenbin Li, Longjuan Kong, Xin Cong, Duanyun Cao, Wenqi Hu, Jun-Wei Luo, Peng Cheng, Ping-Heng Tan, Ying Jiang, Lan Chen, Kehui Wu

Abstract: Pentagon is one of the most beautiful geometric structures in nature, but it is rarely seen simply because five-fold symmetry is mathematically forbidden in a 2D or 3D periodic lattices. Fortunately, pentagon as a structural element is allowed in 1D or 0D systems, since translational symmetry is not necessary there. However, in these systems pentagons only compose a small portion of the structure.… ▽ More Pentagon is one of the most beautiful geometric structures in nature, but it is rarely seen simply because five-fold symmetry is mathematically forbidden in a 2D or 3D periodic lattices. Fortunately, pentagon as a structural element is allowed in 1D or 0D systems, since translational symmetry is not necessary there. However, in these systems pentagons only compose a small portion of the structure. So far, 1D or 0D systems consisting of purely pentagons are still rare. Here, combing high resolution non-contact atomic force microscopy and tip-enhanced Raman spectroscopy, we have directly visualized the pentagon-ring structure in self-assembled Si nanoribbons and magic clusters on Ag(110) substrate. Moreover, chemical fingerprint of Si pentagon was detected in individual Si nanoribbon and clusters by tip-enhanced Raman spectroscopy. This work demonstrates that Si pentagon can be an important element in building silicon nanostructures, which may find potential applications in future nanoelectronics based on silicon. △ Less

Submitted 25 November, 2017; originally announced November 2017.

Comments: 4 figures

arXiv:0812.5014 [pdf]

Fabrication of graphene nanodisk arrays using nanosphere lithography

Authors: C. X. Cong, T. Yu, Z. H. Ni, L. Liu, Z. X. Shen, W. Huang

Abstract: Ordered graphene nanodisk arrays have been successfully fabricated by combining nanosphere lithography (NSL) and reactive ion etching (RIE) processes. The dimension of graphene nanodisks can be effectively tuned by varying the size of polystyrene spheres, which function as masks during RIE. Low voltage scanning electron microscopy shows that the graphene sheet could be readily patterned into per… ▽ More Ordered graphene nanodisk arrays have been successfully fabricated by combining nanosphere lithography (NSL) and reactive ion etching (RIE) processes. The dimension of graphene nanodisks can be effectively tuned by varying the size of polystyrene spheres, which function as masks during RIE. Low voltage scanning electron microscopy shows that the graphene sheet could be readily patterned into periodic disk-like nanostructures by oxygen RIE. Raman mapping and spectroscopy further visualize such nanodisk arrays and reveal the nature of disks are crystalline single layer graphene. This work demonstrates an efficient and manageable way to pattern graphene. Considering the periodicity, nanometer dimension and large edge to body ratio, the graphene nanodisk arrays, such two dimensional assembly of carbon atoms offer intrisic advantages in various electronic and spintronic fabrications. △ Less

Submitted 30 December, 2008; originally announced December 2008.

Comments: 15 pages, 4 figures

Showing 51–76 of 76 results for author: Cong, X