-
Data and Knowledge Co-driving for Cancer Subtype Classification on Multi-Scale Histopathological Slides
Authors:
Bo Yu,
Hechang Chen,
Yunke Zhang,
Lele Cong,
Shuchao Pang,
Hongren Zhou,
Ziye Wang,
Xianling Cong
Abstract:
Artificial intelligence-enabled histopathological data analysis has become a valuable assistant to the pathologist. However, existing models lack representation and inference abilities compared with those of pathologists, especially in cancer subtype diagnosis, which is unconvincing in clinical practice. For instance, pathologists typically observe the lesions of a slide from global to local, and…
▽ More
Artificial intelligence-enabled histopathological data analysis has become a valuable assistant to the pathologist. However, existing models lack representation and inference abilities compared with those of pathologists, especially in cancer subtype diagnosis, which is unconvincing in clinical practice. For instance, pathologists typically observe the lesions of a slide from global to local, and then can give a diagnosis based on their knowledge and experience. In this paper, we propose a Data and Knowledge Co-driving (D&K) model to replicate the process of cancer subtype classification on a histopathological slide like a pathologist. Specifically, in the data-driven module, the bagging mechanism in ensemble learning is leveraged to integrate the histological features from various bags extracted by the embedding representation unit. Furthermore, a knowledge-driven module is established based on the Gestalt principle in psychology to build the three-dimensional (3D) expert knowledge space and map histological features into this space for metric. Then, the diagnosis can be made according to the Euclidean distance between them. Extensive experimental results on both public and in-house datasets demonstrate that the D&K model has a high performance and credible results compared with the state-of-the-art methods for diagnosing histopathological subtypes. Code: https://github.com/Dennis-YB/Data-and-Knowledge-Co-driving-for-Cancer-Subtypes-Classification
△ Less
Submitted 18 April, 2023;
originally announced April 2023.
-
Tool Learning with Foundation Models
Authors:
Yujia Qin,
Shengding Hu,
Yankai Lin,
Weize Chen,
Ning Ding,
Ganqu Cui,
Zheni Zeng,
Yufei Huang,
Chaojun Xiao,
Chi Han,
Yi Ren Fung,
Yusheng Su,
Huadong Wang,
Cheng Qian,
Runchu Tian,
Kunlun Zhu,
Shihao Liang,
Xingyu Shen,
Bokai Xu,
Zhen Zhang,
Yining Ye,
Bowen Li,
Ziwei Tang,
Jing Yi,
Yuzhang Zhu
, et al. (16 additional authors not shown)
Abstract:
Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced a…
▽ More
Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced accuracy, efficiency, and automation in problem-solving. Despite its immense potential, there is still a lack of a comprehensive understanding of key challenges, opportunities, and future endeavors in this field. To this end, we present a systematic investigation of tool learning in this paper. We first introduce the background of tool learning, including its cognitive origins, the paradigm shift of foundation models, and the complementary roles of tools and models. Then we recapitulate existing tool learning research into tool-augmented and tool-oriented learning. We formulate a general tool learning framework: starting from understanding the user instruction, models should learn to decompose a complex task into several subtasks, dynamically adjust their plan through reasoning, and effectively conquer each sub-task by selecting appropriate tools. We also discuss how to train models for improved tool-use capabilities and facilitate the generalization in tool learning. Considering the lack of a systematic tool learning evaluation in prior works, we experiment with 18 representative tools and show the potential of current foundation models in skillfully utilizing tools. Finally, we discuss several open problems that require further investigation for tool learning. In general, we hope this paper could inspire future research in integrating tools with foundation models.
△ Less
Submitted 6 August, 2024; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Contrastive Cross-Domain Sequential Recommendation
Authors:
Jiangxia Cao,
Xin Cong,
Jiawei Sheng,
Tingwen Liu,
Bin Wang
Abstract:
Cross-Domain Sequential Recommendation (CDSR) aims to predict future interactions based on user's historical sequential interactions from multiple domains. Generally, a key challenge of CDSR is how to mine precise cross-domain user preference based on the intra-sequence and inter-sequence item interactions. Existing works first learn single-domain user preference only with intra-sequence item inte…
▽ More
Cross-Domain Sequential Recommendation (CDSR) aims to predict future interactions based on user's historical sequential interactions from multiple domains. Generally, a key challenge of CDSR is how to mine precise cross-domain user preference based on the intra-sequence and inter-sequence item interactions. Existing works first learn single-domain user preference only with intra-sequence item interactions, and then build a transferring module to obtain cross-domain user preference. However, such a pipeline and implicit solution can be severely limited by the bottleneck of the designed transferring module, and ignores to consider inter-sequence item relationships. In this paper, we propose C^2DSR to tackle the above problems to capture precise user preferences. The main idea is to simultaneously leverage the intra- and inter- sequence item relationships, and jointly learn the single- and cross- domain user preferences. Specifically, we first utilize a graph neural network to mine inter-sequence item collaborative relationship, and then exploit sequential attentive encoder to capture intra-sequence item sequential relationship. Based on them, we devise two different sequential training objectives to obtain user single-domain and cross-domain representations. Furthermore, we present a novel contrastive cross-domain infomax objective to enhance the correlation between single- and cross- domain user representations by maximizing their mutual information. To validate the effectiveness of C^2DSR, we first re-split four e-comerce datasets, and then conduct extensive experiments to demonstrate the effectiveness of our approach C^2DSR.
△ Less
Submitted 7 April, 2023;
originally announced April 2023.
-
Enhancing Multimodal Entity and Relation Extraction with Variational Information Bottleneck
Authors:
Shiyao Cui,
Jiangxia Cao,
Xin Cong,
Jiawei Sheng,
Quangang Li,
Tingwen Liu,
Jinqiao Shi
Abstract:
This paper studies the multimodal named entity recognition (MNER) and multimodal relation extraction (MRE), which are important for multimedia social platform analysis. The core of MNER and MRE lies in incorporating evident visual information to enhance textual semantics, where two issues inherently demand investigations. The first issue is modality-noise, where the task-irrelevant information in…
▽ More
This paper studies the multimodal named entity recognition (MNER) and multimodal relation extraction (MRE), which are important for multimedia social platform analysis. The core of MNER and MRE lies in incorporating evident visual information to enhance textual semantics, where two issues inherently demand investigations. The first issue is modality-noise, where the task-irrelevant information in each modality may be noises misleading the task prediction. The second issue is modality-gap, where representations from different modalities are inconsistent, preventing from building the semantic alignment between the text and image. To address these issues, we propose a novel method for MNER and MRE by Multi-Modal representation learning with Information Bottleneck (MMIB). For the first issue, a refinement-regularizer probes the information-bottleneck principle to balance the predictive evidence and noisy information, yielding expressive representations for prediction. For the second issue, an alignment-regularizer is proposed, where a mutual information-based item works in a contrastive manner to regularize the consistent text-image representations. To our best knowledge, we are the first to explore variational IB estimation for MNER and MRE. Experiments show that MMIB achieves the state-of-the-art performances on three public benchmarks.
△ Less
Submitted 5 April, 2023;
originally announced April 2023.
-
Fooling the Image Dehazing Models by First Order Gradient
Authors:
Jie Gui,
Xiaofeng Cong,
Chengwei Peng,
Yuan Yan Tang,
James Tin-Yau Kwok
Abstract:
The research on the single image dehazing task has been widely explored. However, as far as we know, no comprehensive study has been conducted on the robustness of the well-trained dehazing models. Therefore, there is no evidence that the dehazing networks can resist malicious attacks. In this paper, we focus on designing a group of attack methods based on first order gradient to verify the robust…
▽ More
The research on the single image dehazing task has been widely explored. However, as far as we know, no comprehensive study has been conducted on the robustness of the well-trained dehazing models. Therefore, there is no evidence that the dehazing networks can resist malicious attacks. In this paper, we focus on designing a group of attack methods based on first order gradient to verify the robustness of the existing dehazing algorithms. By analyzing the general purpose of image dehazing task, four attack methods are proposed, which are predicted dehazed image attack, hazy layer mask attack, haze-free image attack and haze-preserved attack. The corresponding experiments are conducted on six datasets with different scales. Further, the defense strategy based on adversarial training is adopted for reducing the negative effects caused by malicious attacks. In summary, this paper defines a new challenging problem for the image dehazing area, which can be called as adversarial attack on dehazing networks (AADN). Code and Supplementary Material are available at https://github.com/Xiaofeng-life/AADN Dehazing.
△ Less
Submitted 15 February, 2024; v1 submitted 30 March, 2023;
originally announced March 2023.
-
Event Causality Extraction with Event Argument Correlations
Authors:
Shiyao Cui,
Jiawei Sheng,
Xin Cong,
QuanGang Li,
Tingwen Liu,
Jinqiao Shi
Abstract:
Event Causality Identification (ECI), which aims to detect whether a causality relation exists between two given textual events, is an important task for event causality understanding. However, the ECI task ignores crucial event structure and cause-effect causality component information, making it struggle for downstream applications. In this paper, we explore a novel task, namely Event Causality…
▽ More
Event Causality Identification (ECI), which aims to detect whether a causality relation exists between two given textual events, is an important task for event causality understanding. However, the ECI task ignores crucial event structure and cause-effect causality component information, making it struggle for downstream applications. In this paper, we explore a novel task, namely Event Causality Extraction (ECE), aiming to extract the cause-effect event causality pairs with their structured event information from plain texts. The ECE task is more challenging since each event can contain multiple event arguments, posing fine-grained correlations between events to decide the causeeffect event pair. Hence, we propose a method with a dual grid tagging scheme to capture the intra- and inter-event argument correlations for ECE. Further, we devise a event type-enhanced model architecture to realize the dual grid tagging scheme. Experiments demonstrate the effectiveness of our method, and extensive analyses point out several future directions for ECE.
△ Less
Submitted 27 January, 2023;
originally announced January 2023.
-
Cross-Domain Recommendation to Cold-Start Users via Variational Information Bottleneck
Authors:
Jiangxia Cao,
Jiawei Sheng,
Xin Cong,
Tingwen Liu,
Bin Wang
Abstract:
Recommender systems have been widely deployed in many real-world applications, but usually suffer from the long-standing user cold-start problem. As a promising way, Cross-Domain Recommendation (CDR) has attracted a surge of interest, which aims to transfer the user preferences observed in the source domain to make recommendations in the target domain. Previous CDR approaches mostly achieve the go…
▽ More
Recommender systems have been widely deployed in many real-world applications, but usually suffer from the long-standing user cold-start problem. As a promising way, Cross-Domain Recommendation (CDR) has attracted a surge of interest, which aims to transfer the user preferences observed in the source domain to make recommendations in the target domain. Previous CDR approaches mostly achieve the goal by following the Embedding and Mapping (EMCDR) idea which attempts to learn a mapping function to transfer the pre-trained user representations (embeddings) from the source domain into the target domain. However, they pre-train the user/item representations independently for each domain, ignoring to consider both domain interactions simultaneously. Therefore, the biased pre-trained representations inevitably involve the domain-specific information which may lead to negative impact to transfer information across domains. In this work, we consider a key point of the CDR task: what information needs to be shared across domains? To achieve the above idea, this paper utilizes the information bottleneck (IB) principle, and proposes a novel approach termed as CDRIB to enforce the representations encoding the domain-shared information. To derive the unbiased representations, we devise two IB regularizers to model the cross-domain/in-domain user-item interactions simultaneously and thereby CDRIB could consider both domain interactions jointly for de-biasing.
△ Less
Submitted 31 March, 2022;
originally announced March 2022.
-
Document-Level Event Extraction via Human-Like Reading Process
Authors:
Shiyao Cui,
Xin Cong,
Bowen Yu,
Tingwen Liu,
Yucheng Wang,
Jinqiao Shi
Abstract:
Document-level Event Extraction (DEE) is particularly tricky due to the two challenges it poses: scattering-arguments and multi-events. The first challenge means that arguments of one event record could reside in different sentences in the document, while the second one reflects one document may simultaneously contain multiple such event records. Motivated by humans' reading cognitive to extract i…
▽ More
Document-level Event Extraction (DEE) is particularly tricky due to the two challenges it poses: scattering-arguments and multi-events. The first challenge means that arguments of one event record could reside in different sentences in the document, while the second one reflects one document may simultaneously contain multiple such event records. Motivated by humans' reading cognitive to extract information of interests, in this paper, we propose a method called HRE (Human Reading inspired Extractor for Document Events), where DEE is decomposed into these two iterative stages, rough reading and elaborate reading. Specifically, the first stage browses the document to detect the occurrence of events, and the second stage serves to extract specific event arguments. For each concrete event role, elaborate reading hierarchically works from sentences to characters to locate arguments across sentences, thus the scattering-arguments problem is tackled. Meanwhile, rough reading is explored in a multi-round manner to discover undetected events, thus the multi-events problem is handled. Experiment results show the superiority of HRE over prior competitive methods.
△ Less
Submitted 7 February, 2022;
originally announced February 2022.
-
Enhanced Language Representation with Label Knowledge for Span Extraction
Authors:
Pan Yang,
Xin Cong,
Zhenyun Sun,
Xingwu Liu
Abstract:
Span extraction, aiming to extract text spans (such as words or phrases) from plain texts, is a fundamental process in Information Extraction. Recent works introduce the label knowledge to enhance the text representation by formalizing the span extraction task into a question answering problem (QA Formalization), which achieves state-of-the-art performance. However, QA Formalization does not fully…
▽ More
Span extraction, aiming to extract text spans (such as words or phrases) from plain texts, is a fundamental process in Information Extraction. Recent works introduce the label knowledge to enhance the text representation by formalizing the span extraction task into a question answering problem (QA Formalization), which achieves state-of-the-art performance. However, QA Formalization does not fully exploit the label knowledge and suffers from low efficiency in training/inference. To address those problems, we introduce a new paradigm to integrate label knowledge and further propose a novel model to explicitly and efficiently integrate label knowledge into text representations. Specifically, it encodes texts and label annotations independently and then integrates label knowledge into text representation with an elaborate-designed semantics fusion module. We conduct extensive experiments on three typical span extraction tasks: flat NER, nested NER, and event detection. The empirical results show that 1) our method achieves state-of-the-art performance on four benchmarks, and 2) reduces training time and inference time by 76% and 77% on average, respectively, compared with the QA Formalization paradigm. Our code and data are available at https://github.com/Akeepers/LEAR.
△ Less
Submitted 1 November, 2021;
originally announced November 2021.
-
Deep Structural Point Process for Learning Temporal Interaction Networks
Authors:
Jiangxia Cao,
Xixun Lin,
Xin Cong,
Shu Guo,
Hengzhu Tang,
Tingwen Liu,
Bin Wang
Abstract:
This work investigates the problem of learning temporal interaction networks. A temporal interaction network consists of a series of chronological interactions between users and items. Previous methods tackle this problem by using different variants of recurrent neural networks to model sequential interactions, which fail to consider the structural information of temporal interaction networks and…
▽ More
This work investigates the problem of learning temporal interaction networks. A temporal interaction network consists of a series of chronological interactions between users and items. Previous methods tackle this problem by using different variants of recurrent neural networks to model sequential interactions, which fail to consider the structural information of temporal interaction networks and inevitably lead to sub-optimal results. To this end, we propose a novel Deep Structural Point Process termed as DSPP for learning temporal interaction networks. DSPP simultaneously incorporates the topological structure and long-range dependency structure into our intensity function to enhance model expressiveness. To be specific, by using the topological structure as a strong prior, we first design a topological fusion encoder to obtain node embeddings. An attentive shift encoder is then developed to learn the long-range dependency structure between users and items in continuous time. The proposed two modules enable our model to capture the user-item correlation and dynamic influence in temporal interaction networks. DSPP is evaluated on three real-world datasets for both tasks of item prediction and time prediction. Extensive experiments demonstrate that our model achieves consistent and significant improvements over state-of-the-art baselines.
△ Less
Submitted 7 July, 2021;
originally announced July 2021.
-
A Comprehensive Survey and Taxonomy on Single Image Dehazing Based on Deep Learning
Authors:
Jie Gui,
Xiaofeng Cong,
Yuan Cao,
Wenqi Ren,
Jun Zhang,
Jing Zhang,
Jiuxin Cao,
Dacheng Tao
Abstract:
With the development of convolutional neural networks, hundreds of deep learning based dehazing methods have been proposed. In this paper, we provide a comprehensive survey on supervised, semi-supervised, and unsupervised single image dehazing. We first discuss the physical model, datasets, network modules, loss functions, and evaluation metrics that are commonly used. Then, the main contributions…
▽ More
With the development of convolutional neural networks, hundreds of deep learning based dehazing methods have been proposed. In this paper, we provide a comprehensive survey on supervised, semi-supervised, and unsupervised single image dehazing. We first discuss the physical model, datasets, network modules, loss functions, and evaluation metrics that are commonly used. Then, the main contributions of various dehazing algorithms are categorized and summarized. Further, quantitative and qualitative experiments of various baseline methods are carried out. Finally, the unsolved issues and challenges that can inspire the future research are pointed out. A collection of useful dehazing materials is available at \url{https://github.com/Xiaofeng-life/AwesomeDehazing}.
△ Less
Submitted 20 December, 2022; v1 submitted 6 June, 2021;
originally announced June 2021.
-
Phonon-related monochromatic THz radiation and its magneto-modulation in 2D ferromagnetic Cr2Ge2Te6
Authors:
Long Cheng,
Huiping Li,
Gaoting Lin,
Jian Yan,
Lei Zhang,
Cheng Yang,
Wei Tong,
Zhuang Ren,
Wang Zhu,
Xin Cong,
Jingjing Gao,
Pingheng Tan,
Xuan Luo,
Yuping sun,
Wenguang Zhu,
Zhigao Sheng
Abstract:
Searching multiple types of terahertz (THz) irradiation source is crucial for the THz technology. Here, by utilizing a two-dimensional (2D) ferromagnetic Cr2Ge2Te6 crystal, we firstly demonstrate a magneto-tunable monochromatic THz irradiation source. With a low-photonic-energy broadband THz pump, a strong THz irradiation with frequency ~0.9 THz and bandwidth ~0.25 THz can be generated and its con…
▽ More
Searching multiple types of terahertz (THz) irradiation source is crucial for the THz technology. Here, by utilizing a two-dimensional (2D) ferromagnetic Cr2Ge2Te6 crystal, we firstly demonstrate a magneto-tunable monochromatic THz irradiation source. With a low-photonic-energy broadband THz pump, a strong THz irradiation with frequency ~0.9 THz and bandwidth ~0.25 THz can be generated and its conversion efficiency could even reach 2.1% at 160 K. Moreover, it is intriguing to find that such monochromatic THz irradiation can be efficiently modulated by the magnetic field below 160 K. According to both experimental and theoretical analyses, the emergent THz irradiation is identified as the emission from the phonon-polariton and its temperature and magnetic field dependent behaviors confirmed the large spin-lattice coupling in this 2D ferromagnetic crystal. These observations provide a new route for the creation of tunable monochromatic THz source which may have great practical interests in future applications in photonic and spintronic devices.
△ Less
Submitted 12 May, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Few-Shot Event Detection with Prototypical Amortized Conditional Random Field
Authors:
Xin Cong,
Shiyao Cui,
Bowen Yu,
Tingwen Liu,
Yubin Wang,
Bin Wang
Abstract:
Event detection tends to struggle when it needs to recognize novel event types with a few samples. The previous work attempts to solve this problem in the identify-then-classify manner but ignores the trigger discrepancy between event types, thus suffering from the error propagation. In this paper, we present a novel unified model which converts the task to a few-shot tagging problem with a double…
▽ More
Event detection tends to struggle when it needs to recognize novel event types with a few samples. The previous work attempts to solve this problem in the identify-then-classify manner but ignores the trigger discrepancy between event types, thus suffering from the error propagation. In this paper, we present a novel unified model which converts the task to a few-shot tagging problem with a double-part tagging scheme. To this end, we first propose the Prototypical Amortized Conditional Random Field (PA-CRF) to model the label dependency in the few-shot scenario, which approximates the transition scores between labels based on the label prototypes. Then Gaussian distribution is introduced for modeling of the transition scores to alleviate the uncertain estimation resulting from insufficient data. Experimental results show that the unified models work better than existing identify-then-classify models and our PA-CRF further achieves the best results on the benchmark dataset FewEvent. Our code and data are available at http://github.com/congxin95/PA-CRF.
△ Less
Submitted 24 May, 2021; v1 submitted 3 December, 2020;
originally announced December 2020.
-
Label Enhanced Event Detection with Heterogeneous Graph Attention Networks
Authors:
Shiyao Cui,
Bowen Yu,
Xin Cong,
Tingwen Liu,
Quangang Li,
Jinqiao Shi
Abstract:
Event Detection (ED) aims to recognize instances of specified types of event triggers in text. Different from English ED, Chinese ED suffers from the problem of word-trigger mismatch due to the uncertain word boundaries. Existing approaches injecting word information into character-level models have achieved promising progress to alleviate this problem, but they are limited by two issues. First, t…
▽ More
Event Detection (ED) aims to recognize instances of specified types of event triggers in text. Different from English ED, Chinese ED suffers from the problem of word-trigger mismatch due to the uncertain word boundaries. Existing approaches injecting word information into character-level models have achieved promising progress to alleviate this problem, but they are limited by two issues. First, the interaction between characters and lexicon words is not fully exploited. Second, they ignore the semantic information provided by event labels. We thus propose a novel architecture named Label enhanced Heterogeneous Graph Attention Networks (L-HGAT). Specifically, we transform each sentence into a graph, where character nodes and word nodes are connected with different types of edges, so that the interaction between words and characters is fully reserved. A heterogeneous graph attention networks is then introduced to propagate relational message and enrich information interaction. Furthermore, we convert each label into a trigger-prototype-based embedding, and design a margin loss to guide the model distinguish confusing event labels. Experiments on two benchmark datasets show that our model achieves significant improvement over a range of competitive baseline methods.
△ Less
Submitted 3 December, 2020;
originally announced December 2020.
-
Towards Chirality Control of Graphene Nanoribbons Embedded in Hexagonal Boron Nitride
Authors:
Hui Shan Wang,
Lingxiu Chen,
Kenan Elibol,
Li He,
Haomin Wang,
Chen Chen,
Chengxin Jiang,
Chen Li,
Tianru Wu,
Chun Xiao Cong,
Timothy J. Pennycook,
Giacomo Argentero,
Daoli Zhang,
Kenji Watanabe,
Takashi Taniguchi,
Wenya Wei,
Qinghong Yuan,
Jannik C. Meyer,
Xiaoming Xie
Abstract:
The integrated inplane growth of two dimensional materials with similar lattices, but distinct electrical properties, could provide a promising route to achieve integrated circuitry of atomic thickness. However, fabrication of edge specific GNR in the lattice of hBN still remains an enormous challenge for present approaches. Here we developed a two step growth method and successfully achieved sub…
▽ More
The integrated inplane growth of two dimensional materials with similar lattices, but distinct electrical properties, could provide a promising route to achieve integrated circuitry of atomic thickness. However, fabrication of edge specific GNR in the lattice of hBN still remains an enormous challenge for present approaches. Here we developed a two step growth method and successfully achieved sub 5 nm wide zigzag and armchair GNRs embedded in hBN, respectively. Further transport measurements reveal that the sub 7 nm wide zigzag GNRs exhibit openings of the band gap inversely proportional to their width, while narrow armchair GNRs exhibit some fluctuation in the bandgap width relationship.This integrated lateral growth of edge specific GNRs in hBN brings semiconducting building blocks to atomically thin layer, and will provide a promising route to achieve intricate nanoscale electrical circuits on high quality insulating hBN substrates.
△ Less
Submitted 27 September, 2020;
originally announced September 2020.
-
AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results
Authors:
Pengxu Wei,
Hannan Lu,
Radu Timofte,
Liang Lin,
Wangmeng Zuo,
Zhihong Pan,
Baopu Li,
Teng Xi,
Yanwen Fan,
Gang Zhang,
Jingtuo Liu,
Junyu Han,
Errui Ding,
Tangxin Xie,
Liang Cao,
Yan Zou,
Yi Shen,
Jialiang Zhang,
Yu Jia,
Kaihua Cheng,
Chenhuan Wu,
Yue Lin,
Cen Liu,
Yunbo Peng,
Xueyi Zou
, et al. (51 additional authors not shown)
Abstract:
This paper introduces the real image Super-Resolution (SR) challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2020. This challenge involves three tracks to super-resolve an input image for $\times$2, $\times$3 and $\times$4 scaling factors, respectively. The goal is to attract more attention to realistic image degradation for the SR task, wh…
▽ More
This paper introduces the real image Super-Resolution (SR) challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2020. This challenge involves three tracks to super-resolve an input image for $\times$2, $\times$3 and $\times$4 scaling factors, respectively. The goal is to attract more attention to realistic image degradation for the SR task, which is much more complicated and challenging, and contributes to real-world image super-resolution applications. 452 participants were registered for three tracks in total, and 24 teams submitted their results. They gauge the state-of-the-art approaches for real image SR in terms of PSNR and SSIM.
△ Less
Submitted 25 September, 2020;
originally announced September 2020.
-
Electronic Raman Scattering in Suspended Semiconducting Carbon Nanotubes
Authors:
Yuecong Hu,
Shaochuang Chen,
Daqi Zhang,
Xin Cong,
Sida Sun,
Jiangbin Wu,
Feng Yang,
Juan Yang,
Ping-heng Tan,
Yan Li
Abstract:
The electronic Raman scattering (ERS) features of single-walled carbon nanotubes (SWNTs) can reveal a wealth of information about their electronic structures, but have previously been thought to appear exclusively in metallic (M-) but not in semiconducting (S-) SWNTs. We report the experimental observation of the ERS features with an accuracy of 1 meV in suspended S-SWNTs, the processes of which a…
▽ More
The electronic Raman scattering (ERS) features of single-walled carbon nanotubes (SWNTs) can reveal a wealth of information about their electronic structures, but have previously been thought to appear exclusively in metallic (M-) but not in semiconducting (S-) SWNTs. We report the experimental observation of the ERS features with an accuracy of 1 meV in suspended S-SWNTs, the processes of which are accomplished via the available high-energy electron-hole pairs. The ERS features can facilitate further systematic studies on the properties of SWNT, both metallic and semiconducting, with defined chirality.
△ Less
Submitted 3 September, 2020; v1 submitted 1 September, 2020;
originally announced September 2020.
-
Understanding angle-resolved polarized Raman scattering from black phosphorus at normal and oblique laser incidences
Authors:
Miao-Ling Lin,
Yu-Chen Leng,
Xin Cong,
Da Meng,
Jiahong Wang,
Xiao-Li Li,
Binlu Yu,
Xue-Lu Liu,
Xue-Feng Yu,
Ping-Heng Tan
Abstract:
The selection rule for angle-resolved polarized Raman (ARPR) intensity of phonons from standard group-theoretical method in isotropic materials would break down in anisotropic layered materials (ALMs) due to birefringence and linear dichroism effects. The two effects result in depth-dependent polarization and intensity of incident laser and scattered signal inside ALMs and thus make a challenge to…
▽ More
The selection rule for angle-resolved polarized Raman (ARPR) intensity of phonons from standard group-theoretical method in isotropic materials would break down in anisotropic layered materials (ALMs) due to birefringence and linear dichroism effects. The two effects result in depth-dependent polarization and intensity of incident laser and scattered signal inside ALMs and thus make a challenge to predict ARPR intensity at any laser incidence direction. Herein, taking in-plane anisotropic black phosphorus as a prototype, we developed a so-called birefringence-linear-dichroism (BLD) model to quantitatively understand its ARPR intensity at both normal and oblique laser incidences by the same set of real Raman tensors for certain laser excitation. No fitting parameter is needed, once the birefringence and linear dichroism effects are considered with the complex refractive indexes. An approach was proposed to experimentally determine real Raman tensor and complex refractive indexes, respectively, from the relative Raman intensity along its principle axes and incident-angle resolved reflectivity by Fresnel$'$s law. The results suggest that the previously reported ARPR intensity of ultrathin ALM flakes deposited on a multilayered substrate at normal laser incidence can be also understood based on the BLD model by considering the depth-dependent polarization and intensity of incident laser and scattered Raman signal induced by both birefringence and linear dichroism effects within ALM flakes and the interference effects in the multilayered structures, which are dependent on the excitation wavelength, thickness of ALM flakes and dielectric layers of the substrate. This work can be generally applicable to any opaque anisotropic crystals, offering a promising route to predict and manipulate the polarized behaviors of related phonons.
△ Less
Submitted 22 August, 2020; v1 submitted 9 August, 2020;
originally announced August 2020.
-
Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering
Authors:
Xin Cong,
Bowen Yu,
Tingwen Liu,
Shiyao Cui,
Hengzhu Tang,
Bin Wang
Abstract:
Few-shot classification tends to struggle when it needs to adapt to diverse domains. Due to the non-overlapping label space between domains, the performance of conventional domain adaptation is limited. Previous work tackles the problem in a transductive manner, by assuming access to the full set of test data, which is too restrictive for many real-world applications. In this paper, we set out to…
▽ More
Few-shot classification tends to struggle when it needs to adapt to diverse domains. Due to the non-overlapping label space between domains, the performance of conventional domain adaptation is limited. Previous work tackles the problem in a transductive manner, by assuming access to the full set of test data, which is too restrictive for many real-world applications. In this paper, we set out to tackle this issue by introducing a inductive framework, DaFeC, to improve Domain adaptation performance for Few-shot classification via Clustering. We first build a representation extractor to derive features for unlabeled data from the target domain (no test data is necessary) and then group them with a cluster miner. The generated pseudo-labeled data and the labeled source-domain data are used as supervision to update the parameters of the few-shot classifier. In order to derive high-quality pseudo labels, we propose a Clustering Promotion Mechanism, to learn better features for the target domain via Similarity Entropy Minimization and Adversarial Distribution Alignment, which are combined with a Cosine Annealing Strategy. Experiments are performed on the FewRel 2.0 dataset. Our approach outperforms previous work with absolute gains (in classification accuracy) of 4.95%, 9.55%, 3.99% and 11.62%, respectively, under four few-shot settings.
△ Less
Submitted 23 June, 2020;
originally announced June 2020.
-
Multivariate Log-Contrast Regression with Sub-Compositional Predictors: Testing the Association Between Preterm Infants' Gut Microbiome and Neurobehavioral Outcomes
Authors:
Xiaokang Liu,
Xiaomei Cong,
Gen Li,
Kendra Maas,
Kun Chen
Abstract:
The so-called gut-brain axis has stimulated extensive research on microbiomes. One focus is to assess the association between certain clinical outcomes and the relative abundances of gut microbes, which can be presented as sub-compositional data in conformity with the taxonomic hierarchy of bacteria. Motivated by a study for identifying the microbes in the gut microbiome of preterm infants that im…
▽ More
The so-called gut-brain axis has stimulated extensive research on microbiomes. One focus is to assess the association between certain clinical outcomes and the relative abundances of gut microbes, which can be presented as sub-compositional data in conformity with the taxonomic hierarchy of bacteria. Motivated by a study for identifying the microbes in the gut microbiome of preterm infants that impact their later neurobehavioral outcomes, we formulate a constrained integrative multi-view regression, where the neurobehavioral scores form multivariate response, the sub-compositional microbiome data form multi-view feature matrices, and a set of linear constraints on their corresponding sub-coefficient matrices ensures the conformity to the simplex geometry. To enable joint selection and inference of sub-compositions/views, we assume all the sub-coefficient matrices are possibly of low-rank, i.e., the outcomes are associated with the microbiome through different sets of latent sub-compositional factors from different taxa. We propose a scaled composite nuclear norm penalization approach for model estimation and develop a hypothesis testing procedure through de-biasing to assess the significance of different views. Simulation studies confirm the effectiveness of the proposed procedure. In the preterm infant study, the identified microbes are mostly consistent with existing studies and biological understandings. Our approach supports that stressful early life experiences imprint gut microbiome through the regulation of the gut-brain axis.
△ Less
Submitted 31 May, 2020;
originally announced June 2020.
-
Edge-Epitaxial Growth of InSe Nanowires toward High-Performance Photodetectors
Authors:
Song Hao,
Shengnan Yan,
Yang Wang,
Tao Xu,
Hui Zhang,
Xin Cong,
Lingfei Li,
Xiaowei Liu,
Tianjun Cao,
Anyuan Gao,
Lili Zhang,
Lanxin Jia,
Mingsheng Long,
Weida Hu,
Xiaomu Wang,
Pingheng Tan,
Litao Sun,
Xinyi Cui,
Shi-Jun Liang,
Feng Miao
Abstract:
Semiconducting nanowires offer many opportunities for electronic and optoelectronic device applications due to their special geometries and unique physical properties. However, it has been challenging to synthesize semiconducting nanowires directly on SiO2/Si substrate due to lattice mismatch. Here, we developed a catalysis-free approach to achieve direct synthesis of long and straight InSe nanowi…
▽ More
Semiconducting nanowires offer many opportunities for electronic and optoelectronic device applications due to their special geometries and unique physical properties. However, it has been challenging to synthesize semiconducting nanowires directly on SiO2/Si substrate due to lattice mismatch. Here, we developed a catalysis-free approach to achieve direct synthesis of long and straight InSe nanowires on SiO2/Si substrate through edge-homoepitaxial growth. We further achieved parallel InSe nanowires on SiO2/Si substrate through controlling growth conditions. We attributed the underlying growth mechanism to selenium self-driven vapor-liquid-solid process, which is distinct from conventional metal-catalytic vapor-liquid-solid method widely used for growing Si and III-V nanowires. Furthermore, we demonstrated that the as-grown InSe nanowire-based visible light photodetector simultaneously possesses an extraordinary photoresponsivity of 271 A/W, ultrahigh detectivity of 1.57*10^14 Jones and a fast response speed of microsecond scale. The excellent performance of the photodetector indicates that as-grown InSe nanowires are promising in future optoelectronic applications. More importantly, the proposed edge-homoepitaxial approach may open up a novel avenue for direct synthesis of semiconducting nanowire arrays on SiO2/Si substrate.
△ Less
Submitted 28 December, 2019;
originally announced December 2019.
-
Linear Dichroism Conversion in Quasi One-Dimensional Perovskite Chalcogenide
Authors:
Jiangbin Wu,
Xin Cong,
Shanyuan Niu,
Fanxin Liu,
Huan Zhao,
Zhonghao Du,
Jayakanth Ravichandran,
Ping-Heng Tan,
Han Wang
Abstract:
Anisotropic photonic materials with linear dichroism are crucial components in many sensing, imaging and communication applications. Such materials play an important role as polarizers, filters and wave-plates in photonic devices and circuits. Conventional crystalline materials with optical anisotropy typically show unidirectional linear dichroism over a broad wavelength range. The linear dichrois…
▽ More
Anisotropic photonic materials with linear dichroism are crucial components in many sensing, imaging and communication applications. Such materials play an important role as polarizers, filters and wave-plates in photonic devices and circuits. Conventional crystalline materials with optical anisotropy typically show unidirectional linear dichroism over a broad wavelength range. The linear dichroism conversion phenomenon has not been observed in crystalline materials. Here, we report the investigation of the unique linear dichroism conversion phenomenon in quasi-one-dimensional (quasi-1D) hexagonal perovskite chalcogenide BaTiS3. The material shows record level of optical anisotropy within the visible wavelength range. In contrast to conventional anisotropic optical materials, the linear dichroism polarity in BaTiS3 makes an orthogonal change at an optical wavelength corresponding to the photon energy of 1.78 eV. First principle calculations reveal that this anomalous linear dichroism conversion behavior originates from different selection rules of the optical transitions from the parallel bands in the BaTiS3 material. Wavelength dependent polarized Raman spectroscopy further confirms this phenomenon. Such material with linear dichroism conversion property can facilitate new ability to control and sense the energy and polarization of light, and lead to novel photonic devices such as polarization-wavelength selective detectors and lasers for multispectral imaging, sensing and optical communication applications.
△ Less
Submitted 27 May, 2019; v1 submitted 23 May, 2019;
originally announced May 2019.
-
Log-Contrast Regression with Functional Compositional Predictors: Linking Preterm Infant's Gut Microbiome Trajectories to Neurobehavioral Outcome
Authors:
Zhe Sun,
Wanli Xu,
Xiaomei Cong,
Gen Li,
Kun Chen
Abstract:
The neonatal intensive care unit (NICU) experience is known to be one of the most crucial factors that drive preterm infant's neurodevelopmental and health outcomes. It is hypothesized that stressful early life experience of very preterm neonate is imprinting gut microbiome by the regulation of the so-called brain-gut axis, and consequently, certain microbiome markers are predictive of later infan…
▽ More
The neonatal intensive care unit (NICU) experience is known to be one of the most crucial factors that drive preterm infant's neurodevelopmental and health outcomes. It is hypothesized that stressful early life experience of very preterm neonate is imprinting gut microbiome by the regulation of the so-called brain-gut axis, and consequently, certain microbiome markers are predictive of later infant neurodevelopment. To investigate, a preterm infant study was conducted; infant fecal samples were collected during the infants' first month of postnatal age, resulting in functional compositional microbiome data, and neurobehavioral outcomes were measured when infants reached 36-38 weeks of post-menstrual age. To identify potential microbiome markers and estimate how the trajectories of gut microbiome compositions during early postnatal stage impact later neurobehavioral outcomes of the preterm infants, we innovate a sparse log-contrast regression with functional compositional predictors. The functional simplex structure is strictly preserved, and the functional compositional predictors are allowed to have sparse, smoothly varying, and accumulating effects on the outcome through time. Through a pragmatic basis expansion step, the problem boils down to a linearly constrained sparse group regression, for which we develop an efficient algorithm and obtain theoretical performance guarantees. Our approach yields insightful results in the preterm infant study. The identified microbiome markers and the estimated time dynamics of their impact on the neurobehavioral outcome shed light on the linkage between stress accumulation in early postnatal stage and neurodevelopmental process of infants.
△ Less
Submitted 8 October, 2019; v1 submitted 7 August, 2018;
originally announced August 2018.
-
Low-Temperature Eutectic Synthesis of PtTe2 with Weak Antilocalization and Controlled Layer Thinning
Authors:
Song Hao,
Junwen Zeng,
Tao Xu,
Xin Cong,
Chenyu Wang,
Chenchen Wu,
Yaojia Wang,
Xiaowei Liu,
Tianjun Cao,
Guangxu Su,
Lanxin Jia,
Zhangting Wu,
Qian Lin,
Lili Zhang,
Shengnan Yan,
Mengfan Guo,
Zhenlin Wang,
Pingheng Tan,
Litao Sun,
Zhenhua Ni,
Shi-Jun Liang,
Xinyi Cui,
Feng Miao
Abstract:
Metallic transition metal dichalcogenides (TMDs) have exhibited various exotic physical properties and hold the promise of novel optoelectronic and topological devices applications. However, the synthesis of metallic TMDs is based on gas-phase methods and requires high temperature condition. As an alternative to the gas-phase synthetic approach, lower temperature eutectic liquid-phase synthesis pr…
▽ More
Metallic transition metal dichalcogenides (TMDs) have exhibited various exotic physical properties and hold the promise of novel optoelectronic and topological devices applications. However, the synthesis of metallic TMDs is based on gas-phase methods and requires high temperature condition. As an alternative to the gas-phase synthetic approach, lower temperature eutectic liquid-phase synthesis presents a very promising approach with the potential for larger-scale and controllable growth of high-quality thin metallic TMDs single crystals. Herein, we report the first realization of low-temperature eutectic liquid-phase synthesis of type-II Dirac semimetal PtTe2 single crystals with thickness ranging from 2 to 200 nm. The electrical measurement of synthesized PtTe2 reveals a record-high conductivity of as high as 3.3*106 S/m at room temperature. Besides, we experimentally identify the weak antilocalization behavior in the type-II Dirac semimetal PtTe2 for the first time. Furthermore, we develop a simple and general strategy to obtain atomically-thin PtTe2 crystal by thinning as-synthesized bulk samples, which can still retain highly crystalline and exhibits excellent electric conductivity. Our results of controllable and scalable low-temperature eutectic liquid-phase synthesis and layer-by-layer thinning of high-quality thin PtTe2 single crystals offer a simple and general approach for obtaining different thickness metallic TMDs with high-melting point transition metal.
△ Less
Submitted 7 August, 2018;
originally announced August 2018.
-
Direct evidences of pentagonal silicon chains and magic clusters
Authors:
Shaoxiang Sheng,
Runze Ma,
Jiangbin Wu,
Wenbin Li,
Longjuan Kong,
Xin Cong,
Duanyun Cao,
Wenqi Hu,
Jun-Wei Luo,
Peng Cheng,
Ping-Heng Tan,
Ying Jiang,
Lan Chen,
Kehui Wu
Abstract:
Pentagon is one of the most beautiful geometric structures in nature, but it is rarely seen simply because five-fold symmetry is mathematically forbidden in a 2D or 3D periodic lattices. Fortunately, pentagon as a structural element is allowed in 1D or 0D systems, since translational symmetry is not necessary there. However, in these systems pentagons only compose a small portion of the structure.…
▽ More
Pentagon is one of the most beautiful geometric structures in nature, but it is rarely seen simply because five-fold symmetry is mathematically forbidden in a 2D or 3D periodic lattices. Fortunately, pentagon as a structural element is allowed in 1D or 0D systems, since translational symmetry is not necessary there. However, in these systems pentagons only compose a small portion of the structure. So far, 1D or 0D systems consisting of purely pentagons are still rare. Here, combing high resolution non-contact atomic force microscopy and tip-enhanced Raman spectroscopy, we have directly visualized the pentagon-ring structure in self-assembled Si nanoribbons and magic clusters on Ag(110) substrate. Moreover, chemical fingerprint of Si pentagon was detected in individual Si nanoribbon and clusters by tip-enhanced Raman spectroscopy. This work demonstrates that Si pentagon can be an important element in building silicon nanostructures, which may find potential applications in future nanoelectronics based on silicon.
△ Less
Submitted 25 November, 2017;
originally announced November 2017.
-
Fabrication of graphene nanodisk arrays using nanosphere lithography
Authors:
C. X. Cong,
T. Yu,
Z. H. Ni,
L. Liu,
Z. X. Shen,
W. Huang
Abstract:
Ordered graphene nanodisk arrays have been successfully fabricated by combining nanosphere lithography (NSL) and reactive ion etching (RIE) processes. The dimension of graphene nanodisks can be effectively tuned by varying the size of polystyrene spheres, which function as masks during RIE. Low voltage scanning electron microscopy shows that the graphene sheet could be readily patterned into per…
▽ More
Ordered graphene nanodisk arrays have been successfully fabricated by combining nanosphere lithography (NSL) and reactive ion etching (RIE) processes. The dimension of graphene nanodisks can be effectively tuned by varying the size of polystyrene spheres, which function as masks during RIE. Low voltage scanning electron microscopy shows that the graphene sheet could be readily patterned into periodic disk-like nanostructures by oxygen RIE. Raman mapping and spectroscopy further visualize such nanodisk arrays and reveal the nature of disks are crystalline single layer graphene. This work demonstrates an efficient and manageable way to pattern graphene. Considering the periodicity, nanometer dimension and large edge to body ratio, the graphene nanodisk arrays, such two dimensional assembly of carbon atoms offer intrisic advantages in various electronic and spintronic fabrications.
△ Less
Submitted 30 December, 2008;
originally announced December 2008.