Skip to main content

Showing 201–250 of 692 results for author: Ta, C

.
  1. False: False Negative Samples Aware Contrastive Learning for Semantic Segmentation of High-Resolution Remote Sensing Image

    Authors: Zhaoyang Zhang, Xuying Wang, Xiaoming Mei, Chao Tao, Haifeng Li

    Abstract: The existing SSCL of RSI is built based on constructing positive and negative sample pairs. However, due to the richness of RSI ground objects and the complexity of the RSI contextual semantics, the same RSI patches have the coexistence and imbalance of positive and negative samples, which causing the SSCL pushing negative samples far away while pushing positive samples far away, and vice versa. W… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 5 Pages, 3 Figures, 5 tables

  2. arXiv:2211.06061  [pdf

    physics.optics nlin.PS

    Dichromatic breather molecules in a mode-locked fiber laser

    Authors: Yudong Cui1, Yusheng Zhang, Lin Huang, Aiguo Zhang, Zhiming Liu, Cuifang Kuang, Chenning Tao, Daru Chen, Xu Liu, Boris A. Malomed

    Abstract: Bound states of solitons (molecules) occur in various settings, playing an important role in the operation of fiber lasers, optical emulations, encoding, and communications. Soliton interactions are generally related to breathing dynamics in nonlinear dissipative systems, maintaining potential applications in spectroscopy. In the present work, dichromatic breather molecules (DBMs) are created in a… ▽ More

    Submitted 30 March, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: to be published in Phys. Rev. Lett

  3. arXiv:2211.05987  [pdf, other

    cs.CL

    CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification

    Authors: Yang Li, Canran Xu, Guodong Long, Tao Shen, Chongyang Tao, Jing Jiang

    Abstract: Recently, prefix-tuning was proposed to efficiently adapt pre-trained language models to a broad spectrum of natural language classification tasks. It leverages soft prefix as task-specific indicators and language verbalizers as categorical-label mentions to narrow the formulation gap from pre-training language models. However, when the label space increases considerably (i.e., many-class classifi… ▽ More

    Submitted 12 February, 2024; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: has been accepted by EACL 2024

  4. arXiv:2211.05719  [pdf, other

    cs.CL cs.AI cs.CV cs.LG cs.MM

    MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation

    Authors: Jiazhan Feng, Qingfeng Sun, Can Xu, Pu Zhao, Yaming Yang, Chongyang Tao, Dongyan Zhao, Qingwei Lin

    Abstract: Responding with multi-modal content has been recognized as an essential capability for an intelligent conversational agent. In this paper, we introduce the MMDialog dataset to better facilitate multi-modal conversation. MMDialog is composed of a curated set of 1.08 million real-world dialogues with 1.53 million unique images across 4,184 topics. MMDialog has two main and unique advantages. First,… ▽ More

    Submitted 21 December, 2022; v1 submitted 10 November, 2022; originally announced November 2022.

  5. Cosmic void exclusion models and their impact on the distance scale measurements from large scale structure

    Authors: Andrei Variu, Cheng Zhao, Daniel Forero-Sánchez, Chia-Hsun Chuang, Francisco-Shu Kitaura, Charling Tao, Amélie Tamone, Jean-Paul Kneib

    Abstract: Baryonic Acoustic Oscillations (BAOs) studies based on the clustering of voids and matter tracers provide important constraints on cosmological parameters related to the expansion of the Universe. However, modelling the void exclusion effect is an important challenge for fully exploiting the potential of this kind of analyses. We thus develop two numerical methods to describe the clustering of cos… ▽ More

    Submitted 16 March, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: 20 pages, 28 figures

  6. Potential scientific synergies in weak lensing studies between the CSST and Euclid space probes

    Authors: D. Z. Liu, X. M. Meng, X. Z. Er, Z. H. Fan, M. Kilbinger, G. L. Li, R. Li, T. Schrabback, D. Scognamiglio, H. Y. Shan, C. Tao, Y. S. Ting, J. Zhang, S. H. Cheng, S. Farrens, L. P. Fu, H. Hildebrandt, X. Kang, J. P. Kneib, X. K. Liu, Y. Mellier, R. Nakajima, P. Schneider, J. L. Starck, C. L. Wei , et al. (2 additional authors not shown)

    Abstract: Aims. With the next generation of large surveys coming to the stage of observational cosmology soon, it is important to explore their potential synergies and to maximise their scientific outcomes. In this study, we aim to investigate the complementarity of the two upcoming space missions Euclid and the China Space Station Telescope (CSST), focusing on weak lensing (WL) cosmology. In particular, we… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: 18 pages, 19 figures and 2 tables. Accepted for publication in A&A

    Journal ref: A&A 669, A128 (2023)

  7. arXiv:2210.14169  [pdf, other

    cs.CL cs.AI cs.LG

    Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding

    Authors: Maximillian Chen, Alexandros Papangelis, Chenyang Tao, Andy Rosenbaum, Seokhwan Kim, Yang Liu, Zhou Yu, Dilek Hakkani-Tur

    Abstract: Dialogue understanding tasks often necessitate abundant annotated data to achieve good performance and that presents challenges in low-resource settings. To alleviate this barrier, we explore few-shot data augmentation for dialogue understanding by prompting large pre-trained language models and present a novel approach that iterates on augmentation quality by applying weakly-supervised filters. W… ▽ More

    Submitted 2 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: To appear in SyntheticData4ML @ NeurIPS 2022. 16 pages, 10 figures, 3 tables

  8. arXiv:2210.12460  [pdf, other

    cs.CL

    Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation

    Authors: Xueliang Zhao, Yuxuan Wang, Chongyang Tao, Chenshuo Wang, Dongyan Zhao

    Abstract: We study video-grounded dialogue generation, where a response is generated based on the dialogue context and the associated video. The primary challenges of this task lie in (1) the difficulty of integrating video data into pre-trained language models (PLMs) which presents obstacles to exploiting the power of large-scale pre-training; and (2) the necessity of taking into account the complementarit… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: To appear at EMNLP 2022 findings

  9. arXiv:2210.12459  [pdf, other

    cs.CL

    There Is No Standard Answer: Knowledge-Grounded Dialogue Generation with Adversarial Activated Multi-Reference Learning

    Authors: Xueliang Zhao, Tingchen Fu, Chongyang Tao, Rui Yan

    Abstract: Knowledge-grounded conversation (KGC) shows excellent potential to deliver an engaging and informative response. However, existing approaches emphasize selecting one golden knowledge given a particular dialogue context, overlooking the one-to-many phenomenon in dialogue. As a result, the existing paradigm limits the diversity of knowledge selection and generation. To this end, we establish a multi… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: To appear at EMNLP 2022 main conference. The first two authors contributed equally

  10. arXiv:2210.11929  [pdf, other

    cs.CV cs.CL

    LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling

    Authors: Dongsheng Chen, Chaofan Tao, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu

    Abstract: Recent large-scale video-language pre-trained models have shown appealing performance on various downstream tasks. However, the pre-training process is computationally expensive due to the requirement of millions of video-text pairs and the redundant data structure of each video. To mitigate these problems, we propose LiteVL, which adapts a pre-trained image-language model BLIP into a video-text m… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 13 pages, 6 figures, accepted by EMNLP 2022 main conference

  11. arXiv:2210.08701  [pdf, other

    cs.LG cs.CV

    ODG-Q: Robust Quantization via Online Domain Generalization

    Authors: Chaofan Tao, Ngai Wong

    Abstract: Quantizing neural networks to low-bitwidth is important for model deployment on resource-limited edge hardware. Although a quantized network has a smaller model size and memory footprint, it is fragile to adversarial attacks. However, few methods study the robustness and training efficiency of quantized networks. To this end, we propose a new method by recasting robust quantization as an online do… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

  12. arXiv:2210.06708  [pdf

    astro-ph.SR hep-ph

    Bump Morphology of the CMAGIC Diagram

    Authors: L. Aldoroty, L. Wang, P. Hoeflich, J. Yang, N. Suntzeff, G. Aldering, P. Antilogus, C. Aragon, S. Bailey, C. Baltay, S. Bongard, K. Boone, C. Buton, Y. Copin, S. Dixon, D. Fouchez, E. Gangler, R. Gupta, B. Hayden, Mitchell Karmen, A. G. Kim, M. Kowalski, D. Küsters, P. -F. Léget, F. Mondon , et al. (16 additional authors not shown)

    Abstract: We apply the color-magnitude intercept calibration method (CMAGIC) to the Nearby Supernova Factory SNe Ia spectrophotometric dataset. The currently existing CMAGIC parameters are the slope and intercept of a straight line fit to the first linear region in the color-magnitude diagram, which occurs over a span of approximately 30 days after maximum brightness. We define a new parameter, $ω_{XY}$, th… ▽ More

    Submitted 22 June, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 19 pages, 9 figures

    Journal ref: The Astrophysical Journal, 948:10 (15pp), 2023 May 1

  13. arXiv:2209.09502  [pdf, other

    cs.CV

    GAMA: Generative Adversarial Multi-Object Scene Attacks

    Authors: Abhishek Aich, Calvin-Khang Ta, Akash Gupta, Chengyu Song, Srikanth V. Krishnamurthy, M. Salman Asif, Amit K. Roy-Chowdhury

    Abstract: The majority of methods for crafting adversarial attacks have focused on scenes with a single dominant object (e.g., images from ImageNet). On the other hand, natural scenes include multiple dominant objects that are semantically related. Thus, it is crucial to explore designing attack strategies that look beyond learning on single-object scenes or attack single-object victim classifiers. Due to t… ▽ More

    Submitted 15 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted at NeurIPS 2022; First two authors contributed equally; Includes Supplementary Material

  14. arXiv:2209.01718  [pdf, other

    stat.ME

    Online Updating Huber Robust Regression for Big Data Streams

    Authors: Chunbai Tao, Shanshan Wang

    Abstract: Big data streams are grasping increasing attention with the development of modern science and information technology. Due to the incompatibility of limited computer memory to high volume of streaming data, real-time methods without historical data storage is worth investigating. Moreover, outliers may occur with high velocity data streams generating, calling for more robust analysis. Motivated by… ▽ More

    Submitted 28 June, 2023; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: 25 pages, 5 figures, 2023 JONIT CONFERENCE ON STATISTICS AND DATA SCIENCE IN CHINA

  15. arXiv:2208.14754  [pdf, other

    cs.IR

    LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval

    Authors: Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang

    Abstract: In large-scale retrieval, the lexicon-weighting paradigm, learning weighted sparse representations in vocabulary space, has shown promising results with high quality and low latency. Despite it deeply exploiting the lexicon-representing capability of pre-trained language models, a crucial gap remains between language modeling and lexicon-weighting retrieval -- the former preferring certain or low-… ▽ More

    Submitted 4 June, 2023; v1 submitted 31 August, 2022; originally announced August 2022.

    Comments: Appeared at ICLR 2023

  16. arXiv:2208.13661  [pdf, other

    cs.CL

    LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval

    Authors: Kai Zhang, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao, Daxin Jiang

    Abstract: Retrieval models based on dense representations in semantic space have become an indispensable branch for first-stage retrieval. These retrievers benefit from surging advances in representation learning towards compressive global sequence-level embeddings. However, they are prone to overlook local salient phrases and entity mentions in texts, which usually play pivot roles in first-stage retrieval… ▽ More

    Submitted 2 March, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

    Comments: 14 pages, 6 tables, 4 figures. WWW 2023

  17. arXiv:2208.10437  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Multiple topological nodal structure in LaSb2 with large linear magnetoresistance

    Authors: Y. X. Qiao, Z. C. Tao, F. Y. Wang, Huaiqiang Wang, Z. C. Jiang, Z. T. Liu, Soohyun Cho, F. Y. Zhang, Q. K. Meng, W. Xia, Y. C. Yang, Z. Huang, J. S. Liu, Z. H. Liu, Z. W. Zhu, S. Qiao, Y. F. Guo, Haijun Zhang, Dawei Shen

    Abstract: Unconventional fermions in the immensely studied topological semimetals are the source for rich exotic topological properties. Here, using symmetry analysis and first-principles calculations, we propose the coexistence of multiple topological nodal structure in LaSb2, including topological nodal surfaces, nodal lines and in particular eightfold degenerate nodal points, which have been scarcely obs… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

  18. Imperfect chirality at exceptional points in optical whispering-gallery microcavities

    Authors: Junda Zhu, Changqing Wang, Can Tao, Zhoutian Fu, Haitao Liu, Fang Bo, Lan Yang, Guoquan Zhang, Jingjun Xu

    Abstract: Non-Hermitian systems have attracted considerable attention for their broad impacts on various physical platforms and peculiar applications. In non-Hermitian systems, both eigenvalues and eigenstates simultaneously coalesce at exceptional points (EPs). As one of the remarkable features of EPs, the field chirality is commonly considered perfect, which is utilized as an intriguing feature to control… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Journal ref: Phys. Rev. A 108, L041501 (2023)

  19. arXiv:2208.06238  [pdf, other

    astro-ph.CO

    Void BAO measurements on quasars from eBOSS

    Authors: A. Tamone, C. Zhao, D. Forero-Sánchez, A. Variu, C. -H. Chuang, F. -S. Kitaura, J. -P. Kneib, C. Tao

    Abstract: We present the clustering of voids based on the quasar (QSO) sample of the extended Baryon Oscillation Spectroscopic Survey Data Release 16 in configuration space. We define voids as overlapping empty circumspheres computed by Delaunay tetrahedra spanned by quartets of quasars, allowing for an estimate of the depth of underdense regions. To maximise the BAO signal-to-noise ratio, we consider only… ▽ More

    Submitted 12 August, 2022; originally announced August 2022.

    Comments: Submitted to MNRAS

  20. arXiv:2207.07645  [pdf, other

    astro-ph.CO cs.LG

    A Probabilistic Autoencoder for Type Ia Supernovae Spectral Time Series

    Authors: George Stein, Uros Seljak, Vanessa Bohm, G. Aldering, P. Antilogus, C. Aragon, S. Bailey, C. Baltay, S. Bongard, K. Boone, C. Buton, Y. Copin, S. Dixon, D. Fouchez, E. Gangler, R. Gupta, B. Hayden, W. Hillebrandt, M. Karmen, A. G. Kim, M. Kowalski, D. Kusters, P. F. Leget, F. Mondon, J. Nordin , et al. (15 additional authors not shown)

    Abstract: We construct a physically-parameterized probabilistic autoencoder (PAE) to learn the intrinsic diversity of type Ia supernovae (SNe Ia) from a sparse set of spectral time series. The PAE is a two-stage generative model, composed of an Auto-Encoder (AE) which is interpreted probabilistically after training using a Normalizing Flow (NF). We demonstrate that the PAE learns a low-dimensional latent sp… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: 23 pages, 8 Figures, 1 Table. Accepted to ApJ

  21. arXiv:2207.04519  [pdf, other

    astro-ph.HE astro-ph.IM hep-ex

    A multi-cubic-kilometre neutrino telescope in the western Pacific Ocean

    Authors: Z. P. Ye, F. Hu, W. Tian, Q. C. Chang, Y. L. Chang, Z. S. Cheng, J. Gao, T. Ge, G. H. Gong, J. Guo, X. X. Guo, X. G. He, J. T. Huang, K. Jiang, P. K. Jiang, Y. P. Jing, H. L. Li, J. L. Li, L. Li, W. L. Li, Z. Li, N. Y. Liao, Q. Lin, F. Liu, J. L. Liu , et al. (33 additional authors not shown)

    Abstract: Next-generation neutrino telescopes with significantly improved sensitivity are required to pinpoint the sources of the diffuse astrophysical neutrino flux detected by IceCube and uncover the century-old puzzle of cosmic ray origins. A detector near the equator will provide a unique viewpoint of the neutrino sky, complementing IceCube and other neutrino telescopes in the Northern Hemisphere. Here… ▽ More

    Submitted 13 May, 2024; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: 34 pages,12 figures. Correspondence should be addressed to D. L. Xu: [email protected]

  22. arXiv:2206.11985  [pdf, other

    eess.SY

    Path Integral Methods with Stochastic Control Barrier Functions

    Authors: Chuyuan Tao, Hyung-Jin Yoon, Hunmin Kim, Naira Hovakimyan, Petros Voulgaris

    Abstract: Safe control designs for robotic systems remain challenging because of the difficulties of explicitly solving optimal control with nonlinear dynamics perturbed by stochastic noise. However, recent technological advances in computing devices enable online optimization or sampling-based methods to solve control problems. For example, Control Barrier Functions (CBFs), a Lyapunov-like control algorith… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  23. arXiv:2206.10265  [pdf, other

    cs.CL

    KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Low-Resource NLP

    Authors: Yufei Wang, Jiayi Zheng, Can Xu, Xiubo Geng, Tao Shen, Chongyang Tao, Daxin Jiang

    Abstract: This paper focuses on the data augmentation for low-resource NLP tasks where the training set is limited. The existing solutions either leverage task-independent heuristic rules (e.g., Synonym Replacement) or fine-tune general-purpose pre-trained language models (e.g., GPT2) using the limited training instances to produce new synthetic data. Consequently, they have trivial task-specific knowledge… ▽ More

    Submitted 27 January, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Accepted by ICLR 2023 main track at https://openreview.net/forum?id=2nocgE1m0A

  24. arXiv:2206.08063  [pdf, other

    cs.IR cs.CL

    Towards Robust Ranker for Text Retrieval

    Authors: Yucheng Zhou, Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Guodong Long, Binxing Jiao, Daxin Jiang

    Abstract: A ranker plays an indispensable role in the de facto 'retrieval & rerank' pipeline, but its training still lags behind -- learning from moderate negatives or/and serving as an auxiliary module for a retriever. In this work, we first identify two major barriers to a robust ranker, i.e., inherent label noises caused by a well-trained retriever and non-ideal negatives sampled for a high-capable ranke… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 11 pages of main content, 4 tables, 3 figures

  25. Poisson2Sparse: Self-Supervised Poisson Denoising From a Single Image

    Authors: Calvin-Khang Ta, Abhishek Aich, Akash Gupta, Amit K. Roy-Chowdhury

    Abstract: Image enhancement approaches often assume that the noise is signal independent, and approximate the degradation model as zero-mean additive Gaussian. However, this assumption does not hold for biomedical imaging systems where sensor-based sources of noise are proportional to signal strengths, and the noise is better represented as a Poisson process. In this work, we explore a sparsity and dictiona… ▽ More

    Submitted 27 June, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Accepted to MICCAI 2022

  26. arXiv:2206.01204  [pdf, other

    cs.CV

    Siamese Image Modeling for Self-Supervised Vision Representation Learning

    Authors: Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng Dai

    Abstract: Self-supervised learning (SSL) has delivered superior performance on a variety of downstream vision tasks. Two main-stream SSL frameworks have been proposed, i.e., Instance Discrimination (ID) and Masked Image Modeling (MIM). ID pulls together representations from different views of the same image, while avoiding feature collapse. It lacks spatial sensitivity, which requires modeling the local str… ▽ More

    Submitted 16 November, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  27. arXiv:2205.11194  [pdf, other

    cs.IR cs.CL

    UnifieR: A Unified Retriever for Large-Scale Retrieval

    Authors: Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Guodong Long, Kai Zhang, Daxin Jiang

    Abstract: Large-scale retrieval is to recall relevant documents from a huge collection given a query. It relies on representation learning to embed documents and queries into a common semantic encoding space. According to the encoding space, recent retrieval methods based on pre-trained language models (PLM) can be coarsely categorized into either dense-vector or lexicon-based paradigms. These two paradigms… ▽ More

    Submitted 4 June, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: To appear at KDD ADS 2023

  28. arXiv:2205.01116  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.SR

    Uniform Recalibration of Common Spectrophotometry Standard Stars onto the CALSPEC System using the SuperNova Integral Field Spectrograph

    Authors: David Rubin, G. Aldering, P. Antilogus, C. Aragon, S. Bailey, C. Baltay, S. Bongard, K. Boone, C. Buton, Y. Copin, S. Dixon, D. Fouchez, E. Gangler, R. Gupta, B. Hayden, W. Hillebrandt, A. G. Kim, M. Kowalski, D. Kuesters, P. -F. Leget, F. Mondon, J. Nordin, R. Pain, E. Pecontal, R. Pereira , et al. (13 additional authors not shown)

    Abstract: We calibrate spectrophotometric optical spectra of 32 stars commonly used as standard stars, referenced to 14 stars already on the HST-based CALSPEC flux system. Observations of CALSPEC and non-CALSPEC stars were obtained with the SuperNova Integral Field Spectrograph over the wavelength range 3300 A to 9400 A as calibration for the Nearby Supernova Factory cosmology experiment. In total, this ana… ▽ More

    Submitted 21 June, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in ApJS

  29. arXiv:2204.08727  [pdf, ps, other

    astro-ph.HE astro-ph.CO astro-ph.GA astro-ph.SR

    Euclid: Searching for pair-instability supernovae with the Deep Survey

    Authors: T. J. Moriya, C. Inserra, M. Tanaka, E. Cappellaro, M. Della Valle, I. Hook, R. Kotak, G. Longo, F. Mannucci, S. Mattila, C. Tao, B. Altieri, A. Amara, N. Auricchio, D. Bonino, E. Branchini, M. Brescia, J. Brinchmann, S. Camera, V. Capobianco, C. Carbone, J. Carretero, M. Castellano, S. Cavuoti, A. Cimatti , et al. (84 additional authors not shown)

    Abstract: Pair-instability supernovae are theorized supernovae that have not yet been observationally confirmed. They are predicted to exist in low-metallicity environments. Because overall metallicity becomes lower at higher redshifts, deep near-infrared transient surveys probing high-redshift supernovae are suitable to discover pair-instability supernovae. The Euclid satellite, which is planned to be laun… ▽ More

    Submitted 26 August, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: 12 pages, 13 figures, 2 tables, accepted by Astronomy & Astrophysics

    Journal ref: Astronomy & Astrophysics, Volume 666, id.A157, 12 pp. (2022)

  30. arXiv:2204.05805  [pdf, other

    cs.CL

    Learning to Express in Knowledge-Grounded Conversation

    Authors: Xueliang Zhao, Tingchen Fu, Chongyang Tao, Wei Wu, Dongyan Zhao, Rui Yan

    Abstract: Grounding dialogue generation by extra knowledge has shown great potentials towards building a system capable of replying with knowledgeable and engaging responses. Existing studies focus on how to synthesize a response with proper knowledge, yet neglect that the same knowledge could be expressed differently by speakers even under the same context. In this work, we mainly consider two aspects of k… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Accepted by NAACL 2022 (main conference)

  31. arXiv:2204.04716  [pdf, other

    cs.CV cs.AI cs.GT

    TOV: The Original Vision Model for Optical Remote Sensing Image Understanding via Self-supervised Learning

    Authors: Chao Tao, Ji Qia, Guo Zhang, Qing Zhu, Weipeng Lu, Haifeng Li

    Abstract: Do we on the right way for remote sensing image understanding (RSIU) by training models via supervised data-dependent and task-dependent way, instead of human vision in a label-free and task-independent way? We argue that a more desirable RSIU model should be trained with intrinsic structure from data rather that extrinsic human labels to realize generalizability across a wide range of RSIU tasks.… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: 38 pages, 5 figures, 8 Tables

  32. arXiv:2204.02624  [pdf, other

    cs.CL

    There Are a Thousand Hamlets in a Thousand People's Eyes: Enhancing Knowledge-grounded Dialogue with Personal Memory

    Authors: Tingchen Fu, Xueliang Zhao, Chongyang Tao, Ji-Rong Wen, Rui Yan

    Abstract: Knowledge-grounded conversation (KGC) shows great potential in building an engaging and knowledgeable chatbot, and knowledge selection is a key ingredient in it. However, previous methods for knowledge selection only concentrate on the relevance between knowledge and dialogue context, ignoring the fact that age, hobby, education and life experience of an interlocutor have a major effect on his or… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Accepted by ACL 2022 (main conference). First two authors contributed equally

  33. Model BOSS & eBOSS Luminous Red Galaxies at 0.2 < z < 1.0 using SubHalo Abundance Matching with 3 parameters

    Authors: Jiaxi Yu, Cheng Zhao, Chia-Hsun Chuang, Julian Bautista, Ginevra Favole, Jean-Paul Kneib, Faizan Mohammad, Ashley Ross, Anand Raichoor, Charling Tao, Kyle Dawson, Graziano Rossi

    Abstract: SubHalo Abundance Matching (SHAM) is an empirical method for constructing galaxy catalogues based on high-resolution $N$-body simulations. We apply SHAM on the UNIT simulation to simulate SDSS BOSS/eBOSS Luminous Red Galaxies (LRGs) within a wide redshift range of $0.2 < z < 1.0$. Besides the typical SHAM scatter parameter $σ$, we include $v_{\rm smear}$ and $V_{\rm ceil}$ to take into account the… ▽ More

    Submitted 28 July, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  34. arXiv:2203.10705  [pdf, other

    cs.CL cs.CV

    Compression of Generative Pre-trained Language Models via Quantization

    Authors: Chaofan Tao, Lu Hou, Wei Zhang, Lifeng Shang, Xin Jiang, Qun Liu, Ping Luo, Ngai Wong

    Abstract: The increasing size of generative Pre-trained Language Models (PLMs) has greatly increased the demand for model compression. Despite various methods to compress BERT or its variants, there are few attempts to compress generative PLMs, and the underlying difficulty remains unclear. In this paper, we compress generative PLMs by quantization. We find that previous quantization methods fail on generat… ▽ More

    Submitted 16 July, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  35. arXiv:2203.10067  [pdf, other

    cs.RO eess.SY

    Sampling Complexity of Path Integral Methods for Trajectory Optimization

    Authors: Hyung-Jin Yoon, Chuyuan Tao, Hunmin Kim, Naira Hovakimyan, Petros Voulgaris

    Abstract: The use of random sampling in decision-making and control has become popular with the ease of access to graphic processing units that can generate and calculate multiple random trajectories for real-time robotic applications. In contrast to sequential optimization, the sampling-based method can take advantage of parallel computing to maintain constant control loop frequencies. Inspired by its wide… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted in American Control Conference 2022

  36. arXiv:2203.08739  [pdf, other

    cs.CV cs.LG

    What Do Adversarially trained Neural Networks Focus: A Fourier Domain-based Study

    Authors: Binxiao Huang, Chaofan Tao, Rui Lin, Ngai Wong

    Abstract: Although many fields have witnessed the superior performance brought about by deep learning, the robustness of neural networks remains an open issue. Specifically, a small adversarial perturbation on the input may cause the model to produce a completely different output. Such poor robustness implies many potential hazards, especially in security-critical applications, e.g., autonomous driving and… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

  37. arXiv:2203.08517  [pdf, other

    cs.CL cs.AI

    TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge

    Authors: Chao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang

    Abstract: Generating natural and informative texts has been a long-standing problem in NLP. Much effort has been dedicated into incorporating pre-trained language models (PLMs) with various open-world knowledge, such as knowledge graphs or wiki pages. However, their ability to access and manipulate the task-specific knowledge is still limited on downstream tasks, as this type of knowledge is usually not wel… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted by Findings of ACL 2022

  38. arXiv:2203.08500  [pdf, other

    cs.CL

    HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations

    Authors: Jia-Chen Gu, Chao-Hong Tan, Chongyang Tao, Zhen-Hua Ling, Huang Hu, Xiubo Geng, Daxin Jiang

    Abstract: Recently, various response generation models for two-party conversations have achieved impressive improvements, but less effort has been paid to multi-party conversations (MPCs) which are more practical and complicated. Compared with a two-party conversation where a dialogue context is a sequence of utterances, building a response generation model for MPCs is more challenging, since there exist co… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted by ACL 2022

  39. MVD: Memory-Related Vulnerability Detection Based on Flow-Sensitive Graph Neural Networks

    Authors: Sicong Cao, Xiaobing Sun, Lili Bo, Rongxin Wu, Bin Li, Chuanqi Tao

    Abstract: Memory-related vulnerabilities constitute severe threats to the security of modern software. Despite the success of deep learning-based approaches to generic vulnerability detection, they are still limited by the underutilization of flow information when applied for detecting memory-related vulnerabilities, leading to high false positives. In this paper,we propose MVD, a statement-level Memory-r… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

    Comments: To appear in the Technical Track of ICSE 2022

  40. arXiv:2202.12499  [pdf, other

    cs.CL

    PromDA: Prompt-based Data Augmentation for Low-Resource NLU Tasks

    Authors: Yufei Wang, Can Xu, Qingfeng Sun, Huang Hu, Chongyang Tao, Xiubo Geng, Daxin Jiang

    Abstract: This paper focuses on the Data Augmentation for low-resource Natural Language Understanding (NLU) tasks. We propose Prompt-based D}ata Augmentation model (PromDA) which only trains small-scale Soft Prompt (i.e., a set of trainable vectors) in the frozen Pre-trained Language Models (PLMs). This avoids human effort in collecting unlabeled in-domain data and maintains the quality of generated synthet… ▽ More

    Submitted 17 March, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: Accepted to ACL 2022 Main Conference, Camera-Ready Version

  41. Mining On Alzheimer's Diseases Related Knowledge Graph to Identity Potential AD-related Semantic Triples for Drug Repurposing

    Authors: Yi Nian, Xinyue Hu, Rui Zhang, Jingna Feng, Jingcheng Du, Fang Li, Yong Chen, Cui Tao

    Abstract: To date, there are no effective treatments for most neurodegenerative diseases. Knowledge graphs can provide comprehensive and semantic representation for heterogeneous data, and have been successfully leveraged in many biomedical applications including drug repurposing. Our objective is to construct a knowledge graph from literature to study relations between Alzheimer's disease (AD) and chemical… ▽ More

    Submitted 28 November, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: BMC Bioinformatics

    Journal ref: BMC Bioinformatics volume 23, Article number: 407 (2022)

  42. Observation of the $π^2σ^2$-bond linear-chain molecular structure in $^{16}$C

    Authors: J. X. Han, Y. Liu, Y. L. Ye, J. L. Lou, X. F. Yang, T. Baba, M. Kimura, B. Yang, Z. H. Li, Q. T. Li, J. Y. Xu, Y. C. Ge, H. Hua, Z. H. Yang, J. S. Wang, Y. Y. Yang, P. Ma, Z. Bai, Q. Hu, W. Liu, K. Ma, L. C. Tao, Y. Jiang, L. Y. Hu, H. L. Zang , et al. (15 additional authors not shown)

    Abstract: Measurements of the $^2$H($^{16}$C,$^{16}$C$^{*}$$\rightarrow^4$He+$^{12}$Be or $^6$He+$^{10}$Be)$^2$H inelastic excitation and cluster-decay reactions have been carried out at a beam energy of about 23.5 MeV/u. A specially designed detection system, including one multi-layer silicon-strip telescope at around zero degrees, has allowed the high-efficiency three-fold coincident detection and therefo… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: 13 pages, 10 figures

  43. arXiv:2202.03594  [pdf, other

    math.MG

    Perfectly packing a square by squares of nearly harmonic sidelength

    Authors: Terence Tao

    Abstract: A well known open problem of Meir and Moser asks if the squares of sidelength $1/n$ for $n \geq 2$ can be packed perfectly into a square of area $\sum_{n=2}^\infty \frac{1}{n^2} = \frac{π^2}{6}-1$. In this paper we show that for any $1/2 < t < 1$, and any $n_0$ that is sufficiently large depending on $t$, the squares of sidelength $n^{-t}$ for $n \geq n_0$ can be packed perfectly into a square of… ▽ More

    Submitted 10 March, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 11 pages, 1 figure. Several minor corrections

    MSC Class: 52C15

  44. arXiv:2201.12093  [pdf, other

    cs.CL

    PCL: Peer-Contrastive Learning with Diverse Augmentations for Unsupervised Sentence Embeddings

    Authors: Qiyu Wu, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Daxin Jiang

    Abstract: Learning sentence embeddings in an unsupervised manner is fundamental in natural language processing. Recent common practice is to couple pre-trained language models with unsupervised contrastive learning, whose success relies on augmenting a sentence with a semantically-close positive instance to construct contrastive pairs. Nonetheless, existing approaches usually depend on a mono-augmenting str… ▽ More

    Submitted 19 October, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: To appear at EMNLP 2022

  45. Rubin-Euclid Derived Data Products: Initial Recommendations

    Authors: Leanne P. Guy, Jean-Charles Cuillandre, Etienne Bachelet, Manda Banerji, Franz E. Bauer, Thomas Collett, Christopher J. Conselice, Siegfried Eggl, Annette Ferguson, Adriano Fontana, Catherine Heymans, Isobel M. Hook, Éric Aubourg, Hervé Aussel, James Bosch, Benoit Carry, Henk Hoekstra, Konrad Kuijken, Francois Lanusse, Peter Melchior, Joseph Mohr, Michele Moresco, Reiko Nakajima, Stéphane Paltani, Michael Troxel , et al. (95 additional authors not shown)

    Abstract: This report is the result of a joint discussion between the Rubin and Euclid scientific communities. The work presented in this report was focused on designing and recommending an initial set of Derived Data products (DDPs) that could realize the science goals enabled by joint processing. All interested Rubin and Euclid data rights holders were invited to contribute via an online discussion forum… ▽ More

    Submitted 13 October, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: Report of the Rubin-Euclid Derived Data Products Working Group, 78 pages, 11 figures

  46. arXiv:2112.13759  [pdf, other

    math.CO

    The inverse theorem for the $U^3$ Gowers uniformity norm on arbitrary finite abelian groups: Fourier-analytic and ergodic approaches

    Authors: Asgar Jamneshan, Terence Tao

    Abstract: We state and prove a quantitative inverse theorem for the Gowers uniformity norm $U^3(G)$ on an arbitrary finite abelian group $G$; the cases when $G$ was of odd order or a vector space over ${\mathbf F}_2$ had previously been established by Green and the second author and by Samorodnitsky respectively by Fourier-analytic methods, which we also employ here. We also prove a qualitative version of t… ▽ More

    Submitted 1 August, 2023; v1 submitted 27 December, 2021; originally announced December 2021.

    Comments: 48 pages, no figures. This is the published version

    MSC Class: 11B30; 28E05; 37A15

  47. arXiv:2112.05141  [pdf, other

    cs.CV

    Exploring the Equivalence of Siamese Self-Supervised Learning via A Unified Gradient Framework

    Authors: Chenxin Tao, Honghui Wang, Xizhou Zhu, Jiahua Dong, Shiji Song, Gao Huang, Jifeng Dai

    Abstract: Self-supervised learning has shown its great potential to extract powerful visual representations without human annotations. Various works are proposed to deal with self-supervised learning from different perspectives: (1) contrastive learning methods (e.g., MoCo, SimCLR) utilize both positive and negative samples to guide the training direction; (2) asymmetric network methods (e.g., BYOL, SimSiam… ▽ More

    Submitted 5 July, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: CVPR2022

  48. arXiv:2112.05138  [pdf, other

    cs.CV

    Searching Parameterized AP Loss for Object Detection

    Authors: Chenxin Tao, Zizhang Li, Xizhou Zhu, Gao Huang, Yong Liu, Jifeng Dai

    Abstract: Loss functions play an important role in training deep-network-based object detectors. The most widely used evaluation metric for object detection is Average Precision (AP), which captures the performance of localization and classification sub-tasks simultaneously. However, due to the non-differentiable nature of the AP metric, traditional object detectors adopt separate differentiable losses for… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted by NeurIPS 2021

  49. arXiv:2112.02056  [pdf, ps, other

    math.DS

    The structure of arbitrary Conze-Lesigne systems

    Authors: Asgar Jamneshan, Or Shalom, Terence Tao

    Abstract: Let $Γ$ be a countable abelian group. An (abstract) $Γ$-system $\mathrm{X}$ - that is, an (abstract) probability space equipped with an (abstract) probability-preserving action of $Γ$ - is said to be a Conze-Lesigne system if it is equal to its second Host-Kra-Ziegler factor $\mathrm{Z}^2(\mathrm{X})$. The main result of this paper is a structural description of such Conze-Lesigne systems for arbi… ▽ More

    Submitted 18 February, 2024; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 69 pages, [v2]: Final version accepted for publication in Communications of the AMS

    MSC Class: 37A35

  50. arXiv:2111.10831  [pdf, other

    cs.LG cs.AI cs.NE

    Learning by Active Forgetting for Neural Networks

    Authors: Jian Peng, Xian Sun, Min Deng, Chao Tao, Bo Tang, Wenbo Li, Guohua Wu, QingZhu, Yu Liu, Tao Lin, Haifeng Li

    Abstract: Remembering and forgetting mechanisms are two sides of the same coin in a human learning-memory system. Inspired by human brain memory mechanisms, modern machine learning systems have been working to endow machine with lifelong learning capability through better remembering while pushing the forgetting as the antagonist to overcome. Nevertheless, this idea might only see the half picture. Up until… ▽ More

    Submitted 21 November, 2021; originally announced November 2021.

    Comments: 13 pages, 5 figures