Showing 1–50 of 54 results for author: Hui, C

  1. arXiv:2506.20200  [pdf, ps, other]

    eess.IV cs.CV

    MS-IQA: A Multi-Scale Feature Fusion Network for PET/CT Image Quality Assessment

    Authors: Siqiao Li, Chen Hui, Wei Zhang, Rui Liang, Chenyue Song, Feng Jiang, Haiqi Zhu, Zhixuan Li, Hong Huang, Xiang Li

    Abstract: Positron Emission Tomography / Computed Tomography (PET/CT) plays a critical role in medical imaging, combining functional and anatomical information to aid in accurate diagnosis. However, image quality degradation due to noise, compression and other factors could potentially lead to diagnostic uncertainty and increase the risk of misdiagnosis. When evaluating the quality of a PET/CT image, both l… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: Accepted to MICCAI 2025

  2. arXiv:2506.17983  [pdf, ps, other]

    eess.IV cs.CV

    LVPNet: A Latent-variable-based Prediction-driven End-to-end Framework for Lossless Compression of Medical Images

    Authors: Chenyue Song, Chen Hui, Qing Lin, Wei Zhang, Siqiao Li, Haiqi Zhu, Zhixuan Li, Shengping Zhang, Shaohui Liu, Feng Jiang, Xiang Li

    Abstract: Autoregressive Initial Bits is a framework that integrates sub-image autoregression and latent variable modeling, demonstrating its advantages in lossless medical image compression. However, in existing methods, the image segmentation process leads to an even distribution of latent variable information across each sub-image, which in turn causes posterior collapse and inefficient utilization of la… ▽ More

    Submitted 25 June, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

    Comments: Accepted to MICCAI 2025

  3. arXiv:2506.17969  [pdf, ps, other]

    cs.CV

    BPCLIP: A Bottom-up Image Quality Assessment from Distortion to Semantics Based on CLIP

    Authors: Chenyue Song, Chen Hui, Wei Zhang, Haiqi Zhu, Shaohui Liu, Hong Huang, Feng Jiang

    Abstract: Image Quality Assessment (IQA) aims to evaluate the perceptual quality of images based on human subjective perception. Existing methods generally combine multiscale features to achieve high performance, but most rely on straightforward linear fusion of these features, which may not adequately capture the impact of distortions on semantic content. To address this, we propose a bottom-up image quali… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: Accepted to ICME 2025

  4. arXiv:2504.16003  [pdf, other]

    cs.CV

    MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment

    Authors: Yachun Mi, Yu Li, Weicheng Meng, Chaofeng Chen, Chen Hui, Shaohui Liu

    Abstract: The rapid growth of long-duration, high-definition videos has made efficient video quality assessment (VQA) a critical challenge. Existing research typically tackles this problem through two main strategies: reducing model parameters and resampling inputs. However, lightweight Convolutional Neural Networks (CNNs) and Transformers often struggle to balance efficiency with high performance due to the… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  5. arXiv:2504.13475  [pdf, other]

    cs.CL

    LLM Sensitivity Evaluation Framework for Clinical Diagnosis

    Authors: Chenwei Yan, Xiangling Fu, Yuxuan Xiong, Tianyi Wang, Siu Cheung Hui, Ji Wu, Xien Liu

    Abstract: Large language models (LLMs) have demonstrated impressive performance across various domains. However, for clinical diagnosis, higher expectations are placed on LLMs' reliability and sensitivity: thinking like physicians and remaining sensitive to key medical information that affects diagnostic reasoning, as subtle variations can lead to different diagnosis results. Yet, existing works focus ma… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    Journal ref: Proceedings of the 31st International Conference on Computational Linguistics, 2025

  6. arXiv:2504.06935  [pdf]

    cs.LG

    ASRL: A robust loss function with potential for development

    Authors: Chenyu Hui, Anran Zhang, Xintong Li

    Abstract: In this article, we propose a partition-wise robust loss function based on the previous robust loss function. The characteristics of this loss function are that it achieves high robustness and a wide range of applicability through partition-wise design and adaptive parameter adjustment. Finally, the advantages and development potential of this loss function were verified by applying this loss fun… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: five pages and three figures

  7. arXiv:2504.05020  [pdf, other]

    cs.CL cs.AI

    Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data

    Authors: Charco Hui, Yalu Wen

    Abstract: Natural language processing models often face challenges due to limited labeled data, especially in domain-specific areas, e.g., clinical trials. To overcome this, text augmentation techniques are commonly used to increase sample size by transforming the original input data into artificial ones with the label preserved. However, traditional text classification methods ignore the relationship bet… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  8. arXiv:2503.03799  [pdf, other]

    cs.LG astro-ph.HE gr-qc

    DeepGrav: Anomalous Gravitational-Wave Detection Through Deep Latent Features

    Authors: Jianqi Yan, Alex P. Leung, Zhiyuan Pei, David C. Y. Hui, Sangin Kim

    Abstract: This work introduces a novel deep learning-based approach for gravitational wave anomaly detection, aiming to overcome the limitations of traditional matched filtering techniques in identifying unknown waveform gravitational wave signals. We introduce a modified convolutional neural network architecture inspired by ResNet that leverages residual blocks to extract high-dimensional features, effecti… ▽ More

    Submitted 15 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: 6 pages, 3 figures, A concise introduction to the winning solution for NSF HDR A3D3 GW challenge. Our training code is publicly available at https://github.com/yan123yan/HDR-anomaly-challenge-submission

  9. arXiv:2503.02112  [pdf, other]

    cs.LG astro-ph.IM

    Building Machine Learning Challenges for Anomaly Detection in Science

    Authors: Elizabeth G. Campolongo, Yuan-Tang Chou, Ekaterina Govorkova, Wahid Bhimji, Wei-Lun Chao, Chris Harris, Shih-Chieh Hsu, Hilmar Lapp, Mark S. Neubauer, Josephine Namayanja, Aneesh Subramanian, Philip Harris, Advaith Anand, David E. Carlyn, Subhankar Ghosh, Christopher Lawrence, Eric Moreno, Ryan Raikman, Jiaman Wu, Ziheng Zhang, Bayu Adhi, Mohammad Ahmadi Gharehtoragh, Saúl Alonso Monsalve, Marta Babicz, Furqan Baig, et al. (125 additional authors not shown)

    Abstract: Scientific discoveries are often made by finding a pattern or object that was not predicted by the known rules of science. Oftentimes, these anomalous events or objects that do not conform to the norms are an indication that the rules of science governing the data are incomplete, and something new needs to be present to explain these unexpected outliers. The challenge of finding anomalies can be c… ▽ More

    Submitted 29 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: 17 pages, 6 figures; to be submitted to Nature Communications

  10. arXiv:2408.06583  [pdf, other]

    cs.CL

    A Structure-aware Generative Model for Biomedical Event Extraction

    Authors: Haohan Yuan, Siu Cheung Hui, Haopeng Zhang

    Abstract: Biomedical Event Extraction (BEE) is a challenging task that involves modeling complex relationships between fine-grained entities in biomedical text. BEE has traditionally been formulated as a classification problem. With recent advancements in large language models (LLMs), generation-based models that cast event extraction as a sequence generation problem have attracted attention in the NLP rese… ▽ More

    Submitted 20 February, 2025; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: 8 pages, 4 figures, 6 tables

  11. arXiv:2407.12851  [pdf]

    cs.CL

    ISPO: An Integrated Ontology of Symptom Phenotypes for Semantic Integration of Traditional Chinese Medical Data

    Authors: Zixin Shu, Rui Hua, Dengying Yan, Chenxia Lu, Ning Xu, Jun Li, Hui Zhu, Jia Zhang, Dan Zhao, Chenyang Hui, Junqiu Ye, Chu Liao, Qi Hao, Wen Ye, Cheng Luo, Xinyan Wang, Chuang Cheng, Xiaodong Li, Baoyan Liu, Xiaji Zhou, Runshun Zhang, Min Xu, Xuezhong Zhou

    Abstract: Symptom phenotypes are one of the key types of manifestations for diagnosis and treatment of various disease conditions. However, the diversity of symptom terminologies is one of the major obstacles hindering the analysis and knowledge sharing of various types of symptom-related medical data particularly in the fields of Traditional Chinese Medicine (TCM). Objective: This study aimed to construct… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 39 pages, 6 figures, 6 tables

  12. arXiv:2404.17170  [pdf, other]

    cs.CV eess.IV

    Image Quality Assessment With Compressed Sampling

    Authors: Ronghua Liao, Chen Hui, Lang Yuan, Haiqi Zhu, Feng Jiang

    Abstract: No-Reference Image Quality Assessment (NR-IQA) aims at estimating image quality in accordance with subjective human perception. However, most methods focus on exploring increasingly complex networks to improve the final performance, accompanied by limitations on input images. Especially when applied to high-resolution (HR) images, these methods often have to adjust the size of original image to mee… ▽ More

    Submitted 11 September, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  13. arXiv:2401.08947  [pdf]

    cs.CR cs.LG

    AntiPhishStack: LSTM-based Stacked Generalization Model for Optimized Phishing URL Detection

    Authors: Saba Aslam, Hafsa Aslam, Arslan Manzoor, Chen Hui, Abdur Rasool

    Abstract: The escalating reliance on revolutionary online web services has introduced heightened security risks, with persistent challenges posed by phishing despite extensive security measures. Traditional phishing systems, reliant on machine learning and manual features, struggle with evolving tactics. Recent advances in deep learning offer promising avenues for tackling novel phishing challenges and mali… ▽ More

    Submitted 21 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  14. arXiv:2311.07090  [pdf, other]

    cs.CV

    CLiF-VQA: Enhancing Video Quality Assessment by Incorporating High-Level Semantic Information related to Human Feelings

    Authors: Yachun Mi, Yu Li, Yan Shu, Chen Hui, Puchao Zhou, Shaohui Liu

    Abstract: Video Quality Assessment (VQA) aims to simulate the process of perceiving video quality by the human visual system (HVS). The judgments made by HVS are always influenced by human subjective feelings. However, most of the current VQA research focuses on capturing various distortions in the spatial and temporal domains of videos, while ignoring the impact of human feelings. In this paper, we propose… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  15. arXiv:2311.02358  [pdf]

    eess.IV cs.CV

    Domain Transfer in Latent Space (DTLS) Wins on Image Super-Resolution -- a Non-Denoising Model

    Authors: Chun-Chuen Hui, Wan-Chi Siu, Ngai-Fong Law

    Abstract: Large scale image super-resolution is a challenging computer vision task, since vast information is missing in a highly degraded image, say for example for scale x16 super-resolution. Diffusion models are used successfully in recent years in extreme super-resolution applications, in which Gaussian noise is used as a means to form a latent photo-realistic space, and acts as a link between the space… ▽ More

    Submitted 21 December, 2023; v1 submitted 4 November, 2023; originally announced November 2023.

  16. arXiv:2304.07473  [pdf, ps, other]

    cs.CV eess.IV

    Hierarchical Interactive Reconstruction Network For Video Compressive Sensing

    Authors: Tong Zhang, Wenxue Cui, Chen Hui, Feng Jiang

    Abstract: Deep network-based image and video Compressive Sensing (CS) has attracted increasing attention in recent years. However, in the existing deep network-based CS methods, a simple stacked convolutional network is usually adopted, which not only weakens the perception of rich contextual prior knowledge, but also limits the exploration of the correlations between temporal video frames. In this paper, w… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

  17. arXiv:2303.09041  [pdf, other]

    cs.LG

    A Multimodal Data-driven Framework for Anxiety Screening

    Authors: Haimiao Mo, Shuai Ding, Siu Cheung Hui

    Abstract: Early screening for anxiety and appropriate interventions are essential to reduce the incidence of self-harm and suicide in patients. Due to limited medical resources, traditional methods that overly rely on physician expertise and specialized equipment cannot simultaneously meet the needs for high accuracy and model interpretability. Multimodal data can provide more objective evidence for anxiety… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  18. arXiv:2302.08220  [pdf, other]

    cs.CL

    Dialogue State Distillation Network with Inter-slot Contrastive Learning for Dialogue State Tracking

    Authors: Jing Xu, Dandan Song, Chong Liu, Siu Cheung Hui, Fei Li, Qiang Ju, Xiaonan He, Jian Xie

    Abstract: In task-oriented dialogue systems, Dialogue State Tracking (DST) aims to extract users' intentions from the dialogue history. Currently, most existing approaches suffer from error propagation and are unable to dynamically select relevant information when utilizing previous dialogue states. Moreover, the relations between the updates of different slots provide vital clues for DST. However, the exis… ▽ More

    Submitted 7 March, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Accepted by AAAI 2023

  19. arXiv:2211.00709  [pdf, other]

    cs.CL

    Semantic Pivoting Model for Effective Event Detection

    Authors: Anran Hao, Siu Cheung Hui, Jian Su

    Abstract: Event Detection, which aims to identify and classify mentions of event instances from unstructured articles, is an important task in Natural Language Processing (NLP). Existing techniques for event detection only use homogeneous one-hot vectors to represent the event type classes, ignoring the fact that the semantic meaning of the types is important to the task. Such an approach is inefficient and… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 11 pages, 4 figures; Accepted to ACIIDS 2022

  20. SoccerNet 2022 Challenges Results

    Authors: Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, et al. (69 additional authors not shown)

    Abstract: The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team. In 2022, the challenges were composed of 6 vision-based tasks: (1) action spotting, focusing on retrieving action timestamps in long untrimmed videos, (2) replay grounding, focusing on retrieving the live moment of an action shown in a replay, (3) pitch localization, focusing on det… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at ACM MMSports 2022

  21. arXiv:2207.04001  [pdf, other]

    astro-ph.HE astro-ph.IM cs.LG gr-qc

    On Improving the Performance of Glitch Classification for Gravitational Wave Detection by using Generative Adversarial Networks

    Authors: Jianqi Yan, Alex P. Leung, David C. Y. Hui

    Abstract: Spectrogram classification plays an important role in analyzing gravitational wave data. In this paper, we propose a framework to improve the classification performance by using Generative Adversarial Networks (GANs). As substantial efforts and expertise are required to annotate spectrograms, the number of training examples is very limited. However, it is well known that deep networks can perform… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: Accepted for publication in MNRAS, 16 pages, 14 figures, 5 tables

  22. arXiv:2201.10978  [pdf, other]

    cs.IR cs.CL cs.LG

    Machine Learning for Food Review and Recommendation

    Authors: Tan Khang Le, Siu Cheung Hui

    Abstract: Food reviews and recommendations have always been important for online food service websites. However, reviewing and recommending food is not simple as it is likely to be overwhelmed by disparate contexts and meanings. In this paper, we use different deep learning approaches to address the problems of sentiment analysis, automatic review tag generation, and retrieval of food reviews. We propose to… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: Accepted paper to International Student Conference on Artificial Intelligence (STCAI) 2021

  23. arXiv:2112.04087  [pdf, other]

    cs.AI

    Improving Knowledge Graph Representation Learning by Structure Contextual Pre-training

    Authors: Ganqiang Ye, Wen Zhang, Zhen Bi, Chi Man Wong, Chen Hui, Huajun Chen

    Abstract: Representation learning models for Knowledge Graphs (KG) have proven to be effective in encoding structural information and performing reasoning over KGs. In this paper, we propose a novel pre-training-then-fine-tuning framework for knowledge graph representation learning, in which a KG model is firstly pre-trained with triple classification task, followed by discriminative fine-tuning on specific… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted to IJCKG 2021

  24. arXiv:2102.08597  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters

    Authors: Aston Zhang, Yi Tay, Shuai Zhang, Alvin Chan, Anh Tuan Luu, Siu Cheung Hui, Jie Fu

    Abstract: Recent works have demonstrated reasonable success of representation learning in hypercomplex space. Specifically, "fully-connected layers with Quaternions" (4D hypercomplex numbers), which replace real-valued matrix multiplications in fully-connected layers with Hamilton products of Quaternions, both enjoy parameter savings with only 1/4 learnable parameters and achieve comparable performance in v… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Published as a conference paper at the 9th International Conference on Learning Representations (ICLR 2021)

  25. arXiv:2010.02469  [pdf, other]

    cs.LG stat.CO stat.ML

    Generalized Matrix Factorization: efficient algorithms for fitting generalized linear latent variable models to large data arrays

    Authors: Łukasz Kidziński, Francis K. C. Hui, David I. Warton, Trevor Hastie

    Abstract: Unmeasured or latent variables are often the cause of correlations between multivariate measurements, which are studied in a variety of fields such as psychology, ecology, and medicine. For Gaussian measurements, there are classical tools such as factor analysis or principal component analysis with a well-established theory and fast algorithms. Generalized Linear Latent Variable models (GLLVMs) ge… ▽ More

    Submitted 27 January, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

  26. A Reliable Gravity Compensation Control Strategy for dVRK Robotic Arms With Nonlinear Disturbance Forces

    Authors: Hongbin Lin, C. W. Vincent Hui, Yan Wang, Anton Deguet, Peter Kazanzides, K. W. Samuel Au

    Abstract: External disturbance forces caused by nonlinear springy electrical cables in the Master Tool Manipulator (MTM) of the da Vinci Research Kit (dVRK) limit the usage of the existing gravity compensation methods. Significant motion drifts at the MTM tip are often observed when the MTM is located far from its identification trajectory, preventing the usage of these methods for the entire workspace rel… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

    Journal ref: IEEE Robotics and Automation Letters 4.4 (2019): 3892-3899

  27. arXiv:1909.01727  [pdf, other]

    cs.IR

    Heterogeneous Collaborative Filtering

    Authors: Yifang Liu, Zhentao Xu, Cong Hui, Yi Xuan, Jessie Chen, Yuanming Shan

    Abstract: A recommendation system is important to a content sharing/creating social network. Collaborative filtering is a widely-adopted technology in conventional recommenders, which is based on similarity between positively engaged content items involving the same users. Conventional collaborative filtering (CCF) suffers from the cold-start problem and narrow content diversity. We propose a new recommendation a… ▽ More

    Submitted 31 August, 2019; originally announced September 2019.

  28. arXiv:1907.00782  [pdf, other]

    cs.CR cs.CY cs.DB cs.LG

    Collecting and Analyzing Multidimensional Data with Local Differential Privacy

    Authors: Ning Wang, Xiaokui Xiao, Yin Yang, Jun Zhao, Siu Cheung Hui, Hyejin Shin, Junbum Shin, Ge Yu

    Abstract: Local differential privacy (LDP) is a recently proposed privacy standard for collecting and analyzing data, which has been used, e.g., in the Chrome browser, iOS and macOS. In LDP, each user perturbs her information locally, and only sends the randomized version to an aggregator who performs analyses, which protects both the users and the aggregator against private information leaks. Although LDP… ▽ More

    Submitted 28 June, 2019; originally announced July 2019.

    Comments: 12-Page Full Paper in Proceedings of the 2019 IEEE International Conference on Data Engineering (ICDE). arXiv admin note: text overlap with arXiv:1606.05053

    MSC Class: Local differential privacy; multidimensional data; stochastic gradient descent

  29. arXiv:1906.04393  [pdf, other]

    cs.CL cs.LG

    Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks

    Authors: Yi Tay, Aston Zhang, Luu Anh Tuan, Jinfeng Rao, Shuai Zhang, Shuohang Wang, Jie Fu, Siu Cheung Hui

    Abstract: Many state-of-the-art neural models for NLP are heavily parameterized and thus memory inefficient. This paper proposes a series of lightweight and memory efficient neural architectures for a potpourri of natural language processing (NLP) tasks. To this end, our models exploit computation using Quaternion algebra and hypercomplex spaces, enabling not only expressive inter-component interactions but… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: ACL 2019

  30. arXiv:1905.10847  [pdf, other]

    cs.CL cs.AI cs.IR

    Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives

    Authors: Yi Tay, Shuohang Wang, Luu Anh Tuan, Jie Fu, Minh C. Phan, Xingdi Yuan, Jinfeng Rao, Siu Cheung Hui, Aston Zhang

    Abstract: This paper tackles the problem of reading comprehension over long narratives where documents easily span over thousands of tokens. We propose a curriculum learning (CL) based Pointer-Generator framework for reading/sampling over large documents, enabling diverse training of the neural model based on the notion of alternating contextual difficulty. This can be interpreted as a form of domain random… ▽ More

    Submitted 26 May, 2019; originally announced May 2019.

    Comments: Accepted to ACL 2019

  31. arXiv:1811.09786  [pdf, other]

    cs.CL cs.AI cs.IR cs.NE

    Recurrently Controlled Recurrent Networks

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Recurrent neural networks (RNNs) such as long short-term memory and gated recurrent units are pivotal building blocks across a broad spectrum of sequence modeling problems. This paper proposes a recurrently controlled recurrent network (RCRN) for expressive and powerful sequence encoding. More concretely, the key idea behind our approach is to learn the recurrent gating functions using recurrent n… ▽ More

    Submitted 24 November, 2018; originally announced November 2018.

    Comments: NIPS 2018

  32. arXiv:1811.04210  [pdf, other]

    cs.CL cs.AI cs.IR cs.NE

    Densely Connected Attention Propagation for Reading Comprehension

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui, Jian Su

    Abstract: We propose DecaProp (Densely Connected Attention Propagation), a new densely connected neural architecture for reading comprehension (RC). There are two distinct characteristics of our model. Firstly, our model densely connects all pairwise layers of the network, modeling relationships between passage and query across all hierarchical levels. Secondly, the dense connectors in our network are learn… ▽ More

    Submitted 2 April, 2019; v1 submitted 10 November, 2018; originally announced November 2018.

    Comments: NIPS 2018

  33. arXiv:1810.02938  [pdf, other]

    cs.CL cs.AI cs.IR

    Co-Stack Residual Affinity Networks with Multi-level Attention Refinement for Matching Text Sequences

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Learning a matching function between two text sequences is a long standing problem in NLP research. This task enables many potential applications such as question answering and paraphrase identification. This paper proposes Co-Stack Residual Affinity Networks (CSRAN), a new and universal neural architecture for this problem. CSRAN is a deep architecture, involving stacked (multi-layered) recurrent… ▽ More

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: EMNLP 2018

  34. arXiv:1806.06446   

    cs.IR cs.AI cs.LG cs.NE

    Self-Attentive Neural Collaborative Filtering

    Authors: Yi Tay, Shuai Zhang, Luu Anh Tuan, Siu Cheung Hui

    Abstract: This paper has been withdrawn as we discovered a bug in our TensorFlow implementation that involved accidental mixing of vectors across batches. This led to different inference results given different batch sizes which is completely strange. The performance scores still remain the same but we concluded that it was not the self-attention that contributed to the performance. We are withdrawing the… ▽ More

    Submitted 19 July, 2018; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: We discovered a bug in our TensorFlow implementation that involved accidental mixing of vectors across batches, rendering the main claim of the paper incorrect. We are withdrawing this paper until we find out why

  35. arXiv:1806.00778  [pdf, other]

    cs.CL cs.AI cs.IR

    Multi-Cast Attention Networks for Retrieval-based Question Answering and Response Prediction

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Attention is typically used to select informative sub-phrases that are used for prediction. This paper investigates the novel use of attention as a form of feature augmentation, i.e., casted attention. We propose Multi-Cast Attention Networks (MCAN), a new attention mechanism and general model architecture for a potpourri of ranking tasks in the conversational modeling and question answering domain… ▽ More

    Submitted 3 June, 2018; originally announced June 2018.

    Comments: Accepted to KDD 2018 (Paper titled only "Multi-Cast Attention Networks" in KDD version)

  36. arXiv:1805.11535  [pdf, other]

    cs.CL cs.AI cs.IR cs.NE

    CoupleNet: Paying Attention to Couples with Coupled Attention for Relationship Recommendation

    Authors: Yi Tay, Anh Tuan Luu, Siu Cheung Hui

    Abstract: Dating and romantic relationships not only play a huge role in our personal lives but also collectively influence and shape society. Today, many romantic partnerships originate from the Internet, signifying the importance of technology and the web in modern dating. In this paper, we present a text-based computational approach for estimating the relationship compatibility of two users on social med… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: Accepted at ICWSM 2018

  37. arXiv:1805.11426  [pdf]

    cs.OH

    Standard Cell Library Evaluation with Multiple lithography-compliant verification and Improved Synopsys Pin Access Checking Utility

    Authors: Yongfu Li, Wan Chia Ang, Chin Hui Lee, Kok Peng Chua, Yoong Seang Jonathan Ong, Chiu Wing Colin Hui

    Abstract: While standard cell layouts are drawn with minimum design rules to maximize the benefit of design area shrinkage, the complicated design rules have caused difficulties with signal routes accessing the pins in standard cell layouts. As a result, it has become a great challenge for physical layout designers to design a standard cell layout that is optimized for area, power, timing, signal integrity,… ▽ More

    Submitted 27 May, 2018; originally announced May 2018.

    Comments: Synopsys User Group Singapore (SNUG) 2017. arXiv admin note: substantial text overlap with arXiv:1805.10012, arXiv:1805.10745

  38. arXiv:1805.10745  [pdf]

    cs.OH

    Multiple-Lithography-Compliant Verification for Standard Cell Library Development Flow

    Authors: Yongfu Li, Wan Chia Ang, Chin Hui Lee, Kok Peng Chua, Yoong Seang Jonathan Ong, Chiu Wing Colin Hui

    Abstract: Starting from 22-nm, a standard cell must be designed to be fully lithography-compliant, which includes Design Rule Check, Design-for-Manufacturability and Double-Patterning compliance. It has become a great challenge for physical layout designers to provide a fully lithography-compliant standard cell layout that is optimized for area, power, timing, signal integrity, and yield. This challenge is fur… ▽ More

    Submitted 27 May, 2018; originally announced May 2018.

    Comments: Synopsys User Group Silicon Valley (SNUG) 2017

  39. arXiv:1805.10012  [pdf]

    cs.OH

    Constraining the Synopsys Pin Access Checker Utility for Improved Standard Cells Library Verification Flow

    Authors: Yongfu Li, Chin Hui Lee, Wan Chia Ang, Kok Peng Chua, Yoong Seang Jonathan Ong, Chiu Wing Colin Hui

    Abstract: While standard cell layouts are drawn with minimum design rules for maximum benefit of design area shrinkage, the complicated design rules begin to cause difficulties with signal routes accessing the pins in standard cell layouts. Multiple design iterations are required to resolve routing issues, thus increasing the runtime and the overall chip area. To optimize the chip performance, power and are… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

    Journal ref: Synopsys User Conference (SNUG) Silicon Valley 2017

  40. arXiv:1805.02856  [pdf, other]

    cs.CL cs.AI cs.IR

    Reasoning with Sarcasm by Reading In-between

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui, Jian Su

    Abstract: Sarcasm is a sophisticated speech act which commonly manifests on social communities such as Twitter and Reddit. The prevalence of sarcasm on the social web is highly disruptive to opinion mining systems due to not only its tendency of polarity flipping but also usage of figurative language. Sarcasm commonly manifests with a contrastive theme either between positive-negative sentiments or between… ▽ More

    Submitted 8 May, 2018; originally announced May 2018.

    Comments: Accepted to ACL 2018

  41. arXiv:1803.09074  [pdf, other]

    cs.CL cs.AI cs.NE

    Multi-range Reasoning for Machine Comprehension

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: We propose MRU (Multi-Range Reasoning Units), a new fast compositional encoder for machine comprehension (MC). Our proposed MRU encoders are characterized by multi-ranged gating, executing a series of parameterized contract-and-expand layers for learning gating vectors that benefit from long and short-term dependencies. The aims of our approach are as follows: (1) learning representations that are…

    Submitted 24 March, 2018; originally announced March 2018.

  42. arXiv:1801.09251  [pdf, other]

    cs.CL cs.AI cs.IR

    Multi-Pointer Co-Attention Networks for Recommendation

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Many recent state-of-the-art recommender systems such as D-ATT, TransNet and DeepCoNN exploit reviews for representation learning. This paper proposes a new neural architecture for recommendation with reviews. Our model operates on a multi-hierarchical paradigm and is based on the intuition that not all reviews are created equal, i.e., only a select few are important. The importance, however, shou…

    Submitted 21 June, 2018; v1 submitted 28 January, 2018; originally announced January 2018.

    Comments: Accepted to KDD 2018 (Research Track)

  43. arXiv:1801.00102  [pdf, other]

    cs.CL cs.AI

    Compare, Compress and Propagate: Enhancing Neural Architectures with Alignment Factorization for Natural Language Inference

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: This paper presents a new deep learning architecture for Natural Language Inference (NLI). Firstly, we introduce a new architecture where alignment pairs are compared, compressed and then propagated to upper layers for enhanced representation learning. Secondly, we adopt factorization layers for efficient and expressive compression of alignment vectors into scalar features, which are then used to…

    Submitted 10 September, 2018; v1 submitted 30 December, 2017; originally announced January 2018.

    Comments: EMNLP 2018 CRC and Update CAFE + ELMo result on SNLI

  44. arXiv:1712.05403  [pdf, other]

    cs.CL cs.AI cs.IR

    Learning to Attend via Word-Aspect Associative Fusion for Aspect-based Sentiment Analysis

    Authors: Yi Tay, Anh Tuan Luu, Siu Cheung Hui

    Abstract: Aspect-based sentiment analysis (ABSA) tries to predict the polarity of a given document with respect to a given aspect entity. While neural network architectures have been successful in predicting the overall polarity of sentences, aspect-specific sentiment analysis still remains as an open problem. In this paper, we propose a novel method for integrating aspect information into the neural model.…

    Submitted 14 December, 2017; originally announced December 2017.

    Comments: Accepted to AAAI2018

  45. arXiv:1711.07656  [pdf, other]

    cs.CL cs.AI cs.IR

    Cross Temporal Recurrent Networks for Ranking Question Answer Pairs

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Temporal gates play a significant role in modern recurrent-based neural encoders, enabling fine-grained control over recursive compositional operations over time. In recurrent models such as the long short-term memory (LSTM), temporal gates control the amount of information retained or discarded over time, not only playing an important role in influencing the learned representations but also servi…

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: Accepted to AAAI2018

  46. arXiv:1711.04981  [pdf, other]

    cs.AI cs.CL

    SkipFlow: Incorporating Neural Coherence Features for End-to-End Automatic Text Scoring

    Authors: Yi Tay, Minh C. Phan, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Deep learning has demonstrated tremendous potential for Automatic Text Scoring (ATS) tasks. In this paper, we describe a new neural architecture that enhances vanilla neural network models with auxiliary neural coherence features. Our new method proposes a new SkipFlow mechanism that models relationships between snapshots of the hidden representations of a long short-term memory (LSTM) ne…

    Submitted 14 November, 2017; originally announced November 2017.

    Comments: Accepted to AAAI 2018

  47. arXiv:1708.07436  [pdf, other]

    cs.LG cs.CR cs.DB

    Differentially Private Regression for Discrete-Time Survival Analysis

    Authors: Thông T. Nguyên, Siu Cheung Hui

    Abstract: In survival analysis, regression models are used to understand the effects of explanatory variables (e.g., age, sex, weight, etc.) to the survival probability. However, for sensitive survival data such as medical data, there are serious concerns about the privacy of individuals in the data set when medical data is used to fit the regression models. The closest work addressing such privacy concerns…

    Submitted 24 August, 2017; v1 submitted 24 August, 2017; originally announced August 2017.

    Comments: 19 pages, CIKM17

  48. arXiv:1708.04828  [pdf, other]

    cs.AI cs.IR

    Multi-task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs

    Authors: Yi Tay, Luu Anh Tuan, Minh C. Phan, Siu Cheung Hui

    Abstract: Many popular knowledge graphs such as Freebase, YAGO or DBPedia maintain a list of non-discrete attributes for each entity. Intuitively, these attributes such as height, price or population count are able to richly characterize entities in knowledge graphs. This additional source of information may help to alleviate the inherent sparsity and incompleteness problem that are prevalent in knowledge g…

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: Accepted at CIKM 2017

  49. arXiv:1708.04517  [pdf, other]

    cs.CR cs.DB

    Privacy-Preserving Mechanisms for Parametric Survival Analysis with Weibull Distribution

    Authors: Thông T. Nguyên, Siu Cheung Hui

    Abstract: Survival analysis studies the statistical properties of the time until an event of interest occurs. It has been commonly used to study the effectiveness of medical treatments or the lifespan of a population. However, survival analysis can potentially leak confidential information of individuals in the dataset. The state-of-the-art techniques apply ad-hoc privacy-preserving mechanisms on publishing…

    Submitted 24 August, 2017; v1 submitted 1 July, 2017; originally announced August 2017.

    Comments: 8 pages, Trustcom17

  50. Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: The dominant neural architectures in question answer retrieval are based on recurrent or convolutional encoders configured with complex word matching layers. Given that recent architectural innovations are mostly new word interaction layers or attention-based matching mechanisms, it seems to be a well-established fact that these components are mandatory for good performance. Unfortunately, the mem…

    Submitted 23 November, 2017; v1 submitted 25 July, 2017; originally announced July 2017.

    Comments: Accepted at WSDM 2018