Skip to main content

Showing 1–50 of 66 results for author: Lai, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.22096  [pdf, ps, other

    cs.LG

    Transfer Learning for Assessing Heavy Metal Pollution in Seaports Sediments

    Authors: Tin Lai, Farnaz Farid, Yueyang Kuan, Xintian Zhang

    Abstract: Detecting heavy metal pollution in soils and seaports is vital for regional environmental monitoring. The Pollution Load Index (PLI), an international standard, is commonly used to assess heavy metal containment. However, the conventional PLI assessment involves laborious procedures and data analysis of sediment samples. To address this challenge, we propose a deep-learning-based model that simpli… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  2. arXiv:2502.02335  [pdf

    cs.CR

    Target Attack Backdoor Malware Analysis and Attribution

    Authors: Anthony Cheuk Tung Lai, Vitaly Kamluk, Alan Ho, Ping Fan Ke, Byron Wai

    Abstract: Backdoor Malware are installed by an attacker on the victim's server(s) for authorized access. A customized backdoor is weaponized to execute unauthorized system, database and application commands to access the user credentials and confidential digital assets. Recently, we discovered and analyzed a targeted persistent module backdoor in Web Server in an online business company that was undetectabl… ▽ More

    Submitted 5 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: 12 pages, 8 figures, 2 tables, DFRWS

  3. arXiv:2502.02230  [pdf

    cs.CR

    An Attack-Driven Incident Response and Defense System (ADIRDS)

    Authors: Anthony Cheuk Tung Lai, Siu Ming Yiu, Ping Fan Ke, Alan Ho

    Abstract: One of the major goals of incident response is to help an organization or a system owner to quickly identify and halt the attacks to minimize the damages (and financial loss) to the system being attacked. Typical incident responses rely very much on the log information captured by the system during the attacks and if needed, may need to isolate the victim from the network to avoid further destruct… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: 18 pages, 3 figures, 4 tables

  4. arXiv:2502.01221  [pdf

    cs.CR

    Ransomware IR Model: Proactive Threat Intelligence-Based Incident Response Strategy

    Authors: Anthony Cheuk Tung Lai, Ping Fan Ke, Alan Ho

    Abstract: Ransomware impact different organizations for years, it causes huge monetary, reputation loss and operation impact. Other than typical data encryption by ransomware, attackers can request ransom from the victim organizations via data extortion, otherwise, attackers will publish stolen data publicly in their ransomware dashboard forum and data-sharing platforms. However, there is no clear and prove… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: 10 pages, 1 figure, 2 tables, case study

  5. arXiv:2412.08346  [pdf, other

    cs.RO

    Grasping by parallel shape matching

    Authors: Wenzheng Zhang, Fahira Afzal Maken, Tin Lai, Fabio Ramos

    Abstract: Grasping is essential in robotic manipulation, yet challenging due to object and gripper diversity and real-world complexities. Traditional analytic approaches often have long optimization times, while data-driven methods struggle with unseen objects. This paper formulates the problem as a rigid shape matching between gripper and object, which optimizes with Annealed Stein Iterative Closest Point… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

    Journal ref: ACRA 2024: Australasian Conference on Robotics and Automation, November 2024, Auckland, New Zealand

  6. arXiv:2410.03386  [pdf, ps, other

    cs.CY

    Chronic Disease Diagnoses Using Behavioral Data

    Authors: Di Wang, Yidan Hu, Eng Sing Lee, Hui Hwang Teong, Ray Tian Rui Lai, Wai Han Hoi, Chunyan Miao

    Abstract: Early detection of chronic diseases is beneficial to healthcare by providing a golden opportunity for timely interventions. Although numerous prior studies have successfully used machine learning (ML) models for disease diagnoses, they highly rely on medical data, which are scarce for most patients in the early stage of the chronic diseases. In this paper, we aim to diagnose hyperglycemia (diabete… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  7. arXiv:2407.14911  [pdf, other

    cs.CV

    Self-supervised transformer-based pre-training method with General Plant Infection dataset

    Authors: Zhengle Wang, Ruifeng Wang, Minjuan Wang, Tianyun Lai, Man Zhang

    Abstract: Pest and disease classification is a challenging issue in agriculture. The performance of deep learning models is intricately linked to training data diversity and quantity, posing issues for plant pest and disease datasets that remain underdeveloped. This study addresses these challenges by constructing a comprehensive dataset and proposing an advanced network architecture that combines Contrasti… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: 14 pages, 5 figures, 4 tables, 3 formulas

  8. arXiv:2405.17926  [pdf, other

    cs.CV

    SarcNet: A Novel AI-based Framework to Automatically Analyze and Score Sarcomere Organizations in Fluorescently Tagged hiPSC-CMs

    Authors: Huyen Le, Khiet Dang, Tien Lai, Nhung Nguyen, Mai Tran, Hieu Pham

    Abstract: Quantifying sarcomere structure organization in human-induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs) is crucial for understanding cardiac disease pathology, improving drug screening, and advancing regenerative medicine. Traditional methods, such as manual annotation and Fourier transform analysis, are labor-intensive, error-prone, and lack high-throughput capabilities. In this st… ▽ More

    Submitted 28 October, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  9. arXiv:2405.14068  [pdf, other

    cs.GT cs.PL

    Verifying Cake-Cutting, Faster

    Authors: Noah Bertram, Tean Lai, Justin Hsu

    Abstract: Envy-free cake-cutting protocols procedurally divide an infinitely divisible good among a set of agents so that no agent prefers another's allocation to their own. These protocols are highly complex and difficult to prove correct. Recently, Bertram, Levinson, and Hsu introduced a language called Slice for describing and verifying cake-cutting protocols. Slice programs can be translated to formulas… ▽ More

    Submitted 30 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 53 Pages, 12 Figures, CAV 2024

    ACM Class: D.3.1; J.4

  10. arXiv:2312.12439  [pdf, other

    cs.CV physics.optics

    Single-pixel 3D imaging based on fusion temporal data of single photon detector and millimeter-wave radar

    Authors: Tingqin Lai, Xiaolin Liang, Yi Zhu, Xinyi Wu, Lianye Liao, Xuelin Yuan, Ping Su, Shihai Sun

    Abstract: Recently, there has been increased attention towards 3D imaging using single-pixel single-photon detection (also known as temporal data) due to its potential advantages in terms of cost and power efficiency. However, to eliminate the symmetry blur in the reconstructed images, a fixed background is required. This paper proposes a fusion-data-based 3D imaging method that utilizes a single-pixel sing… ▽ More

    Submitted 20 October, 2023; originally announced December 2023.

    Comments: Accepted by Chinese Optics Letters, and comments are welcome

    Journal ref: Chinese Optics Letters, Vol.2, No.2, 2024

  11. arXiv:2310.12294  [pdf, other

    cs.LG

    Open-Set Multivariate Time-Series Anomaly Detection

    Authors: Thomas Lai, Thi Kieu Khanh Ho, Narges Armanfard

    Abstract: Numerous methods for time-series anomaly detection (TSAD) have emerged in recent years, most of which are unsupervised and assume that only normal samples are available during the training phase, due to the challenge of obtaining abnormal data in real-world scenarios. Still, limited samples of abnormal data are often available, albeit they are far from representative of all possible anomalies. Sup… ▽ More

    Submitted 7 August, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted to ECAI-2024

  12. arXiv:2309.00252  [pdf, other

    cs.CV cs.LG

    Interpretable Medical Imagery Diagnosis with Self-Attentive Transformers: A Review of Explainable AI for Health Care

    Authors: Tin Lai

    Abstract: Recent advancements in artificial intelligence (AI) have facilitated its widespread adoption in primary medical services, addressing the demand-supply imbalance in healthcare. Vision Transformers (ViT) have emerged as state-of-the-art computer vision models, benefiting from self-attention modules. However, compared to traditional machine-learning approaches, deep-learning models are complex and ar… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  13. arXiv:2308.04071  [pdf, other

    cs.RO cs.AI cs.LG

    Path Signatures for Diversity in Probabilistic Trajectory Optimisation

    Authors: Lucas Barcelos, Tin Lai, Rafael Oliveira, Paulo Borges, Fabio Ramos

    Abstract: Motion planning can be cast as a trajectory optimisation problem where a cost is minimised as a function of the trajectory being generated. In complex environments with several obstacles and complicated geometry, this optimisation problem is usually difficult to solve and prone to local minima. However, recent advancements in computing hardware allow for parallel trajectory optimisation where mult… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  14. arXiv:2307.11991  [pdf, other

    cs.CL cs.AI

    Psy-LLM: Scaling up Global Mental Health Psychological Services with AI-based Large Language Models

    Authors: Tin Lai, Yukun Shi, Zicong Du, Jiajie Wu, Ken Fu, Yichao Dou, Ziqi Wang

    Abstract: The demand for psychological counselling has grown significantly in recent years, particularly with the global outbreak of COVID-19, which has heightened the need for timely and professional mental health support. Online psychological counselling has emerged as the predominant mode of providing services in response to this demand. In this study, we propose the Psy-LLM framework, an AI-based assist… ▽ More

    Submitted 1 September, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

  15. arXiv:2307.10596  [pdf, other

    cs.LG cs.SI stat.ML

    Ensemble Learning based Anomaly Detection for IoT Cybersecurity via Bayesian Hyperparameters Sensitivity Analysis

    Authors: Tin Lai, Farnaz Farid, Abubakar Bello, Fariza Sabrina

    Abstract: The Internet of Things (IoT) integrates more than billions of intelligent devices over the globe with the capability of communicating with other connected devices with little to no human intervention. IoT enables data aggregation and analysis on a large scale to improve life quality in many domains. In particular, data collected by IoT contain a tremendous amount of information for anomaly detecti… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  16. Real-time Aerial Detection and Reasoning on Embedded-UAVs

    Authors: Tin Lai

    Abstract: We present a unified pipeline architecture for a real-time detection system on an embedded system for UAVs. Neural architectures have been the industry standard for computer vision. However, most existing works focus solely on concatenating deeper layers to achieve higher accuracy with run-time performance as the trade-off. This pipeline of networks can exploit the domain-specific knowledge on aer… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: In TGRS

  17. arXiv:2303.14366  [pdf, other

    stat.ML cs.LG

    Hybrid Fuzzy-Crisp Clustering Algorithm: Theory and Experiments

    Authors: Akira R. Kinjo, Daphne Teck Ching Lai

    Abstract: With the membership function being strictly positive, the conventional fuzzy c-means clustering method sometimes causes imbalanced influence when clusters of vastly different sizes exist. That is, an outstandingly large cluster drags to its center all the other clusters, however far they are separated. To solve this problem, we propose a hybrid fuzzy-crisp clustering algorithm based on a target fu… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 41 pages, 13 figures, 10 tables

  18. arXiv:2301.10153  [pdf, other

    q-fin.ST cs.LG

    Sequential Graph Attention Learning for Predicting Dynamic Stock Trends (Student Abstract)

    Authors: Tzu-Ya Lai, Wen Jung Cheng, Jun-En Ding

    Abstract: The stock market is characterized by a complex relationship between companies and the market. This study combines a sequential graph structure with attention mechanisms to learn global and local information within temporal time. Specifically, our proposed "GAT-AGNN" module compares model performance across multiple industries as well as within single industries. The results show that the proposed… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

  19. arXiv:2301.09175  [pdf, other

    cs.CL

    Ensemble Transfer Learning for Multilingual Coreference Resolution

    Authors: Tuan Manh Lai, Heng Ji

    Abstract: Entity coreference resolution is an important research problem with many applications, including information extraction and question answering. Coreference resolution for English has been studied extensively. However, there is relatively little work for other languages. A problem that frequently occurs when working with a non-English language is the scarcity of annotated training data. To overcome… ▽ More

    Submitted 22 January, 2023; originally announced January 2023.

  20. arXiv:2211.04781  [pdf

    cs.CV

    Profiling Obese Subgroups in National Health and Nutritional Status Survey Data using Machine Learning Techniques: A Case Study from Brunei Darussalam

    Authors: Usman Khalil, Owais Ahmed Malik, Daphne Teck Ching Lai, Ong Sok King

    Abstract: National Health and Nutritional Status Survey (NHANSS) is conducted annually by the Ministry of Health in Negara Brunei Darussalam to assess the population health and nutritional patterns and characteristics. The main aim of this study was to discover meaningful patterns (groups) from the obese sample of NHANSS data by applying data reduction and interpretation techniques. The mixed nature of the… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: A Case study of Obese Subgroups from Brunei Darussalam: 15 Pages, 4 figures, journal

  21. arXiv:2211.03985  [pdf, other

    stat.CO cs.LG

    Adaptive Data Depth via Multi-Armed Bandits

    Authors: Tavor Z. Baharav, Tze Leung Lai

    Abstract: Data depth, introduced by Tukey (1975), is an important tool in data science, robust statistics, and computational geometry. One chief barrier to its broader practical utility is that many common measures of depth are computationally intensive, requiring on the order of $n^d$ operations to exactly compute the depth of a single point within a data set of $n$ points in $d$-dimensional space. Often h… ▽ More

    Submitted 9 November, 2022; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: Keywords: multi-armed bandits, data depth, adaptivity, large-scale computation, simplicial depth

  22. arXiv:2211.02700  [pdf, other

    cs.AI q-bio.NC

    Achieving mouse-level strategic evasion performance using real-time computational planning

    Authors: German Espinosa, Gabrielle E. Wink, Alexander T. Lai, Daniel A. Dombeck, Malcolm A. MacIver

    Abstract: Planning is an extraordinary ability in which the brain imagines and then enacts evaluated possible futures. Using traditional planning models, computer scientists have attempted to replicate this capacity with some level of success but ultimately face a reoccurring limitation: as the plan grows in steps, the number of different possible futures makes it intractable to determine the right sequence… ▽ More

    Submitted 8 November, 2022; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: 6 pages, 4 figures, ICRA 2023

  23. arXiv:2209.09932  [pdf, other

    cs.SE cs.AI

    Comparative analysis of real bugs in open-source Machine Learning projects -- A Registered Report

    Authors: Tuan Dung Lai, Anj Simmons, Scott Barnett, Jean-Guy Schneider, Rajesh Vasa

    Abstract: Background: Machine Learning (ML) systems rely on data to make predictions, the systems have many added components compared to traditional software systems such as the data processing pipeline, serving pipeline, and model training. Existing research on software maintenance has studied the issue-reporting needs and resolution process for different types of issues, such as performance and security i… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 6 pages, 3 figures, registered report, conference paper, accepted at ESEM -- the International Symposium on Empirical Software Engineering and Measurement

  24. arXiv:2209.05222  [pdf, other

    cs.RO cs.AI

    A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding

    Authors: Tin Lai

    Abstract: Simultaneous Localisation and Mapping (SLAM) is one of the fundamental problems in autonomous mobile robots where a robot needs to reconstruct a previously unseen environment while simultaneously localising itself with respect to the map. In particular, Visual-SLAM uses various sensors from the mobile robot for collecting and sensing a representation of the map. Traditionally, geometric model-base… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  25. arXiv:2207.08130  [pdf, other

    cs.AI cs.RO stat.ML

    Discover Life Skills for Planning with Bandits via Observing and Learning How the World Works

    Authors: Tin Lai

    Abstract: We propose a novel approach for planning agents to compose abstract skills via observing and learning from historical interactions with the world. Our framework operates in a Markov state-space model via a set of actions under unknown pre-conditions. We formulate skills as high-level abstract policies that propose action plans based on the current state. Each policy learns new plans by observing t… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

  26. arXiv:2204.11817  [pdf, other

    cs.CL cs.AI

    Translation between Molecules and Natural Language

    Authors: Carl Edwards, Tuan Lai, Kevin Ros, Garrett Honke, Kyunghyun Cho, Heng Ji

    Abstract: We present $\textbf{MolT5}$ $-$ a self-supervised learning framework for pretraining models on a vast amount of unlabeled natural language text and molecule strings. $\textbf{MolT5}$ allows for new, useful, and challenging analogs of traditional vision-language tasks, such as molecule captioning and text-based de novo molecule generation (altogether: translation between molecules and language), wh… ▽ More

    Submitted 3 November, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted at EMNLP 2022. Data and code can be found on [Github](https://github.com/blender-nlp/MolT5)

  27. arXiv:2203.03422  [pdf, other

    cs.LG math.NA

    Water and Sediment Analyse Using Predictive Models

    Authors: Xiaoting Xu, Tin Lai, Sayka Jahan, Farnaz Farid

    Abstract: The increasing prevalence of marine pollution during the past few decades motivated recent research to help ease the situation. Typical water quality assessment requires continuous monitoring of water and sediments at remote locations with labour intensive laboratory tests to determine the degree of pollution. We propose an automated framework where we formalise a predictive model using Machine Le… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

  28. arXiv:2203.00975  [pdf, other

    cs.RO cs.LG

    L4KDE: Learning for KinoDynamic Tree Expansion

    Authors: Tin Lai, Weiming Zhi, Tucker Hermans, Fabio Ramos

    Abstract: We present the Learning for KinoDynamic Tree Expansion (L4KDE) method for kinodynamic planning. Tree-based planning approaches, such as rapidly exploring random tree (RRT), are the dominant approach to finding globally optimal plans in continuous state-space motion planning. Central to these approaches is tree-expansion, the procedure in which new nodes are added into an ever-expanding tree. We st… ▽ More

    Submitted 17 September, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

  29. arXiv:2202.13404  [pdf, other

    cs.CL

    Improving Candidate Retrieval with Entity Profile Generation for Wikidata Entity Linking

    Authors: Tuan Manh Lai, Heng Ji, ChengXiang Zhai

    Abstract: Entity linking (EL) is the task of linking entity mentions in a document to referent entities in a knowledge base (KB). Many previous studies focus on Wikipedia-derived KBs. There is little work on EL over Wikidata, even though it is the most extensive crowdsourced KB. The scale of Wikidata can open up many new real-world applications, but its massive number of entities also makes EL challenging.… ▽ More

    Submitted 14 March, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: ACL 2022 (Findings)

  30. arXiv:2201.05314  [pdf, other

    cs.CV cs.NE cs.RO

    A Novel Skeleton-Based Human Activity Discovery Using Particle Swarm Optimization with Gaussian Mutation

    Authors: Parham Hadikhani, Daphne Teck Ching Lai, Wee-Hong Ong

    Abstract: Human activity discovery aims to cluster the activities performed by humans without any prior information on what defines each activity. Most methods presented in human activity recognition are supervised, where there are labeled inputs to train the system. In reality, it is difficult to label activities data because of its huge volume and the variety of human activities. This paper proposes an un… ▽ More

    Submitted 18 October, 2022; v1 submitted 14 January, 2022; originally announced January 2022.

  31. arXiv:2111.10559  [pdf, other

    cs.LG cs.AI

    Learning Non-Stationary Time-Series with Dynamic Pattern Extractions

    Authors: Xipei Wang, Haoyu Zhang, Yuanbo Zhang, Meng Wang, Jiarui Song, Tin Lai, Matloob Khushi

    Abstract: The era of information explosion had prompted the accumulation of a tremendous amount of time-series data, including stationary and non-stationary time-series data. State-of-the-art algorithms have achieved a decent performance in dealing with stationary temporal data. However, traditional algorithms that tackle stationary time-series do not apply to non-stationary series like Forex trading. This… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

  32. sbp-env: Sampling-based Motion Planners' Testing Environment

    Authors: Tin Lai

    Abstract: Sampling-based motion planners' testing environment (sbp-env) is a full feature framework to quickly test different sampling-based algorithms for motion planning. sbp-env focuses on the flexibility of tinkering with different aspects of the framework, and had divided the main planning components into two categories (i) samplers and (ii) planners. The focus of motion planning research had been ma… ▽ More

    Submitted 27 October, 2021; v1 submitted 15 October, 2021; originally announced October 2021.

    Journal ref: Journal of Open Source Software, 6(66), 3782 (2021)

  33. arXiv:2109.10209  [pdf, other

    cs.RO

    Rapid Replanning in Consecutive Pick-and-Place Tasks with Lazy Experience Graph

    Authors: Tin Lai, Fabio Ramos

    Abstract: In an environment where a manipulator needs to execute multiple consecutive tasks, the act of object manoeuvre will change the underlying configuration space, affecting all subsequent tasks. Previously free configurations might now be occupied by the manoeuvred objects, and previously occupied space might now open up new paths. We propose Lazy Tree-based Replanner (LTR*) -- a novel hybrid planner… ▽ More

    Submitted 3 September, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

  34. arXiv:2109.02237  [pdf, other

    cs.CL cs.AI

    BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks

    Authors: Tuan Lai, Heng Ji, ChengXiang Zhai

    Abstract: Biomedical entity linking is the task of linking entity mentions in a biomedical document to referent entities in a knowledge base. Recently, many BERT-based models have been introduced for the task. While these models have achieved competitive results on many datasets, they are computationally expensive and contain about 110M parameters. Little is known about the factors contributing to their imp… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021 (Findings)

  35. arXiv:2108.11775  [pdf, other

    cs.RO cs.LG

    Parallelised Diffeomorphic Sampling-based Motion Planning

    Authors: Tin Lai, Weiming Zhi, Tucker Hermans, Fabio Ramos

    Abstract: We propose Parallelised Diffeomorphic Sampling-based Motion Planning (PDMP). PDMP is a novel parallelised framework that uses bijective and differentiable mappings, or diffeomorphisms, to transform sampling distributions of sampling-based motion planners, in a manner akin to normalising flows. Unlike normalising flow models which use invertible neural network structures to represent these diffeomo… ▽ More

    Submitted 22 September, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

  36. arXiv:2108.09889  [pdf, other

    cs.CL

    A Unified Transformer-based Framework for Duplex Text Normalization

    Authors: Tuan Manh Lai, Yang Zhang, Evelina Bakhturina, Boris Ginsburg, Heng Ji

    Abstract: Text normalization (TN) and inverse text normalization (ITN) are essential preprocessing and postprocessing steps for text-to-speech synthesis and automatic speech recognition, respectively. Many methods have been proposed for either TN or ITN, ranging from weighted finite-state transducers to neural networks. Despite their impressive performance, these methods aim to tackle only one of the two ta… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: Under Review

  37. arXiv:2107.01700  [pdf, other

    cs.CL

    End-to-end Neural Coreference Resolution Revisited: A Simple yet Effective Baseline

    Authors: Tuan Manh Lai, Trung Bui, Doo Soon Kim

    Abstract: Since the first end-to-end neural coreference resolution model was introduced, many extensions to the model have been proposed, ranging from using higher-order inference to directly optimizing evaluation metrics using reinforcement learning. Despite improving the coreference resolution performance by a large margin, these extensions add substantial extra complexity to the original model. Motivated… ▽ More

    Submitted 8 February, 2022; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: Accepted by ICASSP 2022

  38. arXiv:2107.01650  [pdf, other

    cs.LG

    Learning ODEs via Diffeomorphisms for Fast and Robust Integration

    Authors: Weiming Zhi, Tin Lai, Lionel Ott, Edwin V. Bonilla, Fabio Ramos

    Abstract: Advances in differentiable numerical integrators have enabled the use of gradient descent techniques to learn ordinary differential equations (ODEs). In the context of machine learning, differentiable solvers are central for Neural ODEs (NODEs), a class of deep learning models with continuous depth, rather than discrete layers. However, these integrators can be unsatisfactorily slow and inaccurate… ▽ More

    Submitted 4 July, 2021; originally announced July 2021.

  39. arXiv:2105.13456  [pdf, other

    cs.CL

    Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference

    Authors: Tuan Lai, Heng Ji, ChengXiang Zhai, Quan Hung Tran

    Abstract: Compared to the general news domain, information extraction (IE) from biomedical text requires much broader domain knowledge. However, many previous IE methods do not utilize any external knowledge during inference. Due to the exponential growth of biomedical publications, models that do not go beyond their fixed set of parameters will likely fall behind. Inspired by how humans look up relevant in… ▽ More

    Submitted 31 May, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted by ACL 2021

  40. arXiv:2104.01697  [pdf, other

    cs.CL

    A Context-Dependent Gated Module for Incorporating Symbolic Semantics into Event Coreference Resolution

    Authors: Tuan Lai, Heng Ji, Trung Bui, Quan Hung Tran, Franck Dernoncourt, Walter Chang

    Abstract: Event coreference resolution is an important research problem with many applications. Despite the recent remarkable success of pretrained language models, we argue that it is still highly beneficial to utilize symbolic features for the task. However, as the input for coreference resolution typically comes from upstream components in the information extraction pipeline, the automatically extracted… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

    Comments: Accepted by NAACL 2021

  41. arXiv:2103.04487  [pdf, other

    cs.RO

    Rapidly-exploring Random Forest: Adaptively Exploits Local Structure with Generalised Multi-Trees Motion Planning

    Authors: Tin Lai

    Abstract: Sampling-based motion planners perform exceptionally well in robotic applications that operate in high-dimensional space. However, most works often constrain the planning workspace rooted at some fixed locations, do not adaptively reason on strategy in narrow passages, and ignore valuable local structure information. In this paper, we propose Rapidly-exploring Random Forest (RRF*) -- a generalised… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

  42. arXiv:2011.13470  [pdf, other

    cs.CL cs.LG

    AutoNLU: An On-demand Cloud-based Natural Language Understanding System for Enterprises

    Authors: Nham Le, Tuan Lai, Trung Bui, Doo Soon Kim

    Abstract: With the renaissance of deep learning, neural networks have achieved promising results on many natural language understanding (NLU) tasks. Even though the source codes of many neural network models are publicly available, there is still a large gap from open-sourced models to solving real-world problems in enterprises. Therefore, to fill this gap, we introduce AutoNLU, an on-demand cloud-based sys… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

    Comments: Accepted to AACL 2020 (Demo)

  43. arXiv:2011.06235  [pdf, other

    cs.RO cs.LG

    Anticipatory Navigation in Crowds by Probabilistic Prediction of Pedestrian Future Movements

    Authors: Weiming Zhi, Tin Lai, Lionel Ott, Fabio Ramos

    Abstract: Critical for the coexistence of humans and robots in dynamic environments is the capability for agents to understand each other's actions, and anticipate their movements. This paper presents Stochastic Process Anticipatory Navigation (SPAN), a framework that enables nonholonomic robots to navigate in environments with crowds, while anticipating and accounting for the motion patterns of pedestrians… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  44. arXiv:2011.03096  [pdf, other

    cs.CL cs.LG

    Explain by Evidence: An Explainable Memory-based Neural Network for Question Answering

    Authors: Quan Tran, Nhan Dam, Tuan Lai, Franck Dernoncourt, Trung Le, Nham Le, Dinh Phung

    Abstract: Interpretability and explainability of deep neural networks are challenging due to their scale, complexity, and the agreeable notions on which the explaining process rests. Previous work, in particular, has focused on representing internal components of neural networks through human-friendly visuals and concepts. On the other hand, in real life, when making a decision, human tends to rely on simil… ▽ More

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: Accepted to COLING 2020

  45. arXiv:2010.13033  [pdf, other

    cs.AI

    Robust Hierarchical Planning with Policy Delegation

    Authors: Tin Lai, Philippe Morere

    Abstract: We propose a novel framework and algorithm for hierarchical planning based on the principle of delegation. This framework, the Markov Intent Process, features a collection of skills which are each specialised to perform a single task well. Skills are aware of their intended effects and are able to analyse planning goals to delegate planning to the best-suited skill. This principle dynamically crea… ▽ More

    Submitted 25 October, 2020; originally announced October 2020.

  46. arXiv:2010.11980  [pdf, other

    cs.CL cs.LG

    A Joint Learning Approach based on Self-Distillation for Keyphrase Extraction from Scientific Documents

    Authors: Tuan Manh Lai, Trung Bui, Doo Soon Kim, Quan Hung Tran

    Abstract: Keyphrase extraction is the task of extracting a small set of phrases that best describe a document. Most existing benchmark datasets for the task typically have limited numbers of annotated documents, making it challenging to train increasingly complex neural networks. In contrast, digital libraries store millions of scientific articles online, covering a wide range of topics. While a significant… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted to COLING 2020

  47. arXiv:2010.11323  [pdf, other

    cs.RO cs.LG

    Learning to Plan Optimally with Flow-based Motion Planner

    Authors: Tin Lai, Fabio Ramos

    Abstract: Sampling-based motion planning is the predominant paradigm in many real-world robotic applications, but its performance is immensely dependent on the quality of the samples. The majority of traditional planners are inefficient as they use uninformative sampling distributions as opposed to exploiting structures and patterns in the problem to guide better sampling strategies. Moreover, most current… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  48. arXiv:2007.03805  [pdf, other

    cs.CL cs.AI cs.IR

    ISA: An Intelligent Shopping Assistant

    Authors: Tuan Manh Lai, Trung Bui, Nedim Lipka

    Abstract: Despite the growth of e-commerce, brick-and-mortar stores are still the preferred destinations for many people. In this paper, we present ISA, a mobile-based intelligent shopping assistant that is designed to improve shopping experience in physical stores. ISA assists users by leveraging advanced techniques in computer vision, speech processing, and natural language processing. An in-store user on… ▽ More

    Submitted 23 September, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted by AACL 2020 (Demo)

  49. arXiv:2005.03501  [pdf

    cs.CV

    Heidelberg Colorectal Data Set for Surgical Data Science in the Sensor Operating Room

    Authors: Lena Maier-Hein, Martin Wagner, Tobias Ross, Annika Reinke, Sebastian Bodenstedt, Peter M. Full, Hellena Hempe, Diana Mindroc-Filimon, Patrick Scholz, Thuy Nuong Tran, Pierangela Bruno, Anna Kisilenko, Benjamin Müller, Tornike Davitashvili, Manuela Capek, Minu Tizabi, Matthias Eisenmann, Tim J. Adler, Janek Gröhl, Melanie Schellenberg, Silvia Seidlitz, T. Y. Emmy Lai, Bünyamin Pekdemir, Veith Roethlingshoefer, Fabian Both , et al. (8 additional authors not shown)

    Abstract: Image-based tracking of medical instruments is an integral part of surgical data science applications. Previous research has addressed the tasks of detecting, segmenting and tracking medical instruments based on laparoscopic video data. However, the proposed methods still tend to fail when applied to challenging images and do not generalize well to data they have not been trained on. This paper in… ▽ More

    Submitted 23 February, 2021; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: Submitted to Nature Scientific Data

  50. arXiv:1910.12995  [pdf, other

    cs.CL cs.LG

    A Simple but Effective BERT Model for Dialog State Tracking on Resource-Limited Systems

    Authors: Tuan Manh Lai, Quan Hung Tran, Trung Bui, Daisuke Kihara

    Abstract: In a task-oriented dialog system, the goal of dialog state tracking (DST) is to monitor the state of the conversation from the dialog history. Recently, many deep learning based methods have been proposed for the task. Despite their impressive performance, current neural architectures for DST are typically heavily-engineered and conceptually complex, making it difficult to implement, debug, and ma… ▽ More

    Submitted 8 February, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: Accepted to ICASSP 2020