Skip to main content

Showing 1–50 of 57 results for author: Vo, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11237  [pdf, ps, other

    cs.SE cs.CL

    LLM-as-a-Judge for Reference-less Automatic Code Validation and Refinement for Natural Language to Bash in IT Automation

    Authors: Ngoc Phuoc An Vo, Brent Paulovicks, Vadim Sheinin

    Abstract: In an effort to automatically evaluate and select the best model and improve code quality for automatic incident remediation in IT Automation, it is crucial to verify if the generated code for remediation action is syntactically and semantically correct and whether it can be executed correctly as intended. There are three approaches: 1) conventional methods use surface form similarity metrics (tok… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 10 pages

  2. arXiv:2506.06349  [pdf, ps, other

    eess.SP cs.CV cs.LG

    Heart Rate Classification in ECG Signals Using Machine Learning and Deep Learning

    Authors: Thien Nhan Vo

    Abstract: This study addresses the classification of heartbeats from ECG signals through two distinct approaches: traditional machine learning utilizing hand-crafted features and deep learning via transformed images of ECG beats. The dataset underwent preprocessing steps, including downsampling, filtering, and normalization, to ensure consistency and relevance for subsequent analysis. In the first approach,… ▽ More

    Submitted 16 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

  3. arXiv:2506.02657  [pdf, ps, other

    cs.IT cs.LG

    Maximizing the Promptness of Metaverse Systems using Edge Computing by Deep Reinforcement Learning

    Authors: Tam Ninh Thi-Thanh, Trinh Van Chien, Hung Tran, Nguyen Hoai Son, Van Nhan Vo

    Abstract: Metaverse and Digital Twin (DT) have attracted much academic and industrial attraction to approach the future digital world. This paper introduces the advantages of deep reinforcement learning (DRL) in assisting Metaverse system-based Digital Twin. In this system, we assume that it includes several Metaverse User devices collecting data from the real world to transfer it into the virtual world, a… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 6 pages, 3 figures, and 2 tables. Published by IEEE at ATC2024

  4. arXiv:2506.01609  [pdf, ps, other

    cs.NI eess.SP

    Network Digital Twin for 6G and Beyond: An End-to-End View Across Multi-Domain Network Ecosystems

    Authors: Dinh-Hieu Tran, Nazar Waheed, Yuris Mulya Saputra, Xingqin Lin, Cong T. Nguyen, Tedros Salih Abdu, Van Nhan Vo, Van-Quan Pham, Madyan Alsenwi, Abuzar Babikir Mohammad Adam, Symeon Chatzinotas, Eva Lagaunas, Hung Tran, Tu Ho Dac, Nguyen Van Huynh

    Abstract: With the rapid development of technology, the number of smart mobile users is increasing, accompanied by growing demands from applications such as virtual/augmented reality (VR/XR), remote surgery, autonomous vehicles, and real-time holographic communications, all of which require high transmission rates and ultra-low latency in 6G and beyond networks (6G+). This poses enormous challenges in effic… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 40 pages, 15 figures

  5. arXiv:2505.21954  [pdf, ps, other

    cs.CV cs.AI

    UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios

    Authors: Le Thien Phuc Nguyen, Zhuoran Yu, Khoa Quang Nhat Cao, Yuwei Guo, Tu Ho Manh Pham, Tuan Tai Nguyen, Toan Ngo Duc Vo, Lucas Poon, Soochahn Lee, Yong Jae Lee

    Abstract: We present UniTalk, a novel dataset specifically designed for the task of active speaker detection, emphasizing challenging scenarios to enhance model generalization. Unlike previously established benchmarks such as AVA, which predominantly features old movies and thus exhibits significant domain gaps, UniTalk focuses explicitly on diverse and difficult real-world conditions. These include underre… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  6. Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models

    Authors: Hongwei Shang, Nguyen Vo, Nitin Yadav, Tian Zhang, Ajit Puthenputhussery, Xunfan Cai, Shuyi Chen, Prijith Chandran, Changsung Kang

    Abstract: Ensuring the products displayed in e-commerce search results are relevant to users queries is crucial for improving the user experience. With their advanced semantic understanding, deep learning models have been widely used for relevance matching in search tasks. While large language models (LLMs) offer superior ranking capabilities, it is challenging to deploy LLMs in real-time systems due to the… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

    Comments: 9 pages, published at WWWW'25

    Journal ref: The Web Conference 2025

  7. arXiv:2505.06825  [pdf, other

    cs.CV cs.LG

    Active Learning for Multi-class Image Classification

    Authors: Thien Nhan Vo

    Abstract: A principle bottleneck in image classification is the large number of training examples needed to train a classifier. Using active learning, we can reduce the number of training examples to teach a CNN classifier by strategically selecting examples. Assigning values to image examples using different uncertainty metrics allows the model to identify and select high-value examples in a smaller traini… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

  8. arXiv:2505.06818  [pdf, other

    cs.LG

    Deep Learning for On-Street Parking Violation Prediction

    Authors: Thien Nhan Vo

    Abstract: Illegal parking along with the lack of available parking spaces are among the biggest issues faced in many large cities. These issues can have a significant impact on the quality of life of citizens. On-street parking systems have been designed to this end aiming at ensuring that parking spaces will be available for the local population, while also providing easy access to parking for people visit… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

  9. arXiv:2505.04097  [pdf, ps, other

    eess.IV cs.CV cs.LG

    3D Brain MRI Classification for Alzheimer Diagnosis Using CNN with Data Augmentation

    Authors: Thien Nhan Vo, Bac Nam Ho

    Abstract: A three-dimensional convolutional neural network was developed to classify T1-weighted brain MRI scans as healthy or Alzheimer. The network comprises 3D convolution, pooling, batch normalization, dense ReLU layers, and a sigmoid output. Using stochastic noise injection and five-fold cross-validation, the model achieved test set accuracy of 0.912 and area under the ROC curve of 0.961, an improvemen… ▽ More

    Submitted 17 June, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

  10. arXiv:2505.03617  [pdf, ps, other

    cs.LG

    Understand the Effect of Importance Weighting in Deep Learning on Dataset Shift

    Authors: Thien Nhan Vo

    Abstract: We evaluate the effectiveness of importance weighting in deep neural networks under label shift and covariate shift. On synthetic 2D data (linearly separable and moon-shaped) using logistic regression and MLPs, we observe that weighting strongly affects decision boundaries early in training but fades with prolonged optimization. On CIFAR-10 with various class imbalances, only L2 regularization (no… ▽ More

    Submitted 17 June, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

  11. arXiv:2503.06380  [pdf, other

    cs.CV cs.CL

    TI-JEPA: An Innovative Energy-based Joint Embedding Strategy for Text-Image Multimodal Systems

    Authors: Khang H. N. Vo, Duc P. T. Nguyen, Thong Nguyen, Tho T. Quan

    Abstract: This paper focuses on multimodal alignment within the realm of Artificial Intelligence, particularly in text and image modalities. The semantic gap between the textual and visual modality poses a discrepancy problem towards the effectiveness of multi-modalities fusion. Therefore, we introduce Text-Image Joint Embedding Predictive Architecture (TI-JEPA), an innovative pre-training strategy that lev… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

  12. arXiv:2410.04692  [pdf, other

    cs.LG stat.ML

    A Clifford Algebraic Approach to E(n)-Equivariant High-order Graph Neural Networks

    Authors: Viet-Hoang Tran, Thieu N. Vo, Tho Tran Huu, Tan Minh Nguyen

    Abstract: Designing neural network architectures that can handle data symmetry is crucial. This is especially important for geometric graphs whose properties are equivariance under Euclidean transformations. Current equivariant graph neural networks (EGNNs), particularly those using message passing, have a limitation in expressive power. Recent high-order graph neural networks can overcome this limitation,… ▽ More

    Submitted 13 March, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

  13. arXiv:2410.04213  [pdf, ps, other

    cs.LG

    Equivariant Polynomial Functional Networks

    Authors: Thieu N. Vo, Viet-Hoang Tran, Tho Tran Huu, An Nguyen The, Thanh Tran, Minh-Khoi Nguyen-Nhat, Duy-Tung Pham, Tan Minh Nguyen

    Abstract: Neural Functional Networks (NFNs) have gained increasing interest due to their wide range of applications, including extracting information from implicit representations of data, editing network weights, and evaluating policies. A key design principle of NFNs is their adherence to the permutation and scaling symmetries inherent in the connectionist structure of the input neural networks. Recent NF… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

  14. arXiv:2410.04209  [pdf, other

    cs.LG

    Equivariant Neural Functional Networks for Transformers

    Authors: Viet-Hoang Tran, Thieu N. Vo, An Nguyen The, Tho Tran Huu, Minh-Khoi Nguyen-Nhat, Thanh Tran, Duy-Tung Pham, Tan Minh Nguyen

    Abstract: This paper systematically explores neural functional networks (NFN) for transformer architectures. NFN are specialized neural networks that treat the weights, gradients, or sparsity patterns of a deep neural network (DNN) as input data and have proven valuable for tasks such as learnable optimizers, implicit data representations, and weight editing. While NFN have been extensively developed for ML… ▽ More

    Submitted 7 March, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted in ICLR 2025

  15. arXiv:2410.03292  [pdf, other

    cs.LG

    Demystifying the Token Dynamics of Deep Selective State Space Models

    Authors: Thieu N Vo, Tung D. Pham, Xin T. Tong, Tan Minh Nguyen

    Abstract: Selective state space models (SSM), such as Mamba, have gained prominence for their effectiveness in modeling sequential data. Despite their outstanding empirical performance, a comprehensive theoretical understanding of deep selective SSM remains elusive, hindering their further development and adoption for applications that need high fidelity. In this paper, we investigate the dynamical properti… ▽ More

    Submitted 7 March, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted at ICLR 2025 (spotlight)

  16. SplitVAEs: Decentralized scenario generation from siloed data for stochastic optimization problems

    Authors: H M Mohaimanul Islam, Huynh Q. N. Vo, Paritosh Ramanan

    Abstract: Stochastic optimization problems in large-scale multi-stakeholder networked systems (e.g., power grids and supply chains) rely on data-driven scenarios to encapsulate complex spatiotemporal interdependencies. However, centralized aggregation of stakeholder data is challenging due to the existence of data silos resulting from computational and logistical bottlenecks. In this paper, we present Split… ▽ More

    Submitted 30 January, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: This work has been published to the 2024 IEEE International Conference on Big Data

  17. arXiv:2409.11697  [pdf, other

    cs.LG

    Monomial Matrix Group Equivariant Neural Functional Networks

    Authors: Viet-Hoang Tran, Thieu N. Vo, Tho H. Tran, An T. Nguyen, Tan M. Nguyen

    Abstract: Neural functional networks (NFNs) have recently gained significant attention due to their diverse applications, ranging from predicting network generalization and network editing to classifying implicit neural representation. Previous NFN designs often depend on permutation symmetries in neural networks' weights, which traditionally arise from the unordered arrangement of neurons in hidden layers.… ▽ More

    Submitted 13 March, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: 10 pages in the main text. Published at NeurIPS 2024. The code is available at https://github.com/MathematicalAI-NUS/Monomial-NFN

  18. The Lynchpin of In-Memory Computing: A Benchmarking Framework for Vector-Matrix Multiplication in RRAMs

    Authors: Md Tawsif Rahman Chowdhury, Huynh Quang Nguyen Vo, Paritosh Ramanan, Murat Yildirim, Gozde Tutuncuoglu

    Abstract: The Von Neumann bottleneck, a fundamental challenge in conventional computer architecture, arises from the inability to execute fetch and data operations simultaneously due to a shared bus linking processing and memory units. This bottleneck significantly limits system performance, increases energy consumption, and exacerbates computational complexity. Emerging technologies such as Resistive Rando… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: ICONS 2024.Copyright 2024 IEEE.Personal use of this material is permitted.Permission from IEEE must be obtained for all other uses,in any current or future media,including reprinting/republishing this material for advertising or promotional purposes,creating new collective works,for resale or redistribution to servers or lists or reuse of any copyrighted component of this work in other works

  19. arXiv:2408.10974  [pdf, other

    cs.NI

    Deep Reinforcement Learning for Network Energy Saving in 6G and Beyond Networks

    Authors: Dinh-Hieu Tran, Nguyen Van Huynh, Soumeya Kaada, Van Nhan Vo, Eva Lagunas, Symeon Chatzinotas

    Abstract: Network energy saving has received great attention from operators and vendors to reduce energy consumption and CO2 emissions to the environment as well as significantly reduce costs for mobile network operators. However, the design of energy-saving networks also needs to ensure the mobile users' (MUs) QoS requirements such as throughput requirements (TR). This work considers a mobile cellular netw… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 7 pages, 4 figures

  20. arXiv:2407.15021  [pdf, other

    cs.CL

    Enhancing Incremental Summarization with Structured Representations

    Authors: EunJeong Hwang, Yichao Zhou, James Bradley Wendt, Beliz Gunel, Nguyen Vo, Jing Xie, Sandeep Tata

    Abstract: Large language models (LLMs) often struggle with processing extensive input contexts, which can lead to redundant, inaccurate, or incoherent summaries. Recent methods have used unstructured memory to incrementally process these contexts, but they still suffer from information overload due to the volume of unstructured data handled. In our study, we introduce structured knowledge representations (… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  21. arXiv:2405.06807  [pdf, other

    cs.CL cs.SE

    Execution-Based Evaluation of Natural Language to Bash and PowerShell for Incident Remediation

    Authors: Ngoc Phuoc An Vo, Brent Paulovicks, Vadim Sheinin

    Abstract: Given recent advancements of Large Language Models (LLMs), code generation tasks attract immense attention for wide application in different domains. In an effort to evaluate and select a best model to automatically remediate system incidents discovered by Application Performance Monitoring (APM) platforms, it is crucial to verify if the generated code is syntactically and semantically correct, an… ▽ More

    Submitted 16 December, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  22. ViTHSD: Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts

    Authors: Cuong Nhat Vo, Khanh Bao Huynh, Son T. Luu, Trong-Hop Do

    Abstract: The growth of social networks makes toxic content spread rapidly. Hate speech detection is a task to help decrease the number of harmful comments. With the diversity in the hate speech created by users, it is necessary to interpret the hate speech besides detecting it. Hence, we propose a methodology to construct a system for targeted hate speech detection from online streaming texts from social m… ▽ More

    Submitted 8 February, 2025; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Published at Journal of Computational Social Science

  23. arXiv:2403.19710  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    STRUM-LLM: Attributed and Structured Contrastive Summarization

    Authors: Beliz Gunel, James B. Wendt, Jing Xie, Yichao Zhou, Nguyen Vo, Zachary Fisher, Sandeep Tata

    Abstract: Users often struggle with decision-making between two options (A vs B), as it usually requires time-consuming research across multiple web pages. We propose STRUM-LLM that addresses this challenge by generating attributed, structured, and helpful contrastive summaries that highlight key differences between the two options. STRUM-LLM identifies helpful contrast: the specific attributes along which… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  24. arXiv:2403.19161  [pdf, other

    cs.CL

    Improving Vietnamese-English Medical Machine Translation

    Authors: Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi, Wray Buntine

    Abstract: Machine translation for Vietnamese-English in the medical domain is still an under-explored research area. In this paper, we introduce MedEV -- a high-quality Vietnamese-English parallel dataset constructed specifically for the medical domain, comprising approximately 360K sentence pairs. We conduct extensive experiments comparing Google Translate, ChatGPT (gpt-3.5-turbo), state-of-the-art Vietnam… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: To appear in Proceedings of LREC-COLING 2024

  25. arXiv:2402.04821  [pdf, other

    cs.LG

    E(3)-Equivariant Mesh Neural Networks

    Authors: Thuan Trang, Nhat Khang Ngo, Daniel Levy, Thieu N. Vo, Siamak Ravanbakhsh, Truong Son Hy

    Abstract: Triangular meshes are widely used to represent three-dimensional objects. As a result, many recent works have address the need for geometric deep learning on 3D mesh. However, we observe that the complexities in many of these architectures does not translate to practical performance, and simple deep models for geometric graphs are competitive in practice. Motivated by this observation, we minimall… ▽ More

    Submitted 18 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  26. arXiv:2312.05448  [pdf, other

    cs.CL

    Domain Adaptation of a State of the Art Text-to-SQL Model: Lessons Learned and Challenges Found

    Authors: Irene Manotas, Octavian Popescu, Ngoc Phuoc An Vo, Vadim Sheinin

    Abstract: There are many recent advanced developments for the Text-to-SQL task, where the Picard model is one of the the top performing models as measured by the Spider dataset competition. However, bringing Text-to-SQL systems to realistic use-cases through domain adaptation remains a tough challenge. We analyze how well the base T5 Language Model and Picard perform on query structures different from the S… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    ACM Class: I.2.7

  27. arXiv:2205.00630  [pdf, other

    cs.CV cs.AI

    Design equivariant neural networks for 3D point cloud

    Authors: Thuan N. A. Trang, Thieu N. Vo, Khuong D. Nguyen

    Abstract: This work seeks to improve the generalization and robustness of existing neural networks for 3D point clouds by inducing group equivariance under general group transformations. The main challenge when designing equivariant models for point clouds is how to trade-off the performance of the model and the complexity. Existing equivariant models are either too complicate to implement or very high comp… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

  28. arXiv:2108.02929  [pdf

    cs.CV

    VinaFood21: A Novel Dataset for Evaluating Vietnamese Food Recognition

    Authors: Thuan Trong Nguyen, Thuan Q. Nguyen, Dung Vo, Vi Nguyen, Ngoc Ho, Nguyen D. Vo, Kiet Van Nguyen, Khang Nguyen

    Abstract: Vietnam is such an attractive tourist destination with its stunning and pristine landscapes and its top-rated unique food and drink. Among thousands of Vietnamese dishes, foreigners and native people are interested in easy-to-eat tastes and easy-to-do recipes, along with reasonable prices, mouthwatering flavors, and popularity. Due to the diversity and almost all the dishes have significant simila… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

  29. Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions

    Authors: Abhijit Guha Roy, Jie Ren, Shekoofeh Azizi, Aaron Loh, Vivek Natarajan, Basil Mustafa, Nick Pawlowski, Jan Freyberg, Yuan Liu, Zach Beaver, Nam Vo, Peggy Bui, Samantha Winter, Patricia MacWilliams, Greg S. Corrado, Umesh Telang, Yun Liu, Taylan Cemgil, Alan Karthikesalingam, Balaji Lakshminarayanan, Jim Winkens

    Abstract: We develop and rigorously evaluate a deep learning based system that can accurately classify skin conditions while detecting rare conditions for which there is not enough data available for training a confident classifier. We frame this task as an out-of-distribution (OOD) detection problem. Our novel approach, hierarchical outlier detection (HOD) assigns multiple abstention classes for each train… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: Under Review, 19 Pages

    Journal ref: Medical Image Analysis (2022)

  30. arXiv:2104.00660  [pdf, other

    cs.CL

    Recognizing and Splitting Conditional Sentences for Automation of Business Processes Management

    Authors: Ngoc Phuoc An Vo, Irene Manotas, Octavian Popescu, Algimantas Cerniauskas, Vadim Sheinin

    Abstract: Business Process Management (BPM) is the discipline which is responsible for management of discovering, analyzing, redesigning, monitoring, and controlling business processes. One of the most crucial tasks of BPM is discovering and modelling business processes from text documents. In this paper, we present our system that resolves an end-to-end problem consisting of 1) recognizing conditional sent… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: Preprint

  31. arXiv:2102.02680  [pdf, other

    cs.AI cs.IR

    Hierarchical Multi-head Attentive Network for Evidence-aware Fake News Detection

    Authors: Nguyen Vo, Kyumin Lee

    Abstract: The widespread of fake news and misinformation in various domains ranging from politics, economics to public health has posed an urgent need to automatically fact-check information. A recent trend in fake news detection is to utilize evidence from external sources. However, existing evidence-aware fake news detection methods focused on either only word-level attention or evidence-level attention,… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: EACL2021

  32. arXiv:2101.02415  [pdf, other

    cs.LG cs.CL

    Simplified DOM Trees for Transferable Attribute Extraction from the Web

    Authors: Yichao Zhou, Ying Sheng, Nguyen Vo, Nick Edmonds, Sandeep Tata

    Abstract: There has been a steady need to precisely extract structured knowledge from the web (i.e. HTML documents). Given a web page, extracting a structured object along with various attributes of interest (e.g. price, publisher, author, and genre for a book) can facilitate a variety of downstream applications such as large-scale knowledge base construction, e-commerce product search, and personalized rec… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 10 pages, 9 figures

  33. FreeDOM: A Transferable Neural Architecture for Structured Information Extraction on Web Documents

    Authors: Bill Yuchen Lin, Ying Sheng, Nguyen Vo, Sandeep Tata

    Abstract: Extracting structured data from HTML documents is a long-studied problem with a broad range of applications like augmenting knowledge bases, supporting faceted search, and providing domain-specific experiences for key verticals like shopping and movies. Previous approaches have either required a small number of examples for each target site or relied on carefully handcrafted heuristics built over… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: in Proc. of KDD 2020 (Research Track). Figure 5 updated

  34. arXiv:2010.03159  [pdf, other

    cs.IR cs.AI

    Where Are the Facts? Searching for Fact-checked Information to Alleviate the Spread of Fake News

    Authors: Nguyen Vo, Kyumin Lee

    Abstract: Although many fact-checking systems have been developed in academia and industry, fake news is still proliferating on social media. These systems mostly focus on fact-checking but usually neglect online users who are the main drivers of the spread of misinformation. How can we use fact-checked information to improve users' consciousness of fake news to which they are exposed? How can we stop users… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Full paper, EMNLP 2020

  35. arXiv:2002.03157  [pdf, other

    cs.CV

    Towards Reading Beyond Faces for Sparsity-Aware 4D Affect Recognition

    Authors: Muzammil Behzad, Nhat Vo, Xiaobai Li, Guoying Zhao

    Abstract: In this paper, we present a sparsity-aware deep network for automatic 4D facial expression recognition (FER). Given 4D data, we first propose a novel augmentation method to combat the data limitation problem for deep learning. This is achieved by projecting the input data into RGB and depth map images and then iteratively performing randomized channel concatenation. Encoded in the given 3D landmar… ▽ More

    Submitted 19 August, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

  36. arXiv:2001.04118  [pdf, other

    cs.CV eess.IV eess.SP

    A Bayesian Filter for Multi-view 3D Multi-object Tracking with Occlusion Handling

    Authors: Jonah Ong, Ba Tuong Vo, Ba Ngu Vo, Du Yong Kim, Sven Nordholm

    Abstract: This paper proposes an online multi-camera multi-object tracker that only requires monocular detector training, independent of the multi-camera configurations, allowing seamless extension/deletion of cameras without retraining effort. The proposed algorithm has a linear complexity in the total number of detections across the cameras, and hence scales gracefully with the number of cameras. It opera… ▽ More

    Submitted 27 October, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: 18 pages, 11 figures, TPAMI

  37. arXiv:2001.02214  [pdf, other

    cs.IR cs.CL cs.SI

    Attributed Multi-Relational Attention Network for Fact-checking URL Recommendation

    Authors: Di You, Nguyen Vo, Kyumin Lee, Qiang Liu

    Abstract: To combat fake news, researchers mostly focused on detecting fake news and journalists built and maintained fact-checking sites (e.g., Snopes.com and Politifact.com). However, fake news dissemination has been greatly promoted via social media sites, and these fact-checking sites have not been fully utilized. To overcome these problems and complement existing methods against fake news, in this pape… ▽ More

    Submitted 7 January, 2020; originally announced January 2020.

    Comments: CIKM2019

  38. arXiv:1910.05445  [pdf, other

    cs.CV

    Landmarks-assisted Collaborative Deep Framework for Automatic 4D Facial Expression Recognition

    Authors: Muzammil Behzad, Nhat Vo, Xiaobai Li, Guoying Zhao

    Abstract: We propose a novel landmarks-assisted collaborative end-to-end deep framework for automatic 4D FER. Using 4D face scan data, we calculate its various geometrical images, and afterwards use rank pooling to generate their dynamic images encapsulating important facial muscle movements over time. As well, the given 3D landmarks are projected on a 2D plane as binary images and convolutional layers are… ▽ More

    Submitted 7 February, 2020; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: Published in 15th IEEE International Conference on Automatic Face and Gesture Recognition

  39. arXiv:1910.02202  [pdf, other

    cs.CL cs.AI cs.IR

    Learning from Fact-checkers: Analysis and Generation of Fact-checking Language

    Authors: Nguyen Vo, Kyumin Lee

    Abstract: In fighting against fake news, many fact-checking systems comprised of human-based fact-checking sites (e.g., snopes.com and politifact.com) and automatic detection systems have been developed in recent years. However, online users still keep sharing fake news even when it has been debunked. It means that early fake news detection may be insufficient and we need another complementary approach to m… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: SIGIR 2019

  40. arXiv:1905.02319  [pdf, other

    cs.CV

    Automatic 4D Facial Expression Recognition via Collaborative Cross-domain Dynamic Image Network

    Authors: Muzammil Behzad, Nhat Vo, Xiaobai Li, Guoying Zhao

    Abstract: This paper proposes a novel 4D Facial Expression Recognition (FER) method using Collaborative Cross-domain Dynamic Image Network (CCDN). Given a 4D data of face scans, we first compute its geometrical images, and then combine their correlated information in the proposed cross-domain image representations. The acquired set is then used to generate cross-domain dynamic images (CDI) via rank pooling… ▽ More

    Submitted 7 February, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

    Comments: Published in the 30th British Machine Vision Conference (BMVC) 2019

  41. arXiv:1903.00793  [pdf, other

    cs.CV

    Let's Transfer Transformations of Shared Semantic Representations

    Authors: Nam Vo, Lu Jiang, James Hays

    Abstract: With a good image understanding capability, can we manipulate the images high level semantic representation? Such transformation operation can be used to generate or retrieve similar images but with a desired modification (for example changing beach background to street background); similar ability has been demonstrated in zero shot learning, attribute composition and attribute manipulation image… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

  42. arXiv:1901.11048  [pdf, ps, other

    cs.SC math.AC math.AG

    Rational Solutions of First-Order Algebraic Ordinary Difference Equations

    Authors: Thieu N. Vo, Yi Zhang

    Abstract: We propose an algebraic geometric approach for studying rational solutions of first-order algebraic ordinary difference equations. For an autonomous first-order algebraic ordinary difference equations, we give an upper bound for the degrees of its rational solutions, and thus derive a complete algorithm for computing corresponding rational solutions.

    Submitted 1 February, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

  43. arXiv:1812.07119  [pdf, other

    cs.CV

    Composing Text and Image for Image Retrieval - An Empirical Odyssey

    Authors: Nam Vo, Lu Jiang, Chen Sun, Kevin Murphy, Li-Jia Li, Li Fei-Fei, James Hays

    Abstract: In this paper, we study the task of image retrieval, where the input query is specified in the form of an image plus some text that describes desired modifications to the input image. For example, we may present an image of the Eiffel tower, and ask the system to find images which are visually similar but are modified in small ways, such as being taken at nighttime instead of during the day. To ta… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

  44. arXiv:1806.08895  [pdf, other

    cs.DC

    MRAttractor: Detecting Communities from Large-Scale Graphs

    Authors: Nguyen Vo, Kyumin Lee, Thanh Tran

    Abstract: Detecting groups of users, who have similar opinions, interests, or social behavior, has become an important task for many applications. A recent study showed that dynamic distance based Attractor, a community detection algorithm, outperformed other community detection algorithms such as Spectral clustering, Louvain and Infomap, achieving higher Normalized Mutual Information (NMI) and Adjusted Ran… ▽ More

    Submitted 22 June, 2018; originally announced June 2018.

    Comments: Full paper accepted at IEEE Big Data 2017

  45. arXiv:1806.07516  [pdf, other

    cs.IR cs.SI

    The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News

    Authors: Nguyen Vo, Kyumin Lee

    Abstract: A large body of research work and efforts have been focused on detecting fake news and building online fact-check systems in order to debunk fake news as soon as possible. Despite the existence of these systems, fake news is still wildly shared by online users. It indicates that these systems may not be fully utilized. After detecting fake news, what is the next step to stop people from sharing it… ▽ More

    Submitted 9 October, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: SIGIR 2018

  46. arXiv:1803.09646  [pdf, ps, other

    cs.SC

    On Existence and Uniqueness of Formal Power Series Solutions of Algebraic Ordinary Differential Equations

    Authors: Sebastian Falkensteiner, Yi Zhang, Thieu N. Vo

    Abstract: Given an algebraic ordinary differential equation (AODE), we propose a computational method to determine when a truncated power series can be extended to a formal power series solution. If a certain regularity condition on the given AODE or on the initial values is fulfilled, we compute all of the solutions. Moreover, when the existence is confirmed, we present the algebraic structure of the set o… ▽ More

    Submitted 2 July, 2021; v1 submitted 26 March, 2018; originally announced March 2018.

  47. arXiv:1803.03310  [pdf, other

    cs.CV

    Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer?

    Authors: Nam Vo, James Hays

    Abstract: This work studies deep metric learning under small to medium scale data as we believe that better generalization could be a contributing factor to the improvement of previous fine-grained image retrieval methods; it should be considered when designing future techniques. In particular, we investigate using other layers in a deep metric learning system (besides the embedding layer) for feature extra… ▽ More

    Submitted 10 December, 2018; v1 submitted 8 March, 2018; originally announced March 2018.

    Comments: new version for WACV

  48. arXiv:1710.04743  [pdf, other

    cs.CY cs.AI cs.SI

    Identifying On-time Reward Delivery Projects with Estimating Delivery Duration on Kickstarter

    Authors: Thanh Tran, Kyumin Lee, Nguyen Vo, Hongkyu Choi

    Abstract: In Crowdfunding platforms, people turn their prototype ideas into real products by raising money from the crowd, or invest in someone else's projects. In reward-based crowdfunding platforms such as Kickstarter and Indiegogo, selecting accurate reward delivery duration becomes crucial for creators, backers, and platform providers to keep the trust between the creators and the backers, and the trust… ▽ More

    Submitted 12 October, 2017; originally announced October 2017.

  49. arXiv:1709.04174  [pdf, ps, other

    cs.SC

    Rational Solutions of High-Order Algebraic Ordinary Differential Equations

    Authors: Thieu N. Vo, Yi Zhang

    Abstract: We consider algebraic ordinary differential equations (AODEs) and study their polynomial and rational solutions. A sufficient condition for an AODE to have a degree bound for its polynomial solutions is presented. An AODE satisfying this condition is called \emph{noncritical}. We prove that usual low order classes of AODEs are noncritical. For rational solutions, we determine a class of AODEs, whi… ▽ More

    Submitted 22 April, 2018; v1 submitted 13 September, 2017; originally announced September 2017.

  50. arXiv:1705.04838  [pdf, other

    cs.CV

    Revisiting IM2GPS in the Deep Learning Era

    Authors: Nam Vo, Nathan Jacobs, James Hays

    Abstract: Image geolocalization, inferring the geographic location of an image, is a challenging computer vision problem with many potential applications. The recent state-of-the-art approach to this problem is a deep image classification approach in which the world is spatially divided into cells and a deep network is trained to predict the correct cell for a given image. We propose to combine this approac… ▽ More

    Submitted 13 May, 2017; originally announced May 2017.