-
TuneUp: A Simple Improved Training Strategy for Graph Neural Networks
Authors:
Weihua Hu,
Kaidi Cao,
Kexin Huang,
Edward W Huang,
Karthik Subbian,
Kenji Kawaguchi,
Jure Leskovec
Abstract:
Despite recent advances in Graph Neural Networks (GNNs), their training strategies remain largely under-explored. The conventional training strategy learns over all nodes in the original graph(s) equally, which can be sub-optimal as certain nodes are often more difficult to learn than others. Here we present TuneUp, a simple curriculum-based training strategy for improving the predictive performan…
▽ More
Despite recent advances in Graph Neural Networks (GNNs), their training strategies remain largely under-explored. The conventional training strategy learns over all nodes in the original graph(s) equally, which can be sub-optimal as certain nodes are often more difficult to learn than others. Here we present TuneUp, a simple curriculum-based training strategy for improving the predictive performance of GNNs. TuneUp trains a GNN in two stages. In the first stage, TuneUp applies conventional training to obtain a strong base GNN. The base GNN tends to perform well on head nodes (nodes with large degrees) but less so on tail nodes (nodes with small degrees). Therefore, the second stage of TuneUp focuses on improving prediction on the difficult tail nodes by further training the base GNN on synthetically generated tail node data. We theoretically analyze TuneUp and show it provably improves generalization performance on tail nodes. TuneUp is simple to implement and applicable to a broad range of GNN architectures and prediction tasks. Extensive evaluation of TuneUp on five diverse GNN architectures, three types of prediction tasks, and both transductive and inductive settings shows that TuneUp significantly improves the performance of the base GNN on tail nodes, while often even improving the performance on head nodes. Altogether, TuneUp produces up to 57.6% and 92.2% relative predictive performance improvement in the transductive and the challenging inductive settings, respectively.
△ Less
Submitted 26 August, 2023; v1 submitted 26 October, 2022;
originally announced October 2022.
-
Combining Accelerometer and Gyroscope Data in Smartphone-Based Activity Recognition using Movelets
Authors:
Emily Huang,
Kebin Yan,
Jukka-Pekka Onnela
Abstract:
Physical activity patterns can be informative about a patient's health status. Traditionally, activity data have been gathered using patient self-report. However, these subjective data can suffer from bias and are difficult to collect over long time periods. Smartphones offer an opportunity to address these challenges. The smartphone has built-in sensors that can be programmed to collect data obje…
▽ More
Physical activity patterns can be informative about a patient's health status. Traditionally, activity data have been gathered using patient self-report. However, these subjective data can suffer from bias and are difficult to collect over long time periods. Smartphones offer an opportunity to address these challenges. The smartphone has built-in sensors that can be programmed to collect data objectively, unobtrusively, and continuously. Due to their widespread adoption, smartphones are also accessible to most of the population. A main challenge in smartphone-based activity recognition is extracting information optimally from multiple sensors to identify the unique features of different activities. In our study, we analyze data collected by the accelerometer and gyroscope, which measure the phone's acceleration and angular velocity, respectively. We propose an extension to the "movelet method" that jointly incorporates both sensors. We also apply this joint-sensor method to a data set we collected previously. The findings show that combining data from the two sensors can result in more accurate activity recognition than using each sensor alone. For example, the joint-sensor method reduces errors of the gyroscope-only method in differentiating between standing and sitting. It also reduces errors of the accelerometer-only method in classifying vigorous activities.
△ Less
Submitted 5 February, 2022; v1 submitted 2 September, 2021;
originally announced September 2021.
-
Detecting Transaction-based Tax Evasion Activities on Social Media Platforms Using Multi-modal Deep Neural Networks
Authors:
Lelin Zhang,
Xi Nan,
Eva Huang,
Sidong Liu
Abstract:
Social media platforms now serve billions of users by providing convenient means of communication, content sharing and even payment between different users. Due to such convenient and anarchic nature, they have also been used rampantly to promote and conduct business activities between unregistered market participants without paying taxes. Tax authorities worldwide face difficulties in regulating…
▽ More
Social media platforms now serve billions of users by providing convenient means of communication, content sharing and even payment between different users. Due to such convenient and anarchic nature, they have also been used rampantly to promote and conduct business activities between unregistered market participants without paying taxes. Tax authorities worldwide face difficulties in regulating these hidden economy activities by traditional regulatory means. This paper presents a machine learning based Regtech tool for international tax authorities to detect transaction-based tax evasion activities on social media platforms. To build such a tool, we collected a dataset of 58,660 Instagram posts and manually labelled 2,081 sampled posts with multiple properties related to transaction-based tax evasion activities. Based on the dataset, we developed a multi-modal deep neural network to automatically detect suspicious posts. The proposed model combines comments, hashtags and image modalities to produce the final output. As shown by our experiments, the combined model achieved an AUC of 0.808 and F1 score of 0.762, outperforming any single modality models. This tool could help tax authorities to identify audit targets in an efficient and effective manner, and combat social e-commerce tax evasion in scale.
△ Less
Submitted 27 July, 2020;
originally announced July 2020.
-
Aortic Pressure Forecasting with Deep Sequence Learning
Authors:
Eliza Huang,
Rui Wang,
Uma Chandrasekaran,
Rose Yu
Abstract:
Mean aortic pressure (MAP) is a major determinant of perfusion in all organs systems. The ability to forecast MAP would enhance the ability of physicians to estimate prognosis of the patient and assist in early detection of hemodynamic instability. However, forecasting MAP is challenging because the blood pressure (BP) time series is noisy and can be highly non-stationary. The aim of this study wa…
▽ More
Mean aortic pressure (MAP) is a major determinant of perfusion in all organs systems. The ability to forecast MAP would enhance the ability of physicians to estimate prognosis of the patient and assist in early detection of hemodynamic instability. However, forecasting MAP is challenging because the blood pressure (BP) time series is noisy and can be highly non-stationary. The aim of this study was to forecast the mean aortic pressure five minutes in advance, using the 25 Hz time series data of previous five minutes as input. We provide a benchmark study of different deep learning models for BP forecasting. We investigate a left ventricular dwelling transvalvular micro-axial device, the Impella, in patients undergoing high-risk percutaneous intervention. The Impella provides hemodynamic support, thus aiding in native heart function recovery. It is also equipped with pressure sensors to capture high frequency MAP measurements at origin, instead of peripherally. Our dataset and the clinical application is novel in the BP forecasting field. We performed a comprehensive study on time series with increasing, decreasing, and stationary trends. The experiments show that recurrent neural networks with Legendre Memory Unit achieve the best performance with an overall forecasting error of 1.8 mmHg.
△ Less
Submitted 16 October, 2020; v1 submitted 11 May, 2020;
originally announced May 2020.
-
Predicting Onset of Dementia in Parkinson's Disease Patients
Authors:
Dhruv Agarwal,
Abhishek Srivastava,
Edward W Huang
Abstract:
Alzheimer's disease (AD) and Parkinson's disease (PD) are the two most common neurodegenerative disorders in humans. Because a significant percentage of patients have clinical and pathological features of both diseases, it has been hypothesized that the patho-cascades of the two diseases overlap. Despite this evidence, these two diseases are rarely studied in a joint manner. In this paper, we util…
▽ More
Alzheimer's disease (AD) and Parkinson's disease (PD) are the two most common neurodegenerative disorders in humans. Because a significant percentage of patients have clinical and pathological features of both diseases, it has been hypothesized that the patho-cascades of the two diseases overlap. Despite this evidence, these two diseases are rarely studied in a joint manner. In this paper, we utilize clinical, imaging, genetic, and biospecimen features to cluster AD and PD patients into the same feature space. By training a machine learning classifier on the combined feature space, we predict the disease stage of patients two years after their baseline visits. We observed a considerable improvement in the prediction accuracy of Parkinson's dementia patients due to combined training on Alzheimer's and Parkinson's patients, thereby affirming the claim that these two diseases can be jointly studied.
△ Less
Submitted 2 June, 2019;
originally announced June 2019.
-
Transfer Learning via Latent Factor Modeling to Improve Prediction of Surgical Complications
Authors:
Elizabeth C Lorenzi,
Zhifei Sun,
Erich Huang,
Ricardo Henao,
Katherine A Heller
Abstract:
We aim to create a framework for transfer learning using latent factor models to learn the dependence structure between a larger source dataset and a target dataset. The methodology is motivated by our goal of building a risk-assessment model for surgery patients, using both institutional and national surgical outcomes data. The national surgical outcomes data is collected through NSQIP (National…
▽ More
We aim to create a framework for transfer learning using latent factor models to learn the dependence structure between a larger source dataset and a target dataset. The methodology is motivated by our goal of building a risk-assessment model for surgery patients, using both institutional and national surgical outcomes data. The national surgical outcomes data is collected through NSQIP (National Surgery Quality Improvement Program), a database housing almost 4 million patients from over 700 different hospitals. We build a latent factor model with a hierarchical prior on the loadings matrix to appropriately account for the different covariance structure in our data. We extend this model to handle more complex relationships between the populations by deriving a scale mixture formulation using stick-breaking properties. Our model provides a transfer learning framework that utilizes all information from both the source and target data, while modeling the underlying inherent differences between them.
△ Less
Submitted 1 December, 2016;
originally announced December 2016.