-
Probabilistic Assessment of West Nile Virus Spillover Risk Using a Compartmental Mechanistic Model
Authors:
Saman Hosseini,
Lee W. Cohnstaedt,
Matin Marjani,
Caterina Scoglio
Abstract:
This paper presents a novel probabilistic approach for assessing the risk of West Nile Disease (WND) spillover to the human population. The assessment has been conducted under two different scenarios: (1) assessment of the onset of spillover, and (2) assessment of the severity of the epidemic after the onset of the disease. A compartmental model of differential equations is developed to describe t…
▽ More
This paper presents a novel probabilistic approach for assessing the risk of West Nile Disease (WND) spillover to the human population. The assessment has been conducted under two different scenarios: (1) assessment of the onset of spillover, and (2) assessment of the severity of the epidemic after the onset of the disease. A compartmental model of differential equations is developed to describe the disease transmission mechanism, and a probability density function for pathogen spillover to humans is derived based on the model for the assessment of the risk of the spillover onset and the severity of the epidemic. The prediction strategy involves making a long-term forecast and then updating it with a short-term (lead time of two weeks or daily). The methodology is demonstrated using detailed outbreak data from high-case counties in California, including Orange County, Los Angeles County, and Kern County. The predicted results are compared with actual infection dates reported by the California Department of Public Health for 2022-2024 to assess prediction accuracy. The performance accuracy is evaluated using a logarithmic scoring system and compared with one of the most renowned predictive models to assess its effectiveness. In all prediction scenarios, the model demonstrated strong performance. Lastly, the method is applied to explore the impact of global warming on spillover risk, revealing an increasing trend in the number of high-risk days and a shift toward a greater proportion of these days over time for the onset of the disease.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
A Variational Auto-Encoder for Reservoir Monitoring
Authors:
Kristian Gundersen,
Seyyed A. Hosseini,
Anna Oleynik,
Guttorm Alendal
Abstract:
Carbon dioxide Capture and Storage (CCS) is an important strategy in mitigating anthropogenic CO$_2$ emissions. In order for CCS to be successful, large quantities of CO$_2$ must be stored and the storage site conformance must be monitored. Here we present a deep learning method to reconstruct pressure fields and classify the flux out of the storage formation based on the pressure data from Above…
▽ More
Carbon dioxide Capture and Storage (CCS) is an important strategy in mitigating anthropogenic CO$_2$ emissions. In order for CCS to be successful, large quantities of CO$_2$ must be stored and the storage site conformance must be monitored. Here we present a deep learning method to reconstruct pressure fields and classify the flux out of the storage formation based on the pressure data from Above Zone Monitoring Interval (AZMI) wells. The deep learning method is a version of a semi conditional variational auto-encoder tailored to solve two tasks: reconstruction of an incremental pressure field and leakage rate classification. The method, predictions and associated uncertainty estimates are illustrated on the synthetic data from a high-fidelity heterogeneous 2D numerical reservoir model, which was used to simulate subsurface CO$_2$ movement and pressure changes in the AZMI due to a CO$_2$ leakage.
△ Less
Submitted 2 October, 2020; v1 submitted 23 September, 2020;
originally announced September 2020.
-
AdaS: Adaptive Scheduling of Stochastic Gradients
Authors:
Mahdi S. Hosseini,
Konstantinos N. Plataniotis
Abstract:
The choice of step-size used in Stochastic Gradient Descent (SGD) optimization is empirically selected in most training procedures. Moreover, the use of scheduled learning techniques such as Step-Decaying, Cyclical-Learning, and Warmup to tune the step-size requires extensive practical experience--offering limited insight into how the parameters update--and is not consistent across applications. T…
▽ More
The choice of step-size used in Stochastic Gradient Descent (SGD) optimization is empirically selected in most training procedures. Moreover, the use of scheduled learning techniques such as Step-Decaying, Cyclical-Learning, and Warmup to tune the step-size requires extensive practical experience--offering limited insight into how the parameters update--and is not consistent across applications. This work attempts to answer a question of interest to both researchers and practitioners, namely \textit{"how much knowledge is gained in iterative training of deep neural networks?"} Answering this question introduces two useful metrics derived from the singular values of the low-rank factorization of convolution layers in deep neural networks. We introduce the notions of \textit{"knowledge gain"} and \textit{"mapping condition"} and propose a new algorithm called Adaptive Scheduling (AdaS) that utilizes these derived metrics to adapt the SGD learning rate proportionally to the rate of change in knowledge gain over successive iterations. Experimentation reveals that, using the derived metrics, AdaS exhibits: (a) faster convergence and superior generalization over existing adaptive learning methods; and (b) lack of dependence on a validation set to determine when to stop training. Code is available at \url{https://github.com/mahdihosseini/AdaS}.
△ Less
Submitted 11 June, 2020;
originally announced June 2020.
-
Blang: Bayesian declarative modelling of general data structures and inference via algorithms based on distribution continua
Authors:
Alexandre Bouchard-Côté,
Kevin Chern,
Davor Cubranic,
Sahand Hosseini,
Justin Hume,
Matteo Lepur,
Zihui Ouyang,
Giorgio Sgarbi
Abstract:
Consider a Bayesian inference problem where a variable of interest does not take values in a Euclidean space. These "non-standard" data structures are in reality fairly common. They are frequently used in problems involving latent discrete factor models, networks, and domain specific problems such as sequence alignments and reconstructions, pedigrees, and phylogenies. In principle, Bayesian infere…
▽ More
Consider a Bayesian inference problem where a variable of interest does not take values in a Euclidean space. These "non-standard" data structures are in reality fairly common. They are frequently used in problems involving latent discrete factor models, networks, and domain specific problems such as sequence alignments and reconstructions, pedigrees, and phylogenies. In principle, Bayesian inference should be particularly well-suited in such scenarios, as the Bayesian paradigm provides a principled way to obtain confidence assessment for random variables of any type. However, much of the recent work on making Bayesian analysis more accessible and computationally efficient has focused on inference in Euclidean spaces.
In this paper, we introduce Blang, a domain specific language and library aimed at bridging this gap. Blang allows users to perform Bayesian analysis on arbitrary data types while using a declarative syntax similar to BUGS. Blang is augmented with intuitive language additions to create data types of the user's choosing. To perform inference at scale on such arbitrary state spaces, Blang leverages recent advances in sequential Monte Carlo and non-reversible Markov chain Monte Carlo methods.
△ Less
Submitted 23 June, 2021; v1 submitted 22 December, 2019;
originally announced December 2019.
-
SoulMate: Short-text author linking through Multi-aspect temporal-textual embedding
Authors:
Saeed Najafipour,
Saeid Hosseini,
Wen Hua,
Mohammad Reza Kangavari,
Xiaofang Zhou
Abstract:
Linking authors of short-text contents has important usages in many applications, including Named Entity Recognition (NER) and human community detection. However, certain challenges lie ahead. Firstly, the input short-text contents are noisy, ambiguous, and do not follow the grammatical rules. Secondly, traditional text mining methods fail to effectively extract concepts through words and phrases.…
▽ More
Linking authors of short-text contents has important usages in many applications, including Named Entity Recognition (NER) and human community detection. However, certain challenges lie ahead. Firstly, the input short-text contents are noisy, ambiguous, and do not follow the grammatical rules. Secondly, traditional text mining methods fail to effectively extract concepts through words and phrases. Thirdly, the textual contents are temporally skewed, which can affect the semantic understanding by multiple time facets. Finally, using the complementary knowledge-bases makes the results biased to the content of the external database and deviates the understanding and interpretation away from the real nature of the given short text corpus. To overcome these challenges, we devise a neural network-based temporal-textual framework that generates the tightly connected author subgraphs from microblog short-text contents. Our approach, on the one hand, computes the relevance score (edge weight) between the authors through considering a portmanteau of contents and concepts, and on the other hand, employs a stack-wise graph cutting algorithm to extract the communities of the related authors. Experimental results show that compared to other knowledge-centered competitors, our multi-aspect vector space model can achieve a higher performance in linking short-text authors. Additionally, given the author linking task, the more comprehensive the dataset is, the higher the significance of the extracted concepts will be.
△ Less
Submitted 27 October, 2019;
originally announced October 2019.
-
ChOracle: A Unified Statistical Framework for Churn Prediction
Authors:
Ali Khodadadi,
Seyed Abbas Hosseini,
Ehsan Pajouheshgar,
Farnam Mansouri,
Hamid R. Rabiee
Abstract:
User churn is an important issue in online services that threatens the health and profitability of services. Most of the previous works on churn prediction convert the problem into a binary classification task where the users are labeled as churned and non-churned. More recently, some works have tried to convert the user churn prediction problem into the prediction of user return time. In this app…
▽ More
User churn is an important issue in online services that threatens the health and profitability of services. Most of the previous works on churn prediction convert the problem into a binary classification task where the users are labeled as churned and non-churned. More recently, some works have tried to convert the user churn prediction problem into the prediction of user return time. In this approach which is more realistic in real world online services, at each time-step the model predicts the user return time instead of predicting a churn label. However, the previous works in this category suffer from lack of generality and require high computational complexity. In this paper, we introduce \emph{ChOracle}, an oracle that predicts the user churn by modeling the user return times to service by utilizing a combination of Temporal Point Processes and Recurrent Neural Networks. Moreover, we incorporate latent variables into the proposed recurrent neural network to model the latent user loyalty to the system. We also develop an efficient approximate variational algorithm for learning parameters of the proposed RNN by using back propagation through time. Finally, we demonstrate the superior performance of ChOracle on a wide variety of real world datasets.
△ Less
Submitted 15 September, 2019;
originally announced September 2019.
-
A Weight-based Information Filtration Algorithm for Stock-Correlation Networks
Authors:
Seyed Soheil Hosseini,
Nick Wormald,
Tianhai Tian
Abstract:
Several algorithms have been proposed to filter information on a complete graph of correlations across stocks to build a stock-correlation network. Among them the planar maximally filtered graph (PMFG) algorithm uses $3n-6$ edges to build a graph whose features include a high frequency of small cliques and a good clustering of stocks. We propose a new algorithm which we call proportional degree (P…
▽ More
Several algorithms have been proposed to filter information on a complete graph of correlations across stocks to build a stock-correlation network. Among them the planar maximally filtered graph (PMFG) algorithm uses $3n-6$ edges to build a graph whose features include a high frequency of small cliques and a good clustering of stocks. We propose a new algorithm which we call proportional degree (PD) to filter information on the complete graph of normalised mutual information (NMI) across stocks. Our results show that the PD algorithm produces a network showing better homogeneity with respect to cliques, as compared to economic sectoral classification than its PMFG counterpart. We also show that the partition of the PD network obtained through normalised spectral clustering (NSC) agrees better with the NSC of the complete graph than the corresponding one obtained from PMFG. Finally, we show that the clusters in the PD network are more robust with respect to the removal of random sets of edges than those in the PMFG network.
△ Less
Submitted 11 April, 2019;
originally announced April 2019.
-
Deep Convolutional Neural Network for Automated Detection of Mind Wandering using EEG Signals
Authors:
Seyedroohollah Hosseini,
Xuan Guo
Abstract:
Mind wandering (MW) is a ubiquitous phenomenon which reflects a shift in attention from task-related to task-unrelated thoughts. There is a need for intelligent interfaces that can reorient attention when MW is detected due to its detrimental effects on performance and productivity. In this paper, we propose a deep learning model for MW detection using Electroencephalogram (EEG) signals. Specifica…
▽ More
Mind wandering (MW) is a ubiquitous phenomenon which reflects a shift in attention from task-related to task-unrelated thoughts. There is a need for intelligent interfaces that can reorient attention when MW is detected due to its detrimental effects on performance and productivity. In this paper, we propose a deep learning model for MW detection using Electroencephalogram (EEG) signals. Specifically, we develop a channel-wise deep convolutional neural network (CNN) model to classify the features of focusing state and MW extracted from EEG signals. This is the first study that employs CNN to automatically detect MW using only EEG data. The experimental results on the collected dataset demonstrate promising performance with 91.78% accuracy, 92.84% sensitivity, and 90.73% specificity.
△ Less
Submitted 5 February, 2019;
originally announced February 2019.
-
Uncertainty Principle in Distributed MIMO Radars
Authors:
Seyed MohammadReza Hosseini,
Afshin Isazadeh,
Ali Noroozi,
Mohammad Ali Sebt
Abstract:
Radar uncertainty principle indicates that there is an inherent invariance in the product of the time-delay and Doppler-shift measurement accuracy and resolution which can be tuned by the waveform at transmitter. In this paper, based on the radar uncertainty principle, a conceptual waveform design is proposed for a distributed multiple-input multiple-output (MIMO) radar system in order to improve…
▽ More
Radar uncertainty principle indicates that there is an inherent invariance in the product of the time-delay and Doppler-shift measurement accuracy and resolution which can be tuned by the waveform at transmitter. In this paper, based on the radar uncertainty principle, a conceptual waveform design is proposed for a distributed multiple-input multiple-output (MIMO) radar system in order to improve the Cramer-Rao lower bound (CRLB) of the target position and velocity. To this end, a non-convex band constrained optimization problem is formulated, and a local and the global solution to the problem are obtained by sequential quadratic programming (SQP) and particle swarm algorithms, respectively. Numerical results are also included to illustrate the effectiveness of the proposed mechanism on the CRLB of the target position and velocity. By numerical results, it is also concluded that the global solution to the optimization problem is obtained at a vertex of the bounding box.
△ Less
Submitted 23 January, 2019;
originally announced January 2019.
-
Fashion-Gen: The Generative Fashion Dataset and Challenge
Authors:
Negar Rostamzadeh,
Seyedarian Hosseini,
Thomas Boquet,
Wojciech Stokowiec,
Ying Zhang,
Christian Jauvin,
Chris Pal
Abstract:
We introduce a new dataset of 293,008 high definition (1360 x 1360 pixels) fashion images paired with item descriptions provided by professional stylists. Each item is photographed from a variety of angles. We provide baseline results on 1) high-resolution image generation, and 2) image generation conditioned on the given text descriptions. We invite the community to improve upon these baselines.…
▽ More
We introduce a new dataset of 293,008 high definition (1360 x 1360 pixels) fashion images paired with item descriptions provided by professional stylists. Each item is photographed from a variety of angles. We provide baseline results on 1) high-resolution image generation, and 2) image generation conditioned on the given text descriptions. We invite the community to improve upon these baselines. In this paper, we also outline the details of a challenge that we are launching based upon this dataset.
△ Less
Submitted 30 July, 2018; v1 submitted 21 June, 2018;
originally announced June 2018.
-
Recurrent Poisson Factorization for Temporal Recommendation
Authors:
Seyed Abbas Hosseini,
Keivan Alizadeh,
Ali Khodadadi,
Ali Arabzadeh,
Mehrdad Farajtabar,
Hongyuan Zha,
Hamid R. Rabiee
Abstract:
Poisson factorization is a probabilistic model of users and items for recommendation systems, where the so-called implicit consumer data is modeled by a factorized Poisson distribution. There are many variants of Poisson factorization methods who show state-of-the-art performance on real-world recommendation tasks. However, most of them do not explicitly take into account the temporal behavior and…
▽ More
Poisson factorization is a probabilistic model of users and items for recommendation systems, where the so-called implicit consumer data is modeled by a factorized Poisson distribution. There are many variants of Poisson factorization methods who show state-of-the-art performance on real-world recommendation tasks. However, most of them do not explicitly take into account the temporal behavior and the recurrent activities of users which is essential to recommend the right item to the right user at the right time. In this paper, we introduce Recurrent Poisson Factorization (RPF) framework that generalizes the classical PF methods by utilizing a Poisson process for modeling the implicit feedback. RPF treats time as a natural constituent of the model and brings to the table a rich family of time-sensitive factorization models. To elaborate, we instantiate several variants of RPF who are capable of handling dynamic user preferences and item specification (DRPF), modeling the social-aspect of product adoption (SRPF), and capturing the consumption heterogeneity among users and items (HRPF). We also develop a variational algorithm for approximate posterior inference that scales up to massive data sets. Furthermore, we demonstrate RPF's superior performance over many state-of-the-art methods on synthetic dataset, and large scale real-world datasets on music streaming logs, and user-item interactions in M-Commerce platforms.
△ Less
Submitted 4 March, 2017;
originally announced March 2017.
-
HNP3: A Hierarchical Nonparametric Point Process for Modeling Content Diffusion over Social Media
Authors:
Seyed Abbas Hosseini,
Ali Khodadadi,
Soheil Arabzade,
Hamid R. Rabiee
Abstract:
This paper introduces a novel framework for modeling temporal events with complex longitudinal dependency that are generated by dependent sources. This framework takes advantage of multidimensional point processes for modeling time of events. The intensity function of the proposed process is a mixture of intensities, and its complexity grows with the complexity of temporal patterns of data. Moreov…
▽ More
This paper introduces a novel framework for modeling temporal events with complex longitudinal dependency that are generated by dependent sources. This framework takes advantage of multidimensional point processes for modeling time of events. The intensity function of the proposed process is a mixture of intensities, and its complexity grows with the complexity of temporal patterns of data. Moreover, it utilizes a hierarchical dependent nonparametric approach to model marks of events. These capabilities allow the proposed model to adapt its temporal and topical complexity according to the complexity of data, which makes it a suitable candidate for real world scenarios. An online inference algorithm is also proposed that makes the framework applicable to a vast range of applications. The framework is applied to a real world application, modeling the diffusion of contents over networks. Extensive experiments reveal the effectiveness of the proposed framework in comparison with state-of-the-art methods.
△ Less
Submitted 2 October, 2016;
originally announced October 2016.
-
Correction to: "Blind maximum likelihood separation of a linear-quadratic mixture"
Authors:
Shahram Hosseini,
Yannick Deville
Abstract:
An error occurred in the computation of a gradient in our paper entitled "Blind maximum likelihood separation of a linear-quadratic mixture", presented in ICA'2004. The equations (20) in Appendix and (17) in the text were not correct. The current paper presents the correct version of these equations.
An error occurred in the computation of a gradient in our paper entitled "Blind maximum likelihood separation of a linear-quadratic mixture", presented in ICA'2004. The equations (20) in Appendix and (17) in the text were not correct. The current paper presents the correct version of these equations.
△ Less
Submitted 6 January, 2010;
originally announced January 2010.
-
Effect of indirect dependencies on "A mutual information minimization approach for a class of nonlinear recurrent separating systems"
Authors:
Yannick Deville,
Alain Deville,
Shahram Hosseini
Abstract:
In a recent paper [4], Duarte and Jutten investigated the Blind Source Separation (BSS) problem, for the nonlinear mixing model that they introduced in that paper. They proposed to solve this problem by using information-theoretic tools, more precisely by minimizing the mutual information (MI) of the outputs of the separating structure. When applying the MI approach to BSS problems, one usually…
▽ More
In a recent paper [4], Duarte and Jutten investigated the Blind Source Separation (BSS) problem, for the nonlinear mixing model that they introduced in that paper. They proposed to solve this problem by using information-theoretic tools, more precisely by minimizing the mutual information (MI) of the outputs of the separating structure. When applying the MI approach to BSS problems, one usually determines the analytical expressions of the derivatives of the MI with respect to the parameters of the considered separating model. In the literature, these calculations were mainly reported for linear mixtures up to now. They are more complex for nonlinear mixtures, due to dependencies between the considered quantities. Moreover, the notations commonly employed by the BSS community in such calculations may become misleading when using them for nonlinear mixtures, due to the above-mentioned dependencies. We claim that the calculations reported in [4] contain an error, because they did not take into account all these dependencies. In this document, we therefore explain this phenomenon, by showing the effect of indirect dependencies on the application of the MI approach to the mixing and separating models considered in [4]. We thus introduce a corrected expression of the gradient of the considered BSS criterion based on MI. This correct gradient may then e.g. be used to optimize the adaptive coefficients of the considered separating system by means of the well-known gradient descent algorithm. As explained hereafter, this investigation has some similarities with an analysis that we previously reported in another arXiv document [3]. However, these two investigations concern different problems (mixture and separating structure, mathematical tools: see paper).
△ Less
Submitted 25 October, 2009; v1 submitted 23 October, 2009;
originally announced October 2009.