Search | arXiv e-print repository

arXiv:2005.11353

A Tree Architecture of LSTM Networks for Sequential Regression with Missing Data

Authors: S. Onur Sahin, Suleyman S. Kozat

Abstract: We investigate regression for variable length sequential data containing missing samples and introduce a novel tree architecture based on the Long Short-Term Memory (LSTM) networks. In our architecture, we employ a variable number of LSTM networks, which use only the existing inputs in the sequence, in a tree-like architecture without any statistical assumptions or imputations on the missing data, unlike all the previous approaches. In particular, we incorporate the missingness information by selecting a subset of these LSTM networks based on "presence-pattern" of a certain number of previous inputs. From the mixture of experts perspective, we train different LSTM networks as our experts for various missingness patterns and then combine their outputs to generate the final prediction. We also provide the computational complexity analysis of the proposed architecture, which is in the same order of the complexity of the conventional LSTM architectures for the sequence length. Our method can be readily extended to similar structures such as GRUs, RNNs as remarked in the paper. In the experiments, we achieve significant performance improvements with respect to the state-of-the-art methods for the well-known financial and real life datasets.

Submitted 22 May, 2020; originally announced May 2020.

Comments: 10 pages, 8 figures

arXiv:2004.04835

COVID-19 in a social reinsurance framework: Forewarned is forearmed

Authors: S. Sahin, M. C. Boado-Penas, C. Constantinescu, J. Eisenberg, K. Henshaw, M. Hu, J. Wang, W. Zhu

Abstract: The crisis caused by COVID-19 revealed the global unpreparedness to handle the impact of a pandemic. In this paper, we present a statistical analysis of the data related to the COVID-19 outbreak in China, specifically the infection speed, death and fatality rates in Hubei province. By fitting distributions of these quantities we design a parametric reinsurance contract whose trigger and cap are based on the probability distributions of the infection speed, death and fatality rates. In particular, fitting the distribution for the infection speed and death rates we provide a measure of the effectiveness of a state's action during an epidemic, and propose a reinsurance contract as a supplement to a state's social insurance to alleviate financial costs.

Submitted 9 April, 2020; originally announced April 2020.

arXiv:1912.04406

Semiparametric Regression for Dual Population Mortality

Authors: Gary Venter, Şule Şahin

Abstract: Parameter shrinkage applied optimally can always reduce error and projection variances from those of maximum likelihood estimation. Many variables that actuaries use are on numerical scales, like age or year, which require parameters at each point. Rather than shrinking these towards zero, nearby parameters are better shrunk towards each other. Semiparametric regression is a statistical discipline for building curves across parameter classes using shrinkage methodology. It is similar to but more parsimonious than cubic splines. We introduce it in the context of Bayesian shrinkage and apply it to joint mortality modeling for related populations. Bayesian shrinkage of slope changes of linear splines is an approach to semiparametric modeling that evolved in the actuarial literature. It has some theoretical and practical advantages, like closed-form curves, direct and transparent determination of degree of shrinkage and of placing knots for the splines, and quantifying goodness of fit. It is also relatively easy to apply to the many nonlinear models that arise in actuarial work. We find that it compares well to a more complex state-of-the-art statistical spline shrinkage approach on a popular example from that literature.

Submitted 20 December, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

Comments: 39 pages, 8 graphs

arXiv:1907.05283

A Comparison of Super-Resolution and Nearest Neighbors Interpolation Applied to Object Detection on Satellite Data

Authors: Evan Koester, Cem Safak Sahin

Abstract: As Super-Resolution (SR) has matured as a research topic, it has been applied to additional topics beyond image reconstruction. In particular, combining classification or object detection tasks with a super-resolution preprocessing stage has yielded improvements in accuracy especially with objects that are small relative to the scene. While SR has shown promise, a study comparing SR and naive upscaling methods such as Nearest Neighbors (NN) interpolation when applied as a preprocessing step for object detection has not been performed. We apply the topic to satellite data and compare the Multi-scale Deep Super-Resolution (MDSR) system to NN on the xView challenge dataset. To do so, we propose a pipeline for processing satellite data that combines multi-stage image tiling and upscaling, the YOLOv2 object detection architecture, and label stitching. We compare the effects of training models using an upscaling factor of 4, upscaling images from 30cm Ground Sample Distance (GSD) to an effective GSD of 7.5cm. Upscaling by this factor significantly improves detection results, increasing Average Precision (AP) of a generalized vehicle class by 23 percent. We demonstrate that while SR produces upscaled images that are more visually pleasing than their NN counterparts, object detection networks see little difference in accuracy with images upsampled using NN obtaining nearly identical results to the MDSRx4 enhanced images with a difference of 0.0002 AP between the two methods.

Submitted 8 July, 2019; originally announced July 2019.

arXiv:1702.07680

Consistent Alignment of Word Embedding Models

Authors: Cem Safak Sahin, Rajmonda S. Caceres, Brandon Oselio, William M. Campbell

Abstract: Word embedding models offer continuous vector representations that can capture rich contextual semantics based on their word co-occurrence patterns. While these word vectors can provide very effective features used in many NLP tasks such as clustering similar words and inferring learning relationships, many challenges and open research questions remain. In this paper, we propose a solution that aligns variations of the same model (or different models) in a joint low-dimensional latent space leveraging carefully generated synthetic data points. This generative process is inspired by the observation that a variety of linguistic relationships is captured by simple linear operations in embedded space. We demonstrate that our approach can lead to substantial improvements in recovering embeddings of local neighborhoods.

Submitted 24 February, 2017; originally announced February 2017.

Comments: 4 pages, 2 figures

Showing 1–5 of 5 results for author: Sahin, S