-
US Presidential Election 2012 Prediction using Census Corrected Twitter Model
Authors:
Murphy Choy,
Michelle Cheong,
Ma Nang Laik,
Koo Ping Shung
Abstract:
The 2012 US presidential election has been a very tight race, with an intense battle between the two key candidates. The election reflects the sentiment of the electorate towards the achievements of the incumbent, President Obama. The campaign lasted several months, and its effects can be felt on the internet and on Twitter. The presidential debates injected new vigor into the challenger's campaign and successfully captured the electorate of several states, posing a threat to the incumbent's position. Much of the sentiment in the election has been captured in online discussions. In this paper, we use the original model described in Choy et al. (2011), applied to Twitter data, to forecast the next US president.
Submitted 11 November, 2012; v1 submitted 5 November, 2012;
originally announced November 2012.
-
Modified Entropy Measure for Detection of Association Rules Under Simpson's Paradox Context
Authors:
Murphy Choy,
Cally Claire Ong,
Michelle Cheong
Abstract:
The rapid explosion in retail data calls for more effective and efficient discovery of association rules to develop relevant business strategies and rules. Unlike online shopping sites, most brick-and-mortar retail shops are located in geographically and demographically diverse areas. This diversity presents a new challenge to the classical association rule model, which assumes a homogeneous group of customers behaving differently. This paper focuses on the discovery of association rules that are hidden as a result of geographically and demographically diverse data. We introduce a novel measure that combines the entropy measure with modified weighting to detect association rules missed by the standard measures due to Simpson's paradox. The proposed measure is evaluated using a real-world case study involving a major retailer of fashion goods in a traditional brick-and-mortar setting.
Submitted 3 October, 2012;
originally announced October 2012.
-
Identification of Demand through Statistical Distribution Modeling for Improved Demand Forecasting
Authors:
Murphy Choy,
Michelle L. F. Cheong
Abstract:
Demand functions for goods are generally cyclical in nature, with characteristics such as trend or stochasticity. Most existing demand forecasting techniques in the literature are designed to manage and forecast this type of demand function. However, if the demand function is lumpy in nature, the general demand forecasting techniques may fail given the unusual characteristics of the function. Proper identification of the underlying demand function and use of the most appropriate forecasting technique then become critical. In this paper, we explore the key characteristics of the different types of demand function and relate them to known statistical distributions. By fitting statistical distributions to actual past demand data, we are able to identify the correct demand function, so that the most appropriate forecasting technique can be applied to obtain improved forecasting results. We applied the methodology to a real case study to show the reduction in forecasting errors obtained.
Submitted 30 September, 2011;
originally announced October 2011.
-
A sentiment analysis of Singapore Presidential Election 2011 using Twitter data with census correction
Authors:
Murphy Choy,
Michelle L. F. Cheong,
Ma Nang Laik,
Koo Ping Shung
Abstract:
Sentiment analysis is a new area in text analytics that focuses on analyzing and understanding emotions from text patterns. This new form of analysis has been widely adopted in customer relationship management, especially in the context of complaint management. With increasing interest in this technology, more and more companies are adopting it and using it to champion their marketing efforts. However, sentiment analysis using Twitter has remained extremely difficult to manage due to sampling bias. In this paper, we discuss the application of reweighting techniques in conjunction with online sentiment divisions to predict the vote percentage that each candidate will receive. There is an in-depth discussion of the various aspects of using sentiment analysis to predict outcomes, as well as the potential pitfalls in the estimation due to the anonymous nature of the internet.
Submitted 29 August, 2011;
originally announced August 2011.