-
Nowcasting economic and social data: when and why search engine data fails, an illustration using Google Flu Trends
Authors:
Paul Ormerod,
Rickard Nyman,
R Alexander Bentley
Abstract:
Obtaining an accurate picture of the current state of the economy is particularly important to central banks and finance ministries, and of epidemics to health ministries. There is increasing interest in the use of search engine data to provide such 'nowcasts' of social and economic indicators. However, people may search for a phrase because they independently want the information, or they may sea…
▽ More
Obtaining an accurate picture of the current state of the economy is particularly important to central banks and finance ministries, and of epidemics to health ministries. There is increasing interest in the use of search engine data to provide such 'nowcasts' of social and economic indicators. However, people may search for a phrase because they independently want the information, or they may search simply because many others are searching for it. We consider the effect of the motivation for searching on the accuracy of forecasts made using search engine data of contemporaneous social and economic indicators. We illustrate the implications for forecasting accuracy using four episodes in which Google Flu Trends data gave accurate predictions of actual flu cases, and four in which the search data over-predicted considerably. Using a standard statistical methodology, the Bass diffusion model, we show that the independent search for information motive was much stronger in the cases of accurate prediction than in the inaccurate ones. Social influence, the fact that people may search for a phrase simply because many others are, was much stronger in the inaccurate compared to the accurate cases. Search engine data may therefore be an unreliable predictor of contemporaneous indicators when social influence on the decision to search is strong.
△ Less
Submitted 1 August, 2014;
originally announced August 2014.
-
Social network markets: the influence of network structure when consumers face decisions over many similar choices
Authors:
Paul Ormerod,
Bassel Tarbush,
R. Alexander Bentley
Abstract:
In social network markets, the act of consumer choice in these industries is governed not just by the set of incentives described by conventional consumer demand theory, but by the choices of others in which an individual's payoff is an explicit function of the actions of others. We observe two key empirical features of outcomes in social networked markets. First, a highly right-skewed, non-Gaussi…
▽ More
In social network markets, the act of consumer choice in these industries is governed not just by the set of incentives described by conventional consumer demand theory, but by the choices of others in which an individual's payoff is an explicit function of the actions of others. We observe two key empirical features of outcomes in social networked markets. First, a highly right-skewed, non-Gaussian distribution of the number of times competing alternatives are selected at a point in time. Second, there is turnover in the rankings of popularity over time. We show here that such outcomes can arise either when there is no alternative which exhibits inherent superiority in its attributes, or when agents find it very difficult to discern any differences in quality amongst the alternatives which are available so that it is as if no superiority exists. These features appear to obtain, as a reasonable approximation, in many social network markets. We examine the impact of network structure on both the rank-size distribution of choices at a point in time, and on the life spans of the most popular choices. We show that a key influence on outcomes is the extent to which the network follows a hierarchical structure. It is the social network properties of the markets, the meso-level structure, which determine outcomes rather than the objective attributes of the products.
△ Less
Submitted 5 October, 2012;
originally announced October 2012.
-
Ex ante prediction of cascade sizes on networks of agents facing binary outcomes
Authors:
Paul Ormerod,
Ellie Evans
Abstract:
We consider in this paper the potential for ex ante prediction of the cascade size in a model of binary choice with externalities (Schelling 1973, Watts 2002). Agents are connected on a network and can be in one of two states of the world, 0 or 1. Initially, all are in state 0 and a small number of seeds are selected at random to switch to state1. A simple threshold rule specifies whether other ag…
▽ More
We consider in this paper the potential for ex ante prediction of the cascade size in a model of binary choice with externalities (Schelling 1973, Watts 2002). Agents are connected on a network and can be in one of two states of the world, 0 or 1. Initially, all are in state 0 and a small number of seeds are selected at random to switch to state1. A simple threshold rule specifies whether other agents switch subsequently. The cascade size (the percolation) is the proportion of all agents which eventually switches to state 1. We select information on the connectivity of the initial seeds, the connectivity of the agents to which they are connected, the thresholds of these latter agents, and the thresholds of the agents to which these are connected. We obtain results for random, small world and scale -free networks with different network parameters and numbers of initial seeds. The results are robust with respect to these factors. We perform least squares regression of the logit transformation of the cascade size (Hosmer and Lemeshow 1989) on these potential explanatory variables. We find considerable explanatory power for the ex ante prediction of cascade sizes. For the random networks, on average 32 per cent of the variance of the cascade sizes is explained, 40 per cent for the small world and 46 per cent for the scale-free. The connectivity variables are hardly ever significant in the regressions, whether relating to the seeds themselves or to the agents connected to the seeds. In contrast, the information on the thresholds of agents contains much more explanatory power. This supports the conjecture of Watts and Dodds (2007.) that large cascades are driven by a small mass of easily influenced agents.
△ Less
Submitted 17 March, 2011;
originally announced March 2011.