-
Friends, Key Players and the Adoption and Use of Experience Goods
Authors:
Rhys Murrian,
Paul A. Raschky,
Klaus Ackermann
Abstract:
This paper empirically investigates how an individual's network influences their purchase and subsequent use of experience goods. Utilising data on the network and game-ownership of over 108 million users from the world's largest video game platform, we analyse whether a user's friendship network influences their decision to purchase single-player video games. Our identification strategy uses an i…
▽ More
This paper empirically investigates how an individual's network influences their purchase and subsequent use of experience goods. Utilising data on the network and game-ownership of over 108 million users from the world's largest video game platform, we analyse whether a user's friendship network influences their decision to purchase single-player video games. Our identification strategy uses an instrumental variable (IV) approach that employs the temporal lag of purchasing decisions from second degree friends. We find strong peer effects in the individual game adoption in the contemporary week. The effect is stronger if the friend who purchased the game is an old friend compared to a key player in the friendship network. Comparing the results to adoption decisions for a major label game, we find peer effects of a similar size and duration. However, the time subsequently spent playing the games is higher for players who were neither influenced by a peer who is a key player nor an old friend. Considering the increasing importance of online networks on consumption decisions, our findings offer some first insights on the heterogeneity of peer effects between old and key player friends and also provide evidence in consumers' biases in social learning.
△ Less
Submitted 22 September, 2024;
originally announced September 2024.
-
The Heterogeneous Productivity Effects of Generative AI
Authors:
David Kreitmeir,
Paul A. Raschky
Abstract:
We analyse the individual productivity effects of Italy's ban on ChatGPT, a generative pretrained transformer chatbot. We compile data on the daily coding output quantity and quality of over 36,000 GitHub users in Italy and other European countries and combine these data with the sudden announcement of the ban in a difference-in-differences framework. Among the affected users in Italy, we find a s…
▽ More
We analyse the individual productivity effects of Italy's ban on ChatGPT, a generative pretrained transformer chatbot. We compile data on the daily coding output quantity and quality of over 36,000 GitHub users in Italy and other European countries and combine these data with the sudden announcement of the ban in a difference-in-differences framework. Among the affected users in Italy, we find a short-term increase in output quantity and quality for less experienced users and a decrease in productivity on more routine tasks for experienced users.
△ Less
Submitted 2 June, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
The Unintended Consequences of Censoring Digital Technology -- Evidence from Italy's ChatGPT Ban
Authors:
David H. Kreitmeir,
Paul A. Raschky
Abstract:
We analyse the effects of the ban of ChatGPT, a generative pre-trained transformer chatbot, on individual productivity. We first compile data on the hourly coding output of over 8,000 professional GitHub users in Italy and other European countries to analyse the impact of the ban on individual productivity. Combining the high-frequency data with the sudden announcement of the ban in a difference-i…
▽ More
We analyse the effects of the ban of ChatGPT, a generative pre-trained transformer chatbot, on individual productivity. We first compile data on the hourly coding output of over 8,000 professional GitHub users in Italy and other European countries to analyse the impact of the ban on individual productivity. Combining the high-frequency data with the sudden announcement of the ban in a difference-in-differences framework, we find that the output of Italian developers decreased by around 50% in the first two business days after the ban and recovered after that. Applying a synthetic control approach to daily Google search and Tor usage data shows that the ban led to a significant increase in the use of censorship bypassing tools. Our findings show that users swiftly implement strategies to bypass Internet restrictions but this adaptation activity creates short-term disruptions and hampers productivity.
△ Less
Submitted 18 April, 2023;
originally announced April 2023.
-
Competing for Attention -- The Effect of Talk Radio on Elections and Political Polarization in the US
Authors:
Ashani Amarasinghe,
Paul A. Raschky
Abstract:
This paper studies the effects of talk radio, specifically the Rush Limbaugh Show, on electoral outcomes and attitude polarization in the U.S. We propose a novel identification strategy that considers the radio space in each county as a market where multiple stations are competing for listeners' attention. Our measure of competition is a spatial Herfindahl-Hirschman Index (HHI) in radio frequencie…
▽ More
This paper studies the effects of talk radio, specifically the Rush Limbaugh Show, on electoral outcomes and attitude polarization in the U.S. We propose a novel identification strategy that considers the radio space in each county as a market where multiple stations are competing for listeners' attention. Our measure of competition is a spatial Herfindahl-Hirschman Index (HHI) in radio frequencies. To address endogeneity concerns, we exploit the variation in competition based on accidental frequency overlaps in a county, conditional on the overall level of radio frequency competition. We find that counties with higher exposure to the Rush Limbaugh Show have a systematically higher vote share for Donald Trump in the 2016 and 2020 U.S. presidential elections. Combining our county-level Rush Limbaugh Show exposure measure with individual survey data reveals that self-identifying Republicans in counties with higher exposure to the Show express more conservative political views, while self-identifying Democrats in these same counties express more moderate political views. Taken together, these findings provide some of the first insights on the effects of contemporary talk radio on political outcomes, both at the aggregate and individual level.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Predicting Political Ideology from Digital Footprints
Authors:
Michael Kitchener,
Nandini Anantharama,
Simon D. Angus,
Paul A. Raschky
Abstract:
This paper proposes a new method to predict individual political ideology from digital footprints on one of the world's largest online discussion forum. We compiled a unique data set from the online discussion forum reddit that contains information on the political ideology of around 91,000 users as well as records of their comment frequency and the comments' text corpus in over 190,000 different…
▽ More
This paper proposes a new method to predict individual political ideology from digital footprints on one of the world's largest online discussion forum. We compiled a unique data set from the online discussion forum reddit that contains information on the political ideology of around 91,000 users as well as records of their comment frequency and the comments' text corpus in over 190,000 different subforums of interest. Applying a set of statistical learning approaches, we show that information about activity in non-political discussion forums alone, can very accurately predict a user's political ideology. Depending on the model, we are able to predict the economic dimension of ideology with an accuracy of up to 90.63% and the social dimension with and accuracy of up to 82.02%. In comparison, using the textual features from actual comments does not improve predictive accuracy. Our paper highlights the importance of revealed digital behaviour to complement stated preferences from digital communication when analysing human preferences and behaviour using online data.
△ Less
Submitted 1 June, 2022;
originally announced June 2022.
-
Estimating Sleep & Work Hours from Alternative Data by Segmented Functional Classification Analysis (SFCA)
Authors:
Klaus Ackermann,
Simon D. Angus,
Paul A. Raschky
Abstract:
Alternative data is increasingly adapted to predict human and economic behaviour. This paper introduces a new type of alternative data by re-conceptualising the internet as a data-driven insights platform at global scale. Using data from a unique internet activity and location dataset drawn from over 1.5 trillion observations of end-user internet connections, we construct a functional dataset cove…
▽ More
Alternative data is increasingly adapted to predict human and economic behaviour. This paper introduces a new type of alternative data by re-conceptualising the internet as a data-driven insights platform at global scale. Using data from a unique internet activity and location dataset drawn from over 1.5 trillion observations of end-user internet connections, we construct a functional dataset covering over 1,600 cities during a 7 year period with temporal resolution of just 15min. To predict accurate temporal patterns of sleep and work activity from this data-set, we develop a new technique, Segmented Functional Classification Analysis (SFCA), and compare its performance to a wide array of linear, functional, and classification methods. To confirm the wider applicability of SFCA, in a second application we predict sleep and work activity using SFCA from US city-wide electricity demand functional data. Across both problems, SFCA is shown to out-perform current methods.
△ Less
Submitted 15 October, 2020;
originally announced October 2020.
-
Object Recognition for Economic Development from Daytime Satellite Imagery
Authors:
Klaus Ackermann,
Alexey Chernikov,
Nandini Anantharama,
Miethy Zaman,
Paul A Raschky
Abstract:
Reliable data about the stock of physical capital and infrastructure in developing countries is typically very scarce. This is particular a problem for data at the subnational level where existing data is often outdated, not consistently measured or coverage is incomplete. Traditional data collection methods are time and labor-intensive costly, which often prohibits developing countries from colle…
▽ More
Reliable data about the stock of physical capital and infrastructure in developing countries is typically very scarce. This is particular a problem for data at the subnational level where existing data is often outdated, not consistently measured or coverage is incomplete. Traditional data collection methods are time and labor-intensive costly, which often prohibits developing countries from collecting this type of data. This paper proposes a novel method to extract infrastructure features from high-resolution satellite images. We collected high-resolution satellite images for 5 million 1km $\times$ 1km grid cells covering 21 African countries. We contribute to the growing body of literature in this area by training our machine learning algorithm on ground-truth data. We show that our approach strongly improves the predictive accuracy. Our methodology can build the foundation to then predict subnational indicators of economic development for areas where this data is either missing or unreliable.
△ Less
Submitted 11 September, 2020;
originally announced September 2020.
-
The Internet as Quantitative Social Science Platform: Insights from a Trillion Observations
Authors:
Klaus Ackermann,
Simon D Angus,
Paul A Raschky
Abstract:
With the large-scale penetration of the internet, for the first time, humanity has become linked by a single, open, communications platform. Harnessing this fact, we report insights arising from a unified internet activity and location dataset of an unparalleled scope and accuracy drawn from over a trillion (1.5$\times 10^{12}$) observations of end-user internet connections, with temporal resoluti…
▽ More
With the large-scale penetration of the internet, for the first time, humanity has become linked by a single, open, communications platform. Harnessing this fact, we report insights arising from a unified internet activity and location dataset of an unparalleled scope and accuracy drawn from over a trillion (1.5$\times 10^{12}$) observations of end-user internet connections, with temporal resolution of just 15min over 2006-2012. We first apply this dataset to the expansion of the internet itself over 1,647 urban agglomerations globally. We find that unique IP per capita counts reach saturation at approximately one IP per three people, and take, on average, 16.1 years to achieve; eclipsing the estimated 100- and 60- year saturation times for steam-power and electrification respectively. Next, we use intra-diurnal internet activity features to up-scale traditional over-night sleep observations, producing the first global estimate of over-night sleep duration in 645 cities over 7 years. We find statistically significant variation between continental, national and regional sleep durations including some evidence of global sleep duration convergence. Finally, we estimate the relationship between internet concentration and economic outcomes in 411 OECD regions and find that the internet's expansion is associated with negative or positive productivity gains, depending strongly on sectoral considerations. To our knowledge, our study is the first of its kind to use online/offline activity of the entire internet to infer social science insights, demonstrating the unparalleled potential of the internet as a social data-science platform.
△ Less
Submitted 19 January, 2017;
originally announced January 2017.