-
Beyond Sentiment: Examining the Role of Moral Foundations in User Engagement with News on Twitter
Authors:
Jacopo D'Ignazi,
Kyriaki Kalimeri,
Mariano G. Beiró
Abstract:
This study uses sentiment analysis and the Moral Foundations Theory (MFT) to characterise news content in social media and examine its association with user engagement. We employ Natural Language Processing to quantify the moral and affective linguistic markers. At the same time, we automatically define thematic macro areas of news from major U.S. news outlets and their Twitter followers (Jan 2020 - Mar 2021). By applying Non-Negative Matrix Factorisation to the obtained linguistic features we extract clusters of similar moral and affective profiles, and we identify the emotional and moral characteristics that best explain user engagement via regression modelling. We observe that Surprise, Trust, and Harm are crucial elements explaining user engagement and discussion length and that Twitter content from news media outlets has more explanatory power than their linked articles. We contribute actionable findings evidencing the potential impact of employing specific moral and affective nuances in public and journalistic discourse in today's communication landscape. In particular, our results emphasise the need to balance engagement strategies with potential priming risks in our evolving media landscape.
Submitted 17 February, 2025;
originally announced February 2025.
-
Characterizing User Behavior: The Interplay Between Mobility Patterns and Mobile Traffic
Authors:
Anne Josiane Kouam,
Aline Carneiro Viana,
Mariano G. Beiró,
Leo Ferres,
Luca Pappalardo
Abstract:
Mobile devices have become essential for capturing human activity, and eXtended Data Records (XDRs) offer rich opportunities for detailed user behavior modeling, which is useful for designing personalized digital services. Previous studies have primarily focused on aggregated mobile traffic and mobility analyses, often neglecting individual-level insights. This paper introduces a novel approach that explores the dependency between traffic and mobility behaviors at the user level. By analyzing 13 individual features that encompass traffic patterns and various mobility aspects, we enhance the understanding of how these behaviors interact. Our advanced user modeling framework integrates traffic and mobility behaviors over time, allowing for fine-grained dependencies while maintaining population heterogeneity through user-specific signatures. Furthermore, we develop a Markov model that infers traffic behavior from mobility and vice versa, prioritizing significant dependencies while addressing privacy concerns. Using a week-long XDR dataset from 1,337,719 users across several provinces in Chile, we validate our approach, demonstrating its robustness and applicability in accurately inferring user behavior and matching mobility and traffic profiles across diverse urban contexts.
Submitted 24 March, 2025; v1 submitted 31 January, 2025;
originally announced January 2025.
-
Who talks about what? Comparing the information treatment in traditional media with online discussions
Authors:
Hendrik Schawe,
Mariano Gastón Beiró,
J. Ignacio Alvarez-Hamelin,
Dimitris Kotzinos,
Laura Hernández
Abstract:
We study the dynamics of interactions between a traditional medium, the New York Times newspaper, and its followers on Twitter, using a massive dataset. It consists of the metadata of the articles published by the newspaper during the first year of the COVID-19 pandemic, and the posts published on Twitter by a large set of followers of the @nytimes account along with those published by a set of followers of several other media of different kinds. The dynamics of discussions held on Twitter by exclusive followers of a medium show a strong dependence on the medium they follow: the followers of @FoxNews show the highest similarity to each other and a strong differentiation of interests from the general group. Our results also reveal the difference in the attention paid to the U.S. presidential elections by the newspaper and by its followers, and show that the topic related to the "Black Lives Matter" movement started on Twitter and was addressed later by the newspaper.
Submitted 30 August, 2022;
originally announced August 2022.
-
Moral Narratives Around the Vaccination Debate on Facebook
Authors:
Mariano Gastón Beiró,
Jacopo D'Ignazi,
Maria Florencia Prado,
Victoria Perez Bustos,
Kyriaki Kalimeri
Abstract:
Vaccine hesitancy is a complex issue with psychological, cultural, and even societal factors entangled in the decision-making process. The narrative around this process is captured in our everyday interactions; social media data offer a direct and spontaneous view of people's argumentation. Here, we analysed more than 500,000 public posts and comments from Facebook Pages dedicated to the topic of vaccination to study the role of moral values and, in particular, the understudied role of the Liberty moral foundation from the actual user-generated text. We operationalise morality by employing the Moral Foundations Theory, while our proposed framework is based on recurrent neural network classifiers with a short memory and entity linking information. Our findings show that the principal moral narratives around the vaccination debate focus on the values of Liberty, Care, and Authority. Vaccine advocates urge compliance with the authorities as prosocial behaviour to protect society. On the other hand, vaccine sceptics mainly build their narrative around the value of Liberty, advocating for the right to choose freely whether or not to adhere to the vaccination. We contribute to the automatic understanding of vaccine hesitancy drivers emerging from user-generated text, providing concrete insights into the moral framing around vaccination decision-making. Especially in emergencies such as the Covid-19 pandemic, and in contrast to traditional surveys, these insights can be provided contemporaneously with the event, helping policymakers craft communication campaigns that adequately address the concerns of the hesitant population.
Submitted 15 March, 2023; v1 submitted 3 June, 2022;
originally announced June 2022.
-
Designing weighted and multiplex networks for deep learning user geolocation in Twitter
Authors:
Federico M. Funes,
José Ignacio Alvarez-Hamelin,
Mariano G. Beiró
Abstract:
Predicting the geographical location of users of social media like Twitter has found several applications in health surveillance, emergency monitoring, content personalization, and social studies in general. In this work we contribute to the research in this area by designing and evaluating new methods based on the literature of weighted multigraphs combined with state-of-the-art deep learning techniques. The explored methods start from a similar underlying structure (that of an extended mention and/or follower network) but use different information processing strategies, e.g., information diffusion through transductive and inductive algorithms -- RGCNs and GraphSAGE, respectively -- and node embeddings with Node2vec+. These graphs are then combined with attention mechanisms to incorporate the users' text view into the models. We assess the performance of each of these methods and compare them to baseline models on the publicly available Twitter-US dataset; we also make a new dataset available based on a large Twitter capture in Latin America. Finally, our work discusses the limitations and validity of the comparisons among methods in the context of different label definitions and metrics.
Submitted 13 December, 2021;
originally announced December 2021.
-
Evolution of the political opinion landscape during electoral periods
Authors:
Tomás Mussi Reyero,
Mariano G. Beiró,
J. Ignacio Alvarez-Hamelin,
Laura Hernández,
Dimitris Kotzinos
Abstract:
We present a study of the evolution of the political landscape during the 2015 and 2019 presidential elections in Argentina, based on the data obtained from the micro-blogging platform Twitter. We build a semantic network based on the hashtags used by all the users following at least one of the main candidates. With this network we can detect the topics that are discussed in society. Unlike most studies of opinion on social media, we do not choose the topics a priori; instead, they emerge naturally from the community structure of the semantic network. We assign to each user a dynamical topic vector which measures the evolution of her/his opinion in this space and allows us to monitor the similarities and differences among groups of supporters of different candidates. Our results show that the method is able to detect the dynamics of opinion formation on different topics and, in particular, it can capture the reshaping of the political opinion landscape which led to the reversal of the result between the two rounds of the 2015 election.
Submitted 18 November, 2020;
originally announced November 2020.
-
Learning language variations in news corpora through differential embeddings
Authors:
Carlos Selmo,
Julian F. Martinez,
Mariano G. Beiró,
J. Ignacio Alvarez-Hamelin
Abstract:
There is an increasing interest in the NLP community in capturing variations in the usage of language, either through time (i.e., semantic drift), across regions (as dialects or variants) or in different social contexts (i.e., professional or media technolects). Several successful dynamical embeddings have been proposed that can track semantic change through time. Here we show that a model with a central word representation and a slice-dependent contribution can learn word embeddings from different corpora simultaneously. This model is based on a star-like representation of the slices. We apply it to The New York Times and The Guardian newspapers, and we show that it can capture both temporal dynamics in the yearly slices of each corpus, and language variations between US and UK English in a curated multi-source corpus. We provide an extensive evaluation of this methodology.
Submitted 13 November, 2020;
originally announced November 2020.
-
Evaluation of Biases in Self-reported Demographic and Psychometric Information: Traditional versus Facebook-based Surveys
Authors:
Kyriaki Kalimeri,
Mariano G. Beiro,
Andrea Bonanomi,
Alessandro Rosina,
Ciro Cattuto
Abstract:
Social media in scientific research offer a unique digital observatory of human behaviours and hence great opportunities to conduct research at large scale answering complex sociodemographic questions. We focus on the identification and assessment of biases in social media administered surveys. This study aims to shed light on population, self-selection and behavioural biases, empirically comparing the consistency between self-reported information collected traditionally versus social media administered questionnaires, including demographic and psychometric attributes. We engaged a demographically representative cohort of young adults in Italy (approximately 4,000 participants) in taking a traditionally administered online survey and then, after one year, we invited them to use our ad hoc Facebook application (988 accepted) where they filled in part of the initial survey. We assess the statistically significant differences indicating population, self-selection, and behavioural biases due to the different context in which the questionnaire is administered. Our findings suggest that surveys administered on Facebook do not exhibit major biases with respect to traditionally administered surveys, either in terms of demographics or personality traits. Loyalty, authority, and social binding values were higher on the Facebook platform, probably due to the platform's intrinsic social character. We conclude that Facebook apps are valid research tools for administering demographic and psychometric surveys provided that the entailed biases are taken into consideration. We contribute to the characterisation of Facebook apps as a valid scientific tool to administer demographic and psychometric surveys, and to the assessment of population, self-selection, and behavioural biases in the collected data.
Submitted 23 January, 2019;
originally announced January 2019.
-
Shopping Mall Attraction and Social Mixing at a City Scale
Authors:
Mariano G. Beiró,
Loreto Bravo,
Diego Caro,
Ciro Cattuto,
Leo Ferres,
Eduardo Graells-Garrido
Abstract:
The social inclusion aspects of shopping malls and their effects on our understanding of urban spaces have been a controversial topic widely discussed in the literature. Shopping malls offer an open, safe and democratic version of the public space. Many of their detractors suggest that malls target their customers in subtle ways, promoting social exclusion. In this work, we analyze whether malls offer opportunities for social mixing by analyzing the patterns of shopping mall visits in a large Latin-American city: Santiago de Chile.
We use a large XDR (Data Detail Records) dataset from a telecommunication company to analyze the mobility of $387,152$ cell phones around $16$ large malls in Santiago de Chile during one month. We model the influx of people to malls in terms of a gravity model of mobility, and we are able to predict the customer profile distribution of each mall, explaining it in terms of mall location, the population distribution, and mall size.
Then, we analyze the concept of social attraction, expressed as people from low and middle classes being attracted by malls that target high-income customers. We include a social attraction factor in our model and find that it is negligible in the process of choosing a mall. We observe that social mixing arises only in peripheral malls located farthest from the city center, which both low and middle class people visit. Using a co-visitation model we show that people tend to choose a restricted profile of malls according to their socio-economic status and their distance from the mall. We conclude that the potential for social mixing in malls could be capitalized by designing public policies regarding transportation and mobility.
Submitted 9 February, 2018; v1 submitted 31 January, 2018;
originally announced February 2018.
-
Predicting Demographics, Moral Foundations, and Human Values from Digital Behaviors
Authors:
Kyriaki Kalimeri,
Mariano G. Beiro,
Matteo Delfino,
Robert Raleigh,
Ciro Cattuto
Abstract:
Personal electronic devices including smartphones give access to behavioural signals that can be used to learn about the characteristics and preferences of individuals. In this study, we explore the connection between demographic and psychological attributes and the digital behavioural records, for a cohort of 7,633 people, closely representative of the US population with respect to gender, age, geographical distribution, education, and income. Along with the demographic data, we collected self-reported assessments on validated psychometric questionnaires for moral traits and basic human values and combined this information with passively collected multi-modal digital data from web browsing behaviour and smartphone usage. A machine learning framework was then designed to infer both the demographic and psychological attributes from the behavioural data. In a cross-validated setting, our models predicted demographic attributes with good accuracy as measured by the weighted AUROC score (Area Under the Receiver Operating Characteristic), but were less performant for the moral traits and human values. These results call for further investigation since they are still far from unveiling individuals' psychological fabric. This connection, along with the most predictive features that we provide for each attribute, might prove useful for designing personalised services, communication strategies, and interventions, and can be used to sketch a portrait of people with a similar worldview.
Submitted 21 November, 2018; v1 submitted 5 December, 2017;
originally announced December 2017.
-
Predicting human mobility through the assimilation of social media traces into mobility models
Authors:
M. G. Beiró,
A. Panisson,
M. Tizzoni,
C. Cattuto
Abstract:
Predicting human mobility flows at different spatial scales is challenged by the heterogeneity of individual trajectories and the multi-scale nature of transportation networks. As vast amounts of digital traces of human behaviour become available, an opportunity arises to improve mobility models by integrating into them proxy data on mobility collected by a variety of digital platforms and location-aware services. Here we propose a hybrid model of human mobility that integrates a large-scale publicly available dataset from a popular photo-sharing system with the classical gravity model, under a stacked regression procedure. We validate the performance and generalizability of our approach using two ground-truth datasets on air travel and daily commuting in the United States: using two different cross-validation schemes we show that the hybrid model affords enhanced mobility prediction at both spatial scales.
Submitted 5 February, 2016; v1 submitted 18 January, 2016;
originally announced January 2016.
-
Router-level community structure of the Internet Autonomous Systems
Authors:
Mariano G. Beiró,
Sebastián P. Grynberg,
J. Ignacio Alvarez-Hamelin
Abstract:
The Internet is composed of routing devices connected to each other and organized into independent administrative entities: the Autonomous Systems. The existence of different types of Autonomous Systems (like large connectivity providers, Internet Service Providers or universities), together with geographical and economic constraints, turns the Internet into a complex modular and hierarchical network. This organization is reflected in many properties of the Internet topology, like its high degree of clustering and its robustness.
In this work, we study the modular structure of the Internet router-level graph in order to assess to what extent the Autonomous Systems satisfy some of the known notions of community structure. We show that the modular structure of the Internet is much richer than what can be captured by the current community detection methods, which are severely affected by resolution limits and by the heterogeneity of the Autonomous Systems. Here we overcome this issue by using a multiresolution detection algorithm combined with a small sample of nodes. We also discuss recent work on community structure in the light of our results.
Submitted 27 March, 2015; v1 submitted 25 March, 2015;
originally announced March 2015.
-
Deciphering the global organization of clustering in real complex networks
Authors:
Pol Colomer-de-Simon,
M. Angeles Serrano,
Mariano G. Beiro,
J. Ignacio Alvarez-Hamelin,
Marian Boguna
Abstract:
We uncover the global organization of clustering in real complex networks. As with other fundamental properties of networks such as the degree distribution, we find that real networks are neither completely random nor ordered with respect to clustering, although they tend to be closer to maximally random architectures. We reach this conclusion by comparing the global structure of clustering in real networks with that in maximally random and in maximally ordered clustered graphs. The former are produced with an exponential random graph model that maintains correlations among adjacent edges at the minimum needed to conform with the expected clustering spectrum; the latter with a random model that arranges triangles in cliques, inducing highly ordered structures. To compare the global organization of clustering in real and model networks, we compute $m$-core landscapes, where the $m$-core is defined, akin to the $k$-core, as the maximal subgraph whose edges each participate in at least $m$ triangles. This property defines a set of nested subgraphs that, unlike $k$-cores, is able to distinguish between hierarchical and modular architectures. To visualize the $m$-core decomposition we developed the LaNet-vi 3.0 tool.
Submitted 1 June, 2013;
originally announced June 2013.
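The $m$-core definition given in the abstract above can be illustrated with a simple edge-peeling routine. This is only a minimal sketch under stated assumptions: it is not the authors' implementation and is unrelated to the LaNet-vi code; the function name `m_core` and the toy graph are hypothetical, and the graph is assumed to be simple and undirected.

```python
from itertools import combinations


def m_core(edges, m):
    """Return the edge set of the m-core: the maximal subgraph in which
    every edge takes part in at least m triangles of that subgraph.
    (Hypothetical helper written for illustration only.)"""
    # Build neighbour sets from the undirected edge list, ignoring self-loops.
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    # Repeatedly peel edges supported by fewer than m triangles; removing
    # one edge can lower the support of others, so loop until stable.
    changed = True
    while changed:
        changed = False
        for u in list(adj):
            for v in list(adj[u]):
                # Common neighbours of u and v = triangles through edge (u, v).
                if len(adj[u] & adj[v]) < m:
                    adj[u].discard(v)
                    adj[v].discard(u)
                    changed = True

    return {frozenset((u, v)) for u in adj for v in adj[u]}


# Toy graph (an assumption for illustration): a 4-clique on nodes 0-3,
# a triangle 3-4-5 attached to it, and a pendant edge 5-6.
edges = list(combinations(range(4), 2)) + [(3, 4), (4, 5), (3, 5), (5, 6)]

one_core = m_core(edges, 1)   # drops only the pendant edge
two_core = m_core(edges, 2)   # keeps only the 4-clique
three_core = m_core(edges, 3) # nothing survives
```

The nested results (each core contained in the previous one) mirror the "set of nested subgraphs" mentioned in the abstract; the peeling loop is a straightforward, unoptimised way to obtain them.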
-
Obtaining Communities with a Fitness Growth Process
Authors:
Mariano G. Beiró,
Jorge R. Busch,
Sebastian P. Grynberg,
J. Ignacio Alvarez-Hamelin
Abstract:
The study of community structure has been a hot topic of research in recent years. However, while it has been successfully applied in several areas, the concept lacks a general and precise definition. Facts like the hierarchical structure and heterogeneity of complex networks make it difficult to unify the idea of community and its evaluation. The global functional known as modularity is probably the most widely used technique in this area. Nevertheless, its limits have been deeply studied. Local techniques such as those by Lancichinetti et al. and Palla et al. arose as an answer to the resolution limit and degeneracies of modularity.
Here we start from the algorithm by Lancichinetti et al. and propose a unique growth process for a fitness function that, while being local, finds a community partition that covers the whole network, updating the scale parameter dynamically. We test the quality of our results by using a set of benchmarks of heterogeneous graphs. We discuss alternative measures for evaluating the community structure and, in the light of them, infer possible explanations for the better performance of local methods compared to global ones in these cases.
Submitted 6 June, 2012;
originally announced June 2012.