Skip to main content

Showing 1–25 of 25 results for author: Hidalgo, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13880  [pdf

    econ.GN cs.SI physics.soc-ph

    The Software Complexity of Nations

    Authors: Sándor Juhász, Johannes Wachs, Jermain Kaminski, César A. Hidalgo

    Abstract: Despite the growing importance of the digital sector, research on economic complexity and its implications continues to rely mostly on administrative records, e.g. data on exports, patents, and employment, that fail to capture the nuances of the digital economy. In this paper we use data on the geography of programming languages used in open-source software projects to extend economic complexity i… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2405.03452  [pdf

    cs.CY cs.AI cs.CL

    Large Language Models (LLMs) as Agents for Augmented Democracy

    Authors: Jairo Gudiño-Rosero, Umberto Grandi, César A. Hidalgo

    Abstract: We explore an augmented democracy system built on off-the-shelf LLMs fine-tuned to augment data on citizen's preferences elicited over policies extracted from the government programs of the two main candidates of Brazil's 2022 presidential election. We use a train-test cross-validation setup to estimate the accuracy with which the LLMs predict both: a subject's individual political choices and the… ▽ More

    Submitted 30 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 24 pages main manuscript with 4 figures. 13 pages of supplementary material

  3. arXiv:2308.02491  [pdf, other

    econ.GN cs.LG q-fin.ST

    Mapping Global Value Chains at the Product Level

    Authors: Lea Karbevska, César A. Hidalgo

    Abstract: Value chain data is crucial to navigate economic disruptions, such as those caused by the COVID-19 pandemic and the war in Ukraine. Yet, despite its importance, publicly available value chain datasets, such as the ``World Input-Output Database'', ``Inter-Country Input-Output Tables'', ``EXIOBASE'' or the ``EORA'', lack detailed information about products (e.g. Radio Receivers, Telephones, Electric… ▽ More

    Submitted 12 June, 2023; originally announced August 2023.

  4. arXiv:2306.08511  [pdf, other

    cs.MA cs.AI cs.CY

    Measuring and Controlling Divisiveness in Rank Aggregation

    Authors: Rachael Colley, Umberto Grandi, César Hidalgo, Mariana Macedo, Carlos Navarrete

    Abstract: In rank aggregation, members of a population rank issues to decide which are collectively preferred. We focus instead on identifying divisive issues that express disagreements among the preferences of individuals. We analyse the properties of our divisiveness measures and their relation to existing notions of polarisation. We also study their robustness under incomplete preferences and algorithms… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 25 pages, 8 figures

  5. Understanding Political Divisiveness using Online Participation data from the 2022 French and Brazilian Presidential Elections

    Authors: Carlos Navarrete, Mariana Macedo, Rachael Colley, Jingling Zhang, Nicole Ferrada, Maria Eduarda Mello, Rodrigo Lira, Carmelo Bastos-Filho, Umberto Grandi, Jerome Lang, César A. Hidalgo

    Abstract: Digital technologies can augment civic participation by facilitating the expression of detailed political preferences. Yet, digital participation efforts often rely on methods optimized for elections involving a few candidates. Here we present data collected in an online experiment where participants built personalized government programs by combining policies proposed by the candidates of the 202… ▽ More

    Submitted 25 October, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: 29 pages main manuscript with 5 figures. 55 pages of supplementary material

  6. arXiv:2210.10081  [pdf, other

    cs.CY cs.AI cs.HC

    Why people judge humans differently from machines: The role of perceived agency and experience

    Authors: Jingling Zhang, Jane Conway, César A. Hidalgo

    Abstract: People are known to judge artificial intelligence using a utilitarian moral philosophy and humans using a moral philosophy emphasizing perceived intentions. But why do people judge humans and machines differently? Psychology suggests that people may have different mind perception models of humans and machines, and thus, will treat human-like robots more similarly to the way they treat humans. Here… ▽ More

    Submitted 19 September, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: 8 pages, 3 figures

  7. arXiv:2209.08382  [pdf

    econ.GN cond-mat.stat-mech cs.CY

    Multidimensional Economic Complexity and Inclusive Green Growth

    Authors: Viktor Stojkoski, Philipp Koch, César A. Hidalgo

    Abstract: To achieve inclusive green growth, countries need to consider a multiplicity of economic, social, and environmental factors. These are often captured by metrics of economic complexity derived from the geography of trade, thus missing key information on innovative activities. To bridge this gap, we combine trade data with data on patent applications and research publications to build models that si… ▽ More

    Submitted 21 April, 2023; v1 submitted 17 September, 2022; originally announced September 2022.

    Journal ref: Communications Earth & Environment volume 4, Article number: 130 (2023)

  8. arXiv:2205.02164  [pdf

    econ.GN cond-mat.stat-mech cs.CY

    The Policy Implications of Economic Complexity

    Authors: César A. Hidalgo

    Abstract: In recent years economic complexity has grown into an active field of fundamental and applied research. Yet, despite important advances, the policy implications of economic complexity remain unclear or misunderstood. Here I organize the policy implications of economic complexity in a framework grounded on 4 Ws: what approaches, focused on identifying target activities and/or locations; when approa… ▽ More

    Submitted 7 August, 2023; v1 submitted 4 May, 2022; originally announced May 2022.

    Journal ref: Research Policy, 52(9), 104863 (2023)

  9. arXiv:2204.01483  [pdf, other

    cs.CY cs.LG stat.AP

    Assessing dengue fever risk in Costa Rica by using climate variables and machine learning techniques

    Authors: Luis A. Barboza, Shu-Wei Chou, Paola Vásquez, Yury E. García, Juan G. Calvo, Hugo C. Hidalgo, Fabio Sanchez

    Abstract: Dengue fever is a vector-borne disease mostly endemic to tropical and subtropical countries that affect millions every year and is considered a significant burden for public health. Its geographic distribution makes it highly sensitive to climate conditions. Here, we explore the effect of climate variables using the Generalized Additive Model for location, scale, and shape (GAMLSS) and Random Fore… ▽ More

    Submitted 23 March, 2022; originally announced April 2022.

    Comments: 13 pages, 4 figures

  10. arXiv:1909.11713  [pdf, other

    physics.soc-ph cs.SI physics.app-ph physics.data-an

    Strategic reciprocity improves academic performance in public elementary school children

    Authors: Cristian Candia, Víctor Landaeta-Torres, César A. Hidalgo, Carlos Rodriguez-Sickert

    Abstract: Social networks are pivotal for learning. Yet, we still lack a full understanding of the mechanisms connecting networks with learning outcomes. Here, we present the results of a large scale study (946 elementary school children from 45 different classrooms) designed to understand the social strategies used by elementary school children. We mapped the social networks of students using both, a non-a… ▽ More

    Submitted 29 September, 2019; v1 submitted 25 September, 2019; originally announced September 2019.

  11. arXiv:1905.10688  [pdf, other

    cs.LG cs.DB cs.IR stat.ML

    Sherlock: A Deep Learning Approach to Semantic Data Type Detection

    Authors: Madelon Hulsebos, Kevin Hu, Michiel Bakker, Emanuel Zgraggen, Arvind Satyanarayan, Tim Kraska, Çağatay Demiralp, César Hidalgo

    Abstract: Correctly detecting the semantic type of data columns is crucial for data science tasks such as automated data cleaning, schema matching, and data discovery. Existing data preparation and analysis systems rely on dictionary lookups and regular expression matching to detect semantic types. However, these matching-based approaches often are not robust to dirty data and only detect a limited number o… ▽ More

    Submitted 25 May, 2019; originally announced May 2019.

    Comments: KDD'19

  12. arXiv:1905.04616  [pdf, other

    cs.HC cs.DB cs.LG

    VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository

    Authors: Kevin Hu, Neil Gaikwad, Michiel Bakker, Madelon Hulsebos, Emanuel Zgraggen, César Hidalgo, Tim Kraska, Guoliang Li, Arvind Satyanarayan, Çağatay Demiralp

    Abstract: Researchers currently rely on ad hoc datasets to train automated visualization tools and evaluate the effectiveness of visualization designs. These exemplars often lack the characteristics of real-world datasets, and their one-off nature makes it difficult to compare different techniques. In this paper, we present VizNet: a large-scale corpus of over 31 million datasets compiled from open data rep… ▽ More

    Submitted 11 May, 2019; originally announced May 2019.

    Comments: CHI'19

  13. Computational Aspects of Optimal Strategic Network Diffusion

    Authors: Marcin Waniek, Khaled Elbassioni, Flavio L. Pinheiro, Cesar A. Hidalgo, Aamena Alshamsi

    Abstract: Diffusion on complex networks is often modeled as a stochastic process. Yet, recent work on strategic diffusion emphasizes the decision power of agents and treats diffusion as a strategic problem. Here we study the computational aspects of strategic diffusion, i.e., finding the optimal sequence of nodes to activate a network in the minimum time. We prove that finding an optimal solution to this pr… ▽ More

    Submitted 30 January, 2020; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: 21 pages, 5 figures

    MSC Class: 68Q17 (Primary) 05C82 (Secondary) ACM Class: F.2.2; G.2.2

  14. arXiv:1808.04819  [pdf, other

    cs.HC cs.AI cs.LG

    VizML: A Machine Learning Approach to Visualization Recommendation

    Authors: Kevin Z. Hu, Michiel A. Bakker, Stephen Li, Tim Kraska, César A. Hidalgo

    Abstract: Data visualization should be accessible for all analysts with data, not just the few with technical expertise. Visualization recommender systems aim to lower the barrier to exploring basic visualizations by automatically generating results for analysts to search and select, rather than manually specify. Here, we demonstrate a novel machine learning-based approach to visualization recommendation th… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

  15. arXiv:1807.07887  [pdf

    physics.soc-ph cs.SI

    Complex Economic Activities Concentrate in Large Cities

    Authors: Pierre-Alexandre Balland, Cristian Jara-Figueroa, Sergio Petralia, Mathieu Steijn, David Rigby, Cesar A. Hidalgo

    Abstract: Why do some economic activities agglomerate more than others? And, why does the agglomeration of some economic activities continue to increase despite recent developments in communication and transportation technologies? In this paper, we present evidence that complex economic activities concentrate more in large cities. We find this to be true for technologies, scientific publications, industries… ▽ More

    Submitted 20 July, 2018; originally announced July 2018.

  16. arXiv:1705.00232  [pdf, other

    physics.soc-ph cs.SI

    Optimal diversification strategies in the networks of related products and of related research areas

    Authors: Aamena Alshamsi, Flavio L. Pinheiro, Cesar A. Hidalgo

    Abstract: Countries and cities are likely to enter economic activities that are related to those that are already present in them. Yet, while these path dependencies are universally acknowledged, we lack an understanding of the diversification strategies that can optimally balance the development of related and unrelated activities. Here, we develop algorithms to identify the activities that are optimal to… ▽ More

    Submitted 9 March, 2018; v1 submitted 29 April, 2017; originally announced May 2017.

    Comments: 3 Figures, 9 Pages, 32 References, Accepted at Nature Communications

  17. arXiv:1608.01769  [pdf, other

    cs.CV

    Deep Learning the City : Quantifying Urban Perception At A Global Scale

    Authors: Abhimanyu Dubey, Nikhil Naik, Devi Parikh, Ramesh Raskar, César A. Hidalgo

    Abstract: Computer vision methods that quantify the perception of urban environment are increasingly being used to study the relationship between a city's physical appearance and the behavior and health of its residents. Yet, the throughput of current methods is too limited to quantify the perception of cities across the world. To tackle this challenge, we introduce a new crowdsourced dataset containing 110… ▽ More

    Submitted 12 September, 2016; v1 submitted 5 August, 2016; originally announced August 2016.

    Comments: 23 pages, 8 figures. Accepted to the European Conference on Computer Vision (ECCV), 2016

  18. arXiv:1608.00462  [pdf, other

    cs.CY cs.SI physics.soc-ph

    Are Safer Looking Neighborhoods More Lively? A Multimodal Investigation into Urban Life

    Authors: Marco De Nadai, Radu L. Vieriu, Gloria Zen, Stefan Dragicevic, Nikhil Naik, Michele Caraviello, Cesar A. Hidalgo, Nicu Sebe, Bruno Lepri

    Abstract: Policy makers, urban planners, architects, sociologists, and economists are interested in creating urban areas that are both lively and safe. But are the safety and liveliness of neighborhoods independent characteristics? Or are they just two sides of the same coin? In a world where people avoid unsafe looking places, neighborhoods that look unsafe will be less lively, and will fail to harness the… ▽ More

    Submitted 1 August, 2016; originally announced August 2016.

    Comments: To appear in the Proceedings of ACM Multimedia Conference (MM), 2016. October 15 - 19, 2016, Amsterdam, Netherlands

  19. arXiv:1602.08409  [pdf

    cs.DL cs.SI physics.soc-ph

    The Research Space: using the career paths of scholars to predict the evolution of the research output of individuals, institutions, and nations

    Authors: Miguel R. Guevara, Dominik Hartmann, Manuel Aristarán, Marcelo Mendoza, César A. Hidalgo

    Abstract: In recent years scholars have built maps of science by connecting the academic fields that cite each other, are cited together, or that cite a similar literature. But since scholars cannot always publish in the fields they cite, or that cite them, these science maps are only rough proxies for the potential of a scholar, organization, or country, to enter a new academic field. Here we use a large d… ▽ More

    Submitted 14 April, 2016; v1 submitted 26 February, 2016; originally announced February 2016.

  20. arXiv:1512.05020  [pdf, other

    cs.CY

    How the medium shapes the message: Printing and the rise of the arts and sciences

    Authors: C. Jara-Figueroa, Amy Z. Yu, Cesar A. Hidalgo

    Abstract: Communication technologies, from printing to social media, affect our historical records by changing the way ideas are spread and recorded. Yet, finding statistical instruments to address the endogeneity of this relationship has been problematic. Here we use a city's distance to Mainz as an instrument for the introduction of the printing press in European cities, together with data on nearly 50 th… ▽ More

    Submitted 9 August, 2017; v1 submitted 15 December, 2015; originally announced December 2015.

  21. arXiv:1511.03981  [pdf

    physics.soc-ph cs.SI

    Disconnected, fragmented, or united? A trans-disciplinary review of network science

    Authors: Cesar A. Hidalgo

    Abstract: During decades the study of networks has been divided between the efforts of social scientists and natural scientists, two groups of scholars who often do not see eye to eye. In this review I present an effort to mutually translate the work conducted by scholars from both of these academic fronts hoping to continue to unify what has become a diverging body of literature. I argue that social and na… ▽ More

    Submitted 15 July, 2016; v1 submitted 12 November, 2015; originally announced November 2015.

  22. arXiv:1502.07310  [pdf

    physics.soc-ph cs.SI

    Pantheon 1.0, a manually verified dataset of globally famous biographies

    Authors: Amy Zhao Yu, Shahar Ronen, Kevin Hu, Tiffany Lu, César A. Hidalgo

    Abstract: We present the Pantheon 1.0 dataset: a manually verified dataset of individuals that have transcended linguistic, temporal, and geographic boundaries. The Pantheon 1.0 dataset includes the 11,341 biographies present in more than 25 languages in Wikipedia and is enriched with: (i) manually verified demographic information (place and date of birth, gender) (ii) a taxonomy of occupations classifying… ▽ More

    Submitted 5 January, 2016; v1 submitted 25 February, 2015; originally announced February 2015.

    Comments: Scientific Data 2:150075

  23. arXiv:1403.2708  [pdf, other

    physics.soc-ph cs.SI q-bio.PE

    Beyond network structure: How heterogenous susceptibility modulates the spread of epidemics

    Authors: Daniel Smilkov, Cesar A. Hidalgo, Ljupco Kocarev

    Abstract: The compartmental models used to study epidemic spreading often assume the same susceptibility for all individuals, and are therefore, agnostic about the effects that differences in susceptibility can have on epidemic spreading. Here we show that--for the SIS model--differential susceptibility can make networks more vulnerable to the spread of diseases when the correlation between a node's degree… ▽ More

    Submitted 10 March, 2014; originally announced March 2014.

    Comments: 13 pages, 2 figures

  24. arXiv:0906.4567  [pdf

    physics.data-an cs.NI physics.soc-ph

    Understanding the spreading patterns of mobile phone viruses

    Authors: P. Wang, M. Gonzalez, C. A. Hidalgo, A. -L. Barabasi

    Abstract: We model the mobility of mobile phone users to study the fundamental spreading patterns characterizing a mobile virus outbreak. We find that while Bluetooth viruses can reach all susceptible handsets with time, they spread slowly due to human mobility, offering ample opportunities to deploy antiviral software. In contrast, viruses utilizing multimedia messaging services could infect all users in… ▽ More

    Submitted 24 June, 2009; originally announced June 2009.

    Comments: 13 pages, 4 figures

    Journal ref: Science 324, 1071-1076 (2009)

  25. arXiv:0806.1256  [pdf, ps, other

    physics.soc-ph cond-mat.stat-mech cs.CY physics.bio-ph

    Understanding individual human mobility patterns

    Authors: M. C. Gonzalez, C. A. Hidalgo, A. -L. Barabasi

    Abstract: Despite their importance for urban planning, traffic forecasting, and the spread of biological and mobile viruses, our understanding of the basic laws governing human motion remains limited thanks to the lack of tools to monitor the time resolved location of individuals. Here we study the trajectory of 100,000 anonymized mobile phone users whose position is tracked for a six month period. We fin… ▽ More

    Submitted 6 June, 2008; originally announced June 2008.

    Comments: Supporting Webpage: http://www.nd.edu/~mgonza16/Marta'sHomepage_files/nature2008/research.html

    Journal ref: Nature 453, 479-482 (2008)