Showing 1–10 of 10 results for author: Amaral, L A N

Searching in archive cs.

  1. arXiv:2505.11555 [pdf, ps, other]

    physics.soc-ph cs.CY

    Breaking the Code: Multi-level Learning in the Eurovision Song Contest

    Authors: Luís A. Nunes Amaral, Arthur Capozzi, Dirk Helbing

    Abstract: Organizations learn from the market, political, and societal responses to their actions. While in some cases both the actions and responses take place in an open manner, in many others, some aspects may be hidden from external observers. The Eurovision Song Contest offers an interesting example to study organizational level learning at two levels: organizers and participants. We find evidence for…

    Submitted 15 May, 2025; originally announced May 2025.

  2. arXiv:1902.00716 [pdf, other]

    physics.soc-ph cs.SI

    Centrality anomalies in complex networks as a result of model over-simplification

    Authors: Luiz G. A. Alves, Alberto Aleta, Francisco A. Rodrigues, Yamir Moreno, Luis A. Nunes Amaral

    Abstract: Tremendous advances have been made in our understanding of the properties and evolution of complex networks. These advances were initially driven by information-poor empirical networks and theoretical analysis of unweighted and undirected graphs. Recently, information-rich empirical data complex networks supported the development of more sophisticated models that include edge directionality and we…

    Submitted 13 March, 2020; v1 submitted 2 February, 2019; originally announced February 2019.

    Comments: 14 pages, including 9 figures. APS style. Accepted for publication in New Journal of Physics

    Journal ref: New Journal of Physics 23, 013043 (2020)

  3. arXiv:1901.09848 [pdf, other]

    cs.CL cs.LG physics.soc-ph

    A new evaluation framework for topic modeling algorithms based on synthetic corpora

    Authors: Hanyu Shi, Martin Gerlach, Isabel Diersen, Doug Downey, Luis A. N. Amaral

    Abstract: Topic models are in widespread use in natural language processing and beyond. Here, we propose a new framework for the evaluation of probabilistic topic modeling algorithms based on synthetic corpora containing an unambiguously defined ground truth topic structure. The major innovation of our approach is the ability to quantify the agreement between the planted and inferred topic structures by com…

    Submitted 28 January, 2019; originally announced January 2019.

    Comments: accepted for AISTATS 2019; code available at https://github.com/amarallab/synthetic_benchmark_topic_model; Main text (11 pages, 5 figures) and Supplementary Material (14 pages, 11 figures)

  4. arXiv:1511.00716 [pdf, ps, other]

    physics.soc-ph cs.DL

    The Distribution of the Asymptotic Number of Citations to Sets of Publications by a Researcher or From an Academic Department Are Consistent With a Discrete Lognormal Model

    Authors: João A. G. Moreira, Xiao Han T. Zeng, Luís A. Nunes Amaral

    Abstract: How to quantify the impact of a researcher's or an institution's body of work is a matter of increasing importance to scientists, funding agencies, and hiring committees. The use of bibliometric indicators, such as the h-index or the Journal Impact Factor, have become widespread despite their known limitations. We argue that most existing bibliometric indicators are inconsistent, biased, and, wors…

    Submitted 2 November, 2015; originally announced November 2015.

    Comments: 20 pages, 11 figures, 3 tables

  5. arXiv:1402.0422 [pdf, other]

    stat.ML cs.IR cs.LG physics.soc-ph

    A high-reproducibility and high-accuracy method for automated topic classification

    Authors: Andrea Lancichinetti, M. Irmak Sirer, Jane X. Wang, Daniel Acuna, Konrad Körding, Luís A. Nunes Amaral

    Abstract: Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requires algorithms that extract and record metadata on unstructured text documents. Assigning topics to documents will enable intelligent search, statistical characterization, and meaningful classification. Latent Dirichlet allocation (LDA) is the state-of-the-art in topic classification. Here, we perf…

    Submitted 3 February, 2014; originally announced February 2014.

    Comments: 23 pages, 24 figures

  6. arXiv:1312.3986 [pdf, other]

    physics.soc-ph cs.SI

    Correlations between user voting data, budget, and box office for films in the Internet Movie Database

    Authors: Max Wasserman, Satyam Mukherjee, Konner Scott, Xiao Han T. Zeng, Filippo Radicchi, Luís A. N. Amaral

    Abstract: The Internet Movie Database (IMDb) is one of the most-visited websites in the world and the premier source for information on films. Like Wikipedia, much of IMDb's information is user contributed. IMDb also allows users to voice their opinion on the quality of films through voting. We investigate whether there is a connection between this user voting data and certain economic film characteristics.…

    Submitted 16 January, 2014; v1 submitted 13 December, 2013; originally announced December 2013.

    Comments: 14 pages, 8 figures, 3 tables, accepted for publication to JASIST

  7. arXiv:1212.3320 [pdf, other]

    physics.soc-ph cs.DL physics.data-an

    The Possible Role of Resource Requirements and Academic Career-Choice Risk on Gender Differences in Publication Rate and Impact

    Authors: Jordi Duch, Xiao Han T. Zeng, Marta Sales-Pardo, Filippo Radicchi, Shayna Otis, Teresa K. Woodruff, Luis A. Nunes Amaral

    Abstract: Many studies demonstrate that there is still a significant gender bias, especially at higher career levels, in many areas including science, technology, engineering, and mathematics (STEM). We investigated field-dependent, gender-specific effects of the selective pressures individuals experience as they pursue a career in academia within seven STEM disciplines. We built a unique database that comp…

    Submitted 13 December, 2012; originally announced December 2012.

    Comments: 9 figures and 3 tables

    Journal ref: PLoS ONE 7(12): e51332

  8. arXiv:1105.0469 [pdf, other]

    physics.soc-ph cs.SI

    Rationality, irrationality and escalating behavior in lowest unique bid auctions

    Authors: Filippo Radicchi, Andrea Baronchelli, Luis A. N. Amaral

    Abstract: Information technology has revolutionized the traditional structure of markets. The removal of geographical and time constraints has fostered the growth of online auction markets, which now include millions of economic agents worldwide and annual transaction volumes in the billions of dollars. Here, we analyze bid histories of a little studied type of online auctions --- lowest unique bid auctions…

    Submitted 18 January, 2012; v1 submitted 2 May, 2011; originally announced May 2011.

    Comments: 36 pages, 30 figures, 5 tables

    Journal ref: PLoS ONE 7, e29910 (2012)

  9. arXiv:0905.0106 [pdf, ps, other]

    physics.soc-ph cs.CY physics.data-an

    Characterizing Individual Communication Patterns

    Authors: R. Dean Malmgren, Jake M. Hofman, Luis A. N. Amaral, Duncan J. Watts

    Abstract: The increasing availability of electronic communication data, such as that arising from e-mail exchange, presents social and information scientists with new possibilities for characterizing individual behavior and, by extension, identifying latent structure in human populations. Here, we propose a model of individual e-mail communication that is sufficiently rich to capture meaningful variabilit…

    Submitted 1 May, 2009; originally announced May 2009.

    Comments: 9 pages, 6 figures, to appear in Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), June 28-July 1, Paris, France

  10. arXiv:0901.0585 [pdf, ps, other]

    physics.soc-ph cs.CY physics.data-an

    A Poissonian explanation for heavy-tails in e-mail communication

    Authors: R. Dean Malmgren, Daniel B. Stouffer, Adilson E. Motter, Luis A. N. Amaral

    Abstract: Patterns of deliberate human activity and behavior are of utmost importance in areas as diverse as disease spread, resource allocation, and emergency response. Because of its widespread availability and use, e-mail correspondence provides an attractive proxy for studying human activity. Recently, it was reported that the probability density for the inter-event time $τ$ between consecutively sent…

    Submitted 5 January, 2009; originally announced January 2009.

    Comments: 9 pages, 5 figures

    Journal ref: PNAS 105(47): 18153-18158 (2008)