Skip to main content

Showing 1–50 of 90 results for author: Morales, G

Searching in archive cs. Search in all archives.
.
  1. Alice and the Caterpillar: A more descriptive null model for assessing data mining results

    Authors: Giulia Preti, Gianmarco De Francisci Morales, Matteo Riondato

    Abstract: We introduce novel null models for assessing the results obtained from observed binary transactional and sequence datasets, using statistical hypothesis testing. Our null models maintain more properties of the observed dataset than existing ones. Specifically, they preserve the Bipartite Joint Degree Matrix of the bipartite (multi-)graph corresponding to the dataset, which ensures that the number… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Journal ref: Knowledge and Information Systems, 2024

  2. arXiv:2506.05086  [pdf, ps, other

    cs.SI cs.CY

    Early linguistic fingerprints of online users who engage with conspiracy communities

    Authors: Francesco Corso, Giuseppe Russo, Francesco Pierri, Gianmarco De Francisci Morales

    Abstract: Online social media platforms are often seen as catalysts for radicalization, as they provide spaces where extreme beliefs can take root and spread, sometimes leading to real-world consequences. Conspiracy theories represent a specific form of radicalization that is notoriously resistant to online moderation strategies. One explanation for this resilience is the presence of a "conspiratorial minds… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  3. arXiv:2504.20295  [pdf, other

    cs.LG cs.AI cs.CR

    The Dark Side of Digital Twins: Adversarial Attacks on AI-Driven Water Forecasting

    Authors: Mohammadhossein Homaei, Victor Gonzalez Morales, Oscar Mogollon-Gutierrez, Andres Caro

    Abstract: Digital twins (DTs) are improving water distribution systems by using real-time data, analytics, and prediction models to optimize operations. This paper presents a DT platform designed for a Spanish water supply network, utilizing Long Short-Term Memory (LSTM) networks to predict water consumption. However, machine learning models are vulnerable to adversarial attacks, such as the Fast Gradient S… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 7 Pages, 7 Figures

  4. arXiv:2504.20275  [pdf, other

    cs.CR cs.AI cs.LG

    Smart Water Security with AI and Blockchain-Enhanced Digital Twins

    Authors: Mohammadhossein Homaei, Victor Gonzalez Morales, Oscar Mogollon Gutierrez, Ruben Molano Gomez, Andres Caro

    Abstract: Water distribution systems in rural areas face serious challenges such as a lack of real-time monitoring, vulnerability to cyberattacks, and unreliable data handling. This paper presents an integrated framework that combines LoRaWAN-based data acquisition, a machine learning-driven Intrusion Detection System (IDS), and a blockchain-enabled Digital Twin (BC-DT) platform for secure and transparent w… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 8 Pages, 9 Figures

  5. arXiv:2502.05049  [pdf, other

    cs.SI cs.CY

    On the Inference of Sociodemographics on Reddit

    Authors: Federico Cinus, Corrado Monti, Paolo Bajardi, Gianmarco De Francisci Morales

    Abstract: Inference of sociodemographic attributes of social media users is an essential step for computational social science (CSS) research to link online and offline behavior. However, there is a lack of a systematic evaluation and clear guidelines for optimal methodologies for this task on Reddit, one of today's largest social media. In this study, we fill this gap by comparing state-of-the-art (SOTA) a… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  6. arXiv:2412.10570  [pdf, other

    cs.LG stat.ML

    Adaptive Sampling to Reduce Epistemic Uncertainty Using Prediction Interval-Generation Neural Networks

    Authors: Giorgio Morales, John Sheppard

    Abstract: Obtaining high certainty in predictive models is crucial for making informed and trustworthy decisions in many scientific and engineering domains. However, extensive experimentation required for model accuracy can be both costly and time-consuming. This paper presents an adaptive sampling approach designed to reduce epistemic uncertainty in predictive models. Our primary contribution is the develo… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: Accepted to appear in AAAI 2025

  7. GPU Sharing with Triples Mode

    Authors: Chansup Byun, Albert Reuther, LaToya Anderson, William Arcand, Bill Bergeron, David Bestor, Alexander Bonn, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Peter Michaleas, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner

    Abstract: There is a tremendous amount of interest in AI/ML technologies due to the proliferation of generative AI applications such as ChatGPT. This trend has significantly increased demand on GPUs, which are the workhorses for training AI models. Due to the high costs of GPUs and lacking supply, it has become of interest to optimize GPU usage in HPC centers. MIT Lincoln Laboratory Supercomputing Center (L… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

  8. LLload: An Easy-to-Use HPC Utilization Tool

    Authors: Chansup Byun, Albert Reuther, Julie Mullen, LaToya Anderson, William Arcand, Bill Bergeron, David Bestor, Alexander Bonn, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Peter Michaleas, Lauren Milechin, Guillermo Morales, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner

    Abstract: The increasing use and cost of high performance computing (HPC) requires new easy-to-use tools to enable HPC users and HPC systems engineers to transparently understand the utilization of resources. The MIT Lincoln Laboratory Supercomputing Center (LLSC) has developed a simple command, LLload, to monitor and characterize HPC workloads. LLload plays an important role in identifying opportunities fo… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

  9. arXiv:2410.10562  [pdf, other

    cs.CY cs.HC cs.SI stat.AP

    Causal Modeling of Climate Activism on Reddit

    Authors: Jacopo Lenti, Luca Maria Aiello, Corrado Monti, Gianmarco De Francisci Morales

    Abstract: Climate activism is crucial in stimulating collective societal and behavioral change towards sustainable practices through political pressure. Although multiple factors contribute to the participation in activism, their complex relationships and the scarcity of data on their interactions have restricted most prior research to studying them in isolation, thus preventing the development of a quantit… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  10. HPC with Enhanced User Separation

    Authors: Andrew Prout, Albert Reuther, Michael Houle, Michael Jones, Peter Michaleas, LaToya Anderson, William Arcand, Bill Bergeron, David Bestor, Alex Bonn, Daniel Burrill, Chansup Byun, Vijay Gadepally, Matthew Hubbell, Hayden Jananthan, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Antonio Rosa, Charles Yee, Jeremy Kepner

    Abstract: HPC systems used for research run a wide variety of software and workflows. This software is often written or modified by users to meet the needs of their research projects, and rarely is built with security in mind. In this paper we explore several of the key techniques that MIT Lincoln Laboratory Supercomputing Center has deployed on its systems to manage the security implications of these workf… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  11. arXiv:2409.08115  [pdf, other

    cs.NI cs.DM cs.PF cs.SE math.CO

    Anonymized Network Sensing Graph Challenge

    Authors: Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Peter Michaleas, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther , et al. (4 additional authors not shown)

    Abstract: The MIT/IEEE/Amazon GraphChallenge encourages community approaches to developing new solutions for analyzing graphs and sparse data derived from social media, sensor feeds, and scientific data to discover relationships between events as they unfold in the field. The anonymized network sensing Graph Challenge seeks to enable large, open, community-based approaches to protecting networks. Many large… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: Accepted to IEEE HPEC 2024

  12. arXiv:2409.06817  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Bifurcation Identification for Ultrasound-driven Robotic Cannulation

    Authors: Cecilia G. Morales, Dhruv Srikanth, Jack H. Good, Keith A. Dufendach, Artur Dubrawski

    Abstract: In trauma and critical care settings, rapid and precise intravascular access is key to patients' survival. Our research aims at ensuring this access, even when skilled medical personnel are not readily available. Vessel bifurcations are anatomical landmarks that can guide the safe placement of catheters or needles during medical procedures. Although ultrasound is advantageous in navigating anatomi… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024

  13. arXiv:2409.03111  [pdf, other

    cs.NI cs.CR cs.CY cs.SI

    What is Normal? A Big Data Observational Science Model of Anonymized Internet Traffic

    Authors: Jeremy Kepner, Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther , et al. (4 additional authors not shown)

    Abstract: Understanding what is normal is a key aspect of protecting a domain. Other domains invest heavily in observational science to develop models of normal behavior to better detect anomalies. Recent advances in high performance graph libraries, such as the GraphBLAS, coupled with supercomputers enables processing of the trillions of observations required. We leverage this approach to synthesize low-pa… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Accepted to IEEE HPEC, 7 pages, 6 figures, 1 table, 41 references

  14. Polaris: Sampling from the Multigraph Configuration Model with Prescribed Color Assortativity

    Authors: Giulia Preti, Matteo Riondato, Aristides Gionis, Gianmarco De Francisci Morales

    Abstract: We introduce Polaris, a network null model for colored multi-graphs that preserves the Joint Color Matrix. Polaris is specifically designed for studying network polarization, where vertices belong to a side in a debate or a partisan group, represented by a vertex color, and relations have different strengths, represented by an integer-valued edge multiplicity. The key feature of Polaris is preserv… ▽ More

    Submitted 18 December, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    Comments: Accepted for publication at WSDM2025

  15. arXiv:2408.12872  [pdf, other

    cs.CY

    Moral Judgments in Online Discourse are not Biased by Gender

    Authors: Lorenzo Betti, Paolo Bajardi, Gianmarco De Francisci Morales

    Abstract: The interaction between social norms and gender roles prescribes gender-specific behaviors that influence moral judgments. Here, we study how moral judgments are biased by the gender of the protagonist of a story. Using data from r/AITA, a Reddit community with 17 million members who share first-hand experiences seeking community judgment on their behavior, we employ machine learning techniques to… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

  16. arXiv:2407.21273  [pdf, other

    cs.CV cs.AI cs.LG

    Enhanced Uncertainty Estimation in Ultrasound Image Segmentation with MSU-Net

    Authors: Rohini Banerjee, Cecilia G. Morales, Artur Dubrawski

    Abstract: Efficient intravascular access in trauma and critical care significantly impacts patient outcomes. However, the availability of skilled medical personnel in austere environments is often limited. Autonomous robotic ultrasound systems can aid in needle insertion for medication delivery and support non-experts in such tasks. Despite advances in autonomous needle insertion, inaccuracies in vessel seg… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Accepted for the 5th International Workshop of Advances in Simplifying Medical UltraSound (ASMUS), held in conjunction with MICCAI 2024, the 27th International Conference on Medical Image Computing and Computer Assisted Intervention

  17. arXiv:2407.12545  [pdf, other

    cs.CY cs.SI

    Conspiracy theories and where to find them on TikTok

    Authors: Francesco Corso, Francesco Pierri, Gianmarco De Francisci Morales

    Abstract: TikTok has skyrocketed in popularity over recent years, especially among younger audiences. However, there are public concerns about the potential of this platform to promote and amplify harmful content. This study presents the first systematic analysis of conspiracy theories on TikTok. By leveraging the official TikTok Research API we collect a longitudinal dataset of 1.5M videos shared in the U.… ▽ More

    Submitted 19 May, 2025; v1 submitted 17 July, 2024; originally announced July 2024.

  18. arXiv:2407.01481  [pdf, other

    cs.DC cs.PF

    LLload: Simplifying Real-Time Job Monitoring for HPC Users

    Authors: Chansup Byun, Julia Mullen, Albert Reuther, William Arcand, William Bergeron, David Bestor, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Peter Michaleas, Guillermo Morales, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner, Lauren Milechin

    Abstract: One of the more complex tasks for researchers using HPC systems is performance monitoring and tuning of their applications. Developing a practice of continuous performance improvement, both for speed-up and efficient use of resources is essential to the long term success of both the HPC practitioner and the research project. Profiling tools provide a nice view of the performance of an application… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  19. arXiv:2406.17834  [pdf, other

    cs.LG cs.AI

    Univariate Skeleton Prediction in Multivariate Systems Using Transformers

    Authors: Giorgio Morales, John W. Sheppard

    Abstract: Symbolic regression (SR) methods attempt to learn mathematical expressions that approximate the behavior of an observed system. However, when dealing with multivariate systems, they often fail to identify the functional form that explains the relationship between each variable and the system's response. To begin to address this, we propose an explainable neural SR method that generates univariate… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Paper accepted at European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) 2024

  20. arXiv:2403.10730  [pdf, other

    cs.LG

    Counterfactual Analysis of Neural Networks Used to Create Fertilizer Management Zones

    Authors: Giorgio Morales, John Sheppard

    Abstract: In Precision Agriculture, the utilization of management zones (MZs) that take into account within-field variability facilitates effective fertilizer management. This approach enables the optimization of nitrogen (N) rates to maximize crop yield production and enhance agronomic use efficiency. However, existing works often neglect the consideration of responsivity to fertilizer as a factor influenc… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to appear in the International Joint Conference on Neural Networks 2024

  21. arXiv:2403.05358  [pdf, other

    cs.CY cs.LG cs.SI stat.ML

    Variational Inference of Parameters in Opinion Dynamics Models

    Authors: Jacopo Lenti, Fabrizio Silvestri, Gianmarco De Francisci Morales

    Abstract: Despite the frequent use of agent-based models (ABMs) for studying social phenomena, parameter estimation remains a challenge, often relying on costly simulation-based heuristics. This work uses variational inference to estimate the parameters of an opinion dynamics ABM, by transforming the estimation problem into an optimization task that can be solved directly. Our proposal relies on probabili… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  22. arXiv:2402.18470  [pdf, other

    cs.SI physics.data-an

    Higher-order null models as a lens for social systems

    Authors: Giulia Preti, Adriano Fazzone, Giovanni Petri, Gianmarco De Francisci Morales

    Abstract: Despite the widespread adoption of higher-order mathematical structures such as hypergraphs, methodological tools for their analysis lag behind those for traditional graphs. This work addresses a critical gap in this context by proposing two micro-canonical random null models for directed hypergraphs: the Directed Hypergraph Configuration Model (DHCM) and the Directed Hypergraph JOINT Model (DHJM)… ▽ More

    Submitted 17 September, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in PRX

  23. arXiv:2402.13855  [pdf, other

    cs.CY cs.SI

    What we can learn from TikTok through its Research API

    Authors: Francesco Corso, Francesco Pierri, Gianmarco De Francisci Morales

    Abstract: TikTok is a social media platform that has gained immense popularity over the last few years, particularly among younger demographics, due to the viral trends and challenges shared worldwide. The recent release of a free Research API opens the door to collecting data on posted videos, associated comments, and user activities. Our study focuses on evaluating the reliability of the results returned… ▽ More

    Submitted 4 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 11 pages, 8 Figures, submitted to DHOW at WebSci'24

  24. arXiv:2401.13656  [pdf, other

    cs.SI cs.CY physics.soc-ph stat.AP

    Navigating Multidimensional Ideologies with Reddit's Political Compass: Economic Conflict and Social Affinity

    Authors: Ernesto Colacrai, Federico Cinus, Gianmarco De Francisci Morales, Michele Starnini

    Abstract: The prevalent perspective in quantitative research on opinion dynamics flattens the landscape of the online political discourse into a traditional left--right dichotomy. While this approach helps simplify the analysis and modeling effort, it also neglects the intrinsic multidimensional richness of ideologies. In this study, we analyze social interactions on Reddit, under the lens of a multi-dimens… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  25. arXiv:2311.00118  [pdf, other

    cs.LG q-bio.NC stat.AP stat.ME stat.ML

    Extracting the Multiscale Causal Backbone of Brain Dynamics

    Authors: Gabriele D'Acunto, Francesco Bonchi, Gianmarco De Francisci Morales, Giovanni Petri

    Abstract: The bulk of the research effort on brain connectivity revolves around statistical associations among brain regions, which do not directly relate to the causal mechanisms governing brain dynamics. Here we propose the multiscale causal backbone (MCB) of brain dynamics, shared by a set of individuals across multiple temporal scales, and devise a principled methodology to extract it. Our approach le… ▽ More

    Submitted 19 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Accepted at the 3rd conference on Causal Learning and Reasoning (CLeaR 2024)

  26. arXiv:2310.19951  [pdf, other

    cs.CY cs.SI physics.soc-ph

    Measuring Behavior Change with Observational Studies: a Review

    Authors: Arianna Pera, Gianmarco de Francisci Morales, Luca Maria Aiello

    Abstract: Exploring behavioral change in the digital age is imperative for societal progress in the context of 21st-century challenges. We analyzed 148 articles (2000-2023) and built a map that categorizes behaviors and change detection methodologies, platforms of reference, and theoretical frameworks that characterize online behavior change. Our findings uncover a focus on sentiment shifts, an emphasis on… ▽ More

    Submitted 2 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  27. Systematic discrepancies in the delivery of political ads on Facebook and Instagram

    Authors: Dominik Bär, Francesco Pierri, Gianmarco De Francisci Morales, Stefan Feuerriegel

    Abstract: Political advertising on social media has become a central element in election campaigns. However, granular information about political advertising on social media was previously unavailable, thus raising concerns regarding fairness, accountability, and transparency in the electoral process. In this paper, we analyze targeted political advertising on social media via a unique, large-scale dataset… ▽ More

    Submitted 24 June, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: Accepted for publication at PNAS NEXUS. The first two authors contributed equally to this research

    Journal ref: Dominik Bär, Francesco Pierri, Gianmarco De Francisci Morales, Stefan Feuerriegel, Systematic discrepancies in the delivery of political ads on Facebook and Instagram, PNAS Nexus, 2024;, pgae247

  28. arXiv:2310.02766  [pdf, other

    cs.SI cs.CY

    Likelihood-Based Methods Improve Parameter Estimation in Opinion Dynamics Models

    Authors: Jacopo Lenti, Corrado Monti, Gianmarco De Francisci Morales

    Abstract: We show that a maximum likelihood approach for parameter estimation in agent-based models (ABMs) of opinion dynamics outperforms the typical simulation-based approach. Simulation-based approaches simulate the model repeatedly in search of a set of parameters that generates data similar enough to the observed one. In contrast, likelihood-based approaches derive a likelihood function that connects t… ▽ More

    Submitted 5 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

  29. arXiv:2310.00522  [pdf, other

    cs.SI

    Mapping of Internet "Coastlines" via Large Scale Anonymized Network Source Correlations

    Authors: Hayden Jananthan, Jeremy Kepner, Michael Jones, William Arcand, David Bestor, William Bergeron, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg , et al. (3 additional authors not shown)

    Abstract: Expanding the scientific tools available to protect computer networks can be aided by a deeper understanding of the underlying statistical distributions of network traffic and their potential geometric interpretations. Analyses of large scale network observations provide a unique window into studying those underlying statistics. Newly developed GraphBLAS hypersparse matrices and D4M associative ar… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 9 pages, 7 figures, IEEE HPEC 2023 (accepted)

  30. arXiv:2309.08363  [pdf, other

    cs.CY cs.HC cs.SI

    Narratives of War: Ukrainian Memetic Warfare on Twitter

    Authors: Yelena Mejova, Arthur Capozzi, Corrado Monti, Gianmarco De Francisci Morales

    Abstract: The 2022 Russian invasion of Ukraine has seen an intensification in the use of social media by governmental actors in cyber warfare. Wartime communication via memes has been a successful strategy used not only by independent accounts such as @uamemesforces, but also-for the first time in a full-scale interstate war-by official Ukrainian government accounts such as @Ukraine and @DefenceU. We study… ▽ More

    Submitted 20 January, 2025; v1 submitted 15 September, 2023; originally announced September 2023.

    ACM Class: J.4; K.4

    Journal ref: ACM SIGCHI Conference on Computer-Supported Cooperative Work & Social Computing (CSCW) 2025

  31. pPython Performance Study

    Authors: Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner

    Abstract: pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. pPython follows a SPMD (single program multiple data) model of computation. pPython runs on a single-node (e.g., a laptop) running Window… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.14908

  32. Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays

    Authors: Michael Jones, Jeremy Kepner, Andrew Prout, Timothy Davis, William Arcand, David Bestor, William Bergeron, Chansup Byun, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Sandeep Pisharody, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas

    Abstract: Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires int… ▽ More

    Submitted 8 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 8 pages, 8 figures, 1 table, 69 references. arXiv admin note: text overlap with arXiv:2203.13934. text overlap with arXiv:2309.01806

  33. Focusing and Calibration of Large Scale Network Sensors using GraphBLAS Anonymized Hypersparse Matrices

    Authors: Jeremy Kepner, Michael Jones, Phil Dykstra, Chansup Byun, Timothy Davis, Hayden Jananthan, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Charles Yee , et al. (1 additional authors not shown)

    Abstract: Defending community-owned cyber space requires community-based efforts. Large-scale network observations that uphold the highest regard for privacy are key to protecting our shared cyberspace. Deployment of the necessary network sensors requires careful sensor placement, focusing, and calibration with significant volumes of network observations. This paper demonstrates novel focusing and calibrati… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 9 pages, 12 figures, 1 table, 63 references, 2 appendices

  34. arXiv:2308.10838  [pdf, other

    cs.SI physics.soc-ph

    An impossibility result for Markov Chain Monte Carlo sampling from micro-canonical bipartite graph ensembles

    Authors: Giulia Preti, Gianmarco De Francisci Morales, Matteo Riondato

    Abstract: Markov Chain Monte Carlo (MCMC) algorithms are commonly used to sample from graph ensembles. Two graphs are neighbors in the state space if one can be obtained from the other with only a few modifications, e.g., edge rewirings. For many common ensembles, e.g., those preserving the degree sequences of bipartite graphs, rewiring operations involving two edges are sufficient to create a fully-connect… ▽ More

    Submitted 10 September, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in Physical Review E

  35. arXiv:2306.02696  [pdf, other

    cs.DS

    Hyper-distance Oracles in Hypergraphs

    Authors: Giulia Preti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: We study point-to-point distance estimation in hypergraphs, where the query is parameterized by a positive integer s, which defines the required level of overlap for two hyperedges to be considered adjacent. To answer s-distance queries, we first explore an oracle based on the line graph of the given hypergraph and discuss its limitations: the main one is that the line graph is typically orders of… ▽ More

    Submitted 19 March, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: To appear in VLDBJ

  36. arXiv:2304.04063  [pdf, other

    cs.LG

    Counterfactual Explanations of Neural Network-Generated Response Curves

    Authors: Giorgio Morales, John Sheppard

    Abstract: Response curves exhibit the magnitude of the response of a sensitive system to a varying stimulus. However, response of such systems may be sensitive to multiple stimuli (i.e., input features) that are not necessarily independent. As a consequence, the shape of response curves generated for a selected input feature (referred to as "active feature") might depend on the values of the other input fea… ▽ More

    Submitted 13 April, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

    Comments: Accepted to appear in the International Joint Conference on Neural Networks 2023

  37. arXiv:2303.12014  [pdf, other

    cs.SI cs.CY

    Authority without Care: Moral Values behind the Mask Mandate Response

    Authors: Yelena Mejova, Kyrieki Kalimeri, Gianmarco De Francisci Morales

    Abstract: Face masks are one of the cheapest and most effective non-pharmaceutical interventions available against airborne diseases such as COVID-19. Unfortunately, they have been met with resistance by a substantial fraction of the populace, especially in the U.S. In this study, we uncover the latent moral values that underpin the response to the mask mandate, and paint them against the country's politica… ▽ More

    Submitted 30 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  38. arXiv:2302.07598  [pdf, other

    cs.CY cs.SI physics.soc-ph

    Evidence of Demographic rather than Ideological Segregation in News Discussion on Reddit

    Authors: Corrado Monti, Jacopo D'Ignazi, Michele Starnini, Gianmarco De Francisci Morales

    Abstract: We evaluate homophily and heterophily among ideological and demographic groups in a typical opinion formation context: online discussions of current news. We analyze user interactions across five years in the r/news community on Reddit, one of the most visited websites in the United States. Then, we estimate demographic and ideological attributes of these users. Thanks to a comparison with a caref… ▽ More

    Submitted 5 July, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Published at WWW '23

    ACM Class: J.4; K.4

    Journal ref: Proceedings of the ACM Web Conference 2023 (WWW '23), May 1-5, 2023, Austin, TX, USA. ACM

  39. The Thin Ideology of Populist Advertising on Facebook during the 2019 EU Elections

    Authors: Arthur Capozzi, Gianmarco De Francisci Morales, Yelena Mejova, Corrado Monti, André Panisson

    Abstract: Social media has been an important tool in the expansion of the populist message, and it is thought to have contributed to the electoral success of populist parties in the past decade. This study compares how populist parties advertised on Facebook during the 2019 European Parliamentary election. In particular, we examine commonalities and differences in which audiences they reach and on which iss… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Journal ref: In Proceedings of the ACM Web Conference 2023 (WWW '23), May 1-5, 2023, Austin, TX, USA. ACM, New York, NY, USA, 11 pages

  40. Dual Accuracy-Quality-Driven Neural Network for Prediction Interval Generation

    Authors: Giorgio Morales, John W. Sheppard

    Abstract: Accurate uncertainty quantification is necessary to enhance the reliability of deep learning models in real-world applications. In the case of regression tasks, prediction intervals (PIs) should be provided along with the deterministic predictions of deep learning models. Such PIs are useful or "high-quality" as long as they are sufficiently narrow and capture most of the probability density. In t… ▽ More

    Submitted 21 March, 2024; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted at the IEEE Transactions on Neural Networks and Learning Systems

    Journal ref: G. Morales and J. W. Sheppard, "Dual Accuracy-Quality-Driven Neural Network for Prediction Interval Generation," in IEEE Transactions on Neural Networks and Learning Systems, 2023

  41. arXiv:2210.17234  [pdf, other

    cs.CY physics.soc-ph

    The language of opinion change on social media under the lens of communicative action

    Authors: Corrado Monti, Luca Maria Aiello, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: Which messages are more effective at inducing a change of opinion in the listener? We approach this question within the frame of Habermas' theory of communicative action, which posits that the illocutionary intent of the message (its pragmatic meaning) is the key. Thanks to recent advances in natural language processing, we are able to operationalize this theory by extracting the latent social dim… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Main paper: 13 pages, 1 figure, 3 tables. Supplementary material: 9 pages, 6 figures, 8 tables

    ACM Class: H.4.0; K.4.0

    Journal ref: Nature Scientific Reports 12, 17920 (2022)

  42. Python Implementation of the Dynamic Distributed Dimensional Data Model

    Authors: Hayden Jananthan, Lauren Milechin, Michael Jones, William Arcand, William Bergeron, David Bestor, Chansup Byun, Michael Houle, Matthew Hubbell, Vijay Gadepally, Anna Klein, Peter Michaleas, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner

    Abstract: Python has become a standard scientific computing language with fast-growing support of machine learning and data analysis modules, as well as an increasing usage of big data. The Dynamic Distributed Dimensional Data Model (D4M) offers a highly composable, unified data model with strong performance built to handle big data fast and efficiently. In this work we present an implementation of D4M in P… ▽ More

    Submitted 22 November, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: 8 pages, 7 figures, accepted to HPEC 2022

  43. arXiv:2208.14989  [pdf, other

    cs.LG stat.ME stat.ML

    Learning Multiscale Non-stationary Causal Structures

    Authors: Gabriele D'Acunto, Gianmarco De Francisci Morales, Paolo Bajardi, Francesco Bonchi

    Abstract: This paper addresses a gap in the current state of the art by providing a solution for modeling causal relationships that evolve over time and occur at different time scales. Specifically, we introduce the multiscale non-stationary directed acyclic graph (MN-DAG), a framework for modeling multivariate time series data. Our contribution is twofold. Firstly, we expose a probabilistic generative mode… ▽ More

    Submitted 17 November, 2023; v1 submitted 31 August, 2022; originally announced August 2022.

    Journal ref: Transactions on Machine Learning Research, 2023, ISSN 2835-8856

  44. pPython for Parallel Python Programming

    Authors: Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Kurt Keville, Anna Klein, Peter Michaleas, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner

    Abstract: pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. The core data structure in pPython is a distributed numerical array whose distribution onto multiple processors is specified with a map c… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:astro-ph/0606464

  45. arXiv:2207.12196  [pdf, other

    cs.SI cs.CY

    On the Relation Between Opinion Change and Information Consumption on Reddit

    Authors: Flavio Petruzzellis, Corrado Monti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: While much attention has been devoted to the causes of opinion change, little is known about its consequences. Our study sheds a light on the relationship between one user's opinion change episode and subsequent behavioral change on an online social media, Reddit. In particular, we look at r/ChangeMyView, an online community dedicated to debating one's own opinions. Interestingly, this forum adopt… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: To appear in Proceedings of the International AAAI Conference on Web and Social Media (ICWSM 2023)

    ACM Class: J.4; K.4

  46. arXiv:2205.12992  [pdf, other

    cs.RO cs.AI

    Open Arms: Open-Source Arms, Hands & Control

    Authors: David Hanson, Alishba Imran, Gerardo Morales, Vytas Krisciunas, Aditya Sagi, Aman Malali, Rushali Mohbe, Raviteja Upadrashta

    Abstract: Open Arms is a novel open-source platform of realistic human-like robotic hands and arms hardware with 28 Degree-of-Freedom (DoF), designed to extend the capabilities and accessibility of humanoid robotic grasping and manipulation. The Open Arms framework includes an open SDK and development environment, simulation tools, and application development tools to build and operate Open Arms. This paper… ▽ More

    Submitted 15 July, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Submitted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  47. arXiv:2205.05052  [pdf, other

    physics.soc-ph cs.LG econ.EM

    On learning agent-based models from data

    Authors: Corrado Monti, Marco Pangallo, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: Agent-Based Models (ABMs) are used in several fields to study the evolution of complex systems from micro-level assumptions. However, ABMs typically can not estimate agent-specific (or "micro") variables: this is a major limitation which prevents ABMs from harnessing micro-level data availability and which greatly limits their predictive power. In this paper, we propose a protocol to learn the lat… ▽ More

    Submitted 23 November, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

  48. arXiv:2205.00308  [pdf, other

    cs.SI cs.CY

    Modeling Political Activism around Gun Debate via Social Media

    Authors: Yelena Mejova, Jisun An, Gianmarco De Francisci Morales, Haewoon Kwak

    Abstract: The United States have some of the highest rates of gun violence among developed countries. Yet, there is a disagreement about the extent to which firearms should be regulated. In this study, we employ social media signals to examine the predictors of offline political activism, at both population and individual level. We show that it is possible to classify the stance of users on the gun issue, e… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

    Journal ref: ACM Transactions on Social Computing. 2022

  49. FreSCo: Mining Frequent Patterns in Simplicial Complexes

    Authors: Giulia Preti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: Simplicial complexes are a generalization of graphs that model higher-order relations. In this paper, we introduce simplicial patterns -- that we call simplets -- and generalize the task of frequent pattern mining from the realm of graphs to that of simplicial complexes. Our task is particularly challenging due to the enormous search space and the need for higher-order isomorphism. We show that fi… ▽ More

    Submitted 26 January, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: To appear at The Web Conference 2022

  50. arXiv:2111.08069  [pdf, other

    cs.CV eess.IV

    Two-dimensional Deep Regression for Early Yield Prediction of Winter Wheat

    Authors: Giorgio Morales, John W. Sheppard

    Abstract: Crop yield prediction is one of the tasks of Precision Agriculture that can be automated based on multi-source periodic observations of the fields. We tackle the yield prediction problem using a Convolutional Neural Network (CNN) trained on data that combines radar satellite imagery and on-ground information. We present a CNN architecture called Hyper3DNetReg that takes in a multi-channel input im… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: Accepted to appear in the SPIE Future Sensing Technologies 2021