Search | arXiv e-print repository

Are the Hidden States Hiding Something? Testing the Limits of Factuality-Encoding Capabilities in LLMs

Authors: Giovanni Servedio, Alessandro De Bellis, Dario Di Palma, Vito Walter Anelli, Tommaso Di Noia

Abstract: Factual hallucinations are a major challenge for Large Language Models (LLMs). They undermine reliability and user trust by generating inaccurate or fabricated content. Recent studies suggest that when generating false statements, the internal states of LLMs encode information about truthfulness. However, these studies often rely on synthetic datasets that lack realism, which limits generalization… ▽ More Factual hallucinations are a major challenge for Large Language Models (LLMs). They undermine reliability and user trust by generating inaccurate or fabricated content. Recent studies suggest that when generating false statements, the internal states of LLMs encode information about truthfulness. However, these studies often rely on synthetic datasets that lack realism, which limits generalization when evaluating the factual accuracy of text generated by the model itself. In this paper, we challenge the findings of previous work by investigating truthfulness encoding capabilities, leading to the generation of a more realistic and challenging dataset. Specifically, we extend previous work by introducing: (1) a strategy for sampling plausible true-false factoid sentences from tabular data and (2) a procedure for generating realistic, LLM-dependent true-false datasets from Question Answering collections. Our analysis of two open-source LLMs reveals that while the findings from previous studies are partially validated, generalization to LLM-generated datasets remains challenging. This study lays the groundwork for future research on factuality in LLMs and offers practical guidelines for more effective evaluation. △ Less

Submitted 30 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

arXiv:2505.16491 [pdf, ps, other]

LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing

Authors: Dario Di Palma, Alessandro De Bellis, Giovanni Servedio, Vito Walter Anelli, Fedelucio Narducci, Tommaso Di Noia

Abstract: Large Language Models (LLMs) have rapidly become central to NLP, demonstrating their ability to adapt to various tasks through prompting techniques, including sentiment analysis. However, we still have a limited understanding of how these models capture sentiment-related information. This study probes the hidden layers of Llama models to pinpoint where sentiment features are most represented and t… ▽ More Large Language Models (LLMs) have rapidly become central to NLP, demonstrating their ability to adapt to various tasks through prompting techniques, including sentiment analysis. However, we still have a limited understanding of how these models capture sentiment-related information. This study probes the hidden layers of Llama models to pinpoint where sentiment features are most represented and to assess how this affects sentiment analysis. Using probe classifiers, we analyze sentiment encoding across layers and scales, identifying the layers and pooling methods that best capture sentiment signals. Our results show that sentiment information is most concentrated in mid-layers for binary polarity tasks, with detection accuracy increasing up to 14% over prompting techniques. Additionally, we find that in decoder-only models, the last token is not consistently the most informative for sentiment encoding. Finally, this approach enables sentiment tasks to be performed with memory requirements reduced by an average of 57%. These insights contribute to a broader understanding of sentiment in LLMs, suggesting layer-specific probing as an effective approach for sentiment tasks beyond prompting, with potential to enhance model utility and reduce memory requirements. △ Less

Submitted 30 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

arXiv:2505.10212 [pdf, ps, other]

doi 10.1145/3726302.3730178

Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M

Authors: Dario Di Palma, Felice Antonio Merra, Maurizio Sfilio, Vito Walter Anelli, Fedelucio Narducci, Tommaso Di Noia

Abstract: Large Language Models (LLMs) have become increasingly central to recommendation scenarios due to their remarkable natural language understanding and generation capabilities. Although significant research has explored the use of LLMs for various recommendation tasks, little effort has been dedicated to verifying whether they have memorized public recommendation dataset as part of their training dat… ▽ More Large Language Models (LLMs) have become increasingly central to recommendation scenarios due to their remarkable natural language understanding and generation capabilities. Although significant research has explored the use of LLMs for various recommendation tasks, little effort has been dedicated to verifying whether they have memorized public recommendation dataset as part of their training data. This is undesirable because memorization reduces the generalizability of research findings, as benchmarking on memorized datasets does not guarantee generalization to unseen datasets. Furthermore, memorization can amplify biases, for example, some popular items may be recommended more frequently than others. In this work, we investigate whether LLMs have memorized public recommendation datasets. Specifically, we examine two model families (GPT and Llama) across multiple sizes, focusing on one of the most widely used dataset in recommender systems: MovieLens-1M. First, we define dataset memorization as the extent to which item attributes, user profiles, and user-item interactions can be retrieved by prompting the LLMs. Second, we analyze the impact of memorization on recommendation performance. Lastly, we examine whether memorization varies across model families and model sizes. Our results reveal that all models exhibit some degree of memorization of MovieLens-1M, and that recommendation performance is related to the extent of memorization. We have made all the code publicly available at: https://github.com/sisinflab/LLM-MemoryInspector △ Less

Submitted 15 May, 2025; originally announced May 2025.

arXiv:2505.03880 [pdf, ps, other]

Assessing the connection between galactic conformity and assembly-type bias

Authors: Ivan Lacerna, Nelson Padilla, Daniela Palma

Abstract: Context. Galaxies in the Universe show a conformity in the fraction of quenched galaxies out to large distances, being quite larger around quenched central galaxies than for star-forming ones. On the other hand, simulations have shown that the clustering of halos and the galaxies within them depends on secondary properties other than halo mass, a phenomenon termed assembly bias. Aims. Our aim is t… ▽ More Context. Galaxies in the Universe show a conformity in the fraction of quenched galaxies out to large distances, being quite larger around quenched central galaxies than for star-forming ones. On the other hand, simulations have shown that the clustering of halos and the galaxies within them depends on secondary properties other than halo mass, a phenomenon termed assembly bias. Aims. Our aim is to study whether samples that show galactic conformity also show assembly bias and to see if the amplitude of these two effects is correlated. Methods. We use synthetic galaxies at $z = 0$ from the semi-analytical model SAG run on the MultiDark Planck 2 (MDPL2) cosmological simulation and measure both conformity and galaxy assembly bias for different samples of central galaxies at fixed host halo mass. We focus on central galaxies hosted by low-mass halos of 10$^{11.6}$ $\leq$ $M_{\rm h}$/$h^{-1}$ M$_{\odot}$ $<$ 10$^{11.8}$ because it is a mass range where the assembly bias has been reported to be strong. The samples of central galaxies are separated according to their specific star formation rate and stellar age. Results. We find that the level of conformity shown by our different samples is correlated with the level of assembly bias measured for them. We also find that removing galaxies around massive halos diminishes the conformity signal and lowers the amount of assembly bias. Conclusions. The high correlation in the amplitude of conformity and assembly bias for different samples with and without removing galaxies near massive halos clearly indicates the strong relationship between both phenomena. △ Less

Submitted 6 May, 2025; originally announced May 2025.

Comments: 8 pages and 5 figures without appendix. Submitted to A&A

arXiv:2503.18473 [pdf, other]

The On-Board Computer of the AcubeSAT Mission

Authors: Konstantinos Tsoupos, Stylianos Tzelepis, Georgios Sklavenitis, Dimitrios Stoupis, Grigorios Pavlakis, Panagiotis Bountzioukas, Christina Athanasiadou, Lily Ha, David Palma, Loris Franchi, Alkis Hatzopoulos

Abstract: AcubeSAT is an open-source CubeSat mission aiming to explore the effects of microgravity and radiation on eukaryotic cells using a compact microfluidic lab-on-a-chip platform. It is developed by SpaceDot, a volunteer, interdisciplinary student team at the Aristotle University of Thessaloniki and supported by the "Fly Your Satellite! 3" program of the European Space Agency (ESA) Education Office.… ▽ More AcubeSAT is an open-source CubeSat mission aiming to explore the effects of microgravity and radiation on eukaryotic cells using a compact microfluidic lab-on-a-chip platform. It is developed by SpaceDot, a volunteer, interdisciplinary student team at the Aristotle University of Thessaloniki and supported by the "Fly Your Satellite! 3" program of the European Space Agency (ESA) Education Office. The nanosatellite features an in-house designed on-board computer subsystem responsible for telecommand execution, telemetry fetching, onboard time synchronization, in-orbit patching, and fault recovery. The subsystem is designed on one PC/104 standard compatible Printed Circuit Board (PCB) that hosts the On-board Computer (OBC) on the one side and the Attitude and Orbit Control Subsystem (AOCS) on the other, and it is compatible with the LibreCube standard. The hosted subsystems are functionally isolated and feature an ARM Cortex-M7, radiation-tolerant microcontroller each. Before sending anything to space thorough testing is required and specifically the on-board computer board underwent vibration and thermal cycling tests to ensure nominal operation in all conditions. This paper aims to elucidate the decision-making process, design iterations, and development stages of the custom board and accompanying in-house software. Insights garnered from the initial partially successful environmental test campaign at the ESA CubeSat Support Facility will be shared, along with the ensuing preparations, results, and lessons learned from subsequent testing endeavors in April 2024. Furthermore, the current developmental status will be discussed alongside future electromagnetic compatibility testing, integration plan on a FlatSat, and prospects for the open-source design as a cost-effective, and modular solution that can be tailored with little effort for upcoming missions. △ Less

Submitted 24 March, 2025; originally announced March 2025.

Comments: 52nd IAF Student Conference, Held at the 75th International Astronautical Congress (IAC 2024)

arXiv:2406.12977 [pdf, other]

doi 10.1051/0004-6361/202450976

The evolution of low-mass central galaxies in the vicinity of massive structures and its impact on the two-halo conformity

Authors: Daniela Palma, Ivan Lacerna, M. Celeste Artale, Antonio D. Montero-Dorta, Andrés N. Ruiz, Sofía A. Cora, Facundo Rodriguez, Diego Pallero, Ana O'Mill, Nelvy Choque-Challapa

Abstract: We investigated the population of low-mass central galaxies with Mstar = $10^{9.5}-10^{10}$ Msun/h, inhabiting regions near massive groups and clusters of galaxies using the TNG300 and MDPL2-SAG simulations. We set out to study their evolutionary histories, aiming to find hints about the large-scale conformity signal they produce. We also used a control sample of central galaxies with the same ste… ▽ More We investigated the population of low-mass central galaxies with Mstar = $10^{9.5}-10^{10}$ Msun/h, inhabiting regions near massive groups and clusters of galaxies using the TNG300 and MDPL2-SAG simulations. We set out to study their evolutionary histories, aiming to find hints about the large-scale conformity signal they produce. We also used a control sample of central galaxies with the same stellar mass range located far away from massive structures. For both samples, we find a subpopulation of galaxies accreted by another halo in the past, but now considered central galaxies; we refer to these objects as former satellites. The number of former satellites is higher for quenched central galaxies near massive systems, with fractions of 45% and 17% in TNG300 and MDPL2-SAG. Our results in TNG300 show that former satellites pollute the sample of central galaxies because they suffered environmental processes when they were satellites hosted typically by massive dark matter halos (M200 $\geq 10^{13}$ Msun/h) since z$\lesssim$0.5. After removing former satellites, the evolutionary trends for quenched central galaxies near massive structures are fairly similar to those of the quenched control galaxies, showing small differences at low redshift. For MDPL2-SAG instead, former satellites were hosted by less massive halos, with a mean halo mass around $10^{11.4}$ Msun/h, and the evolutionary trends remain equal before and after removing former satellite galaxies. We also measured the two-halo conformity, i.e, the correlation in the sSFR between low-mass central galaxies and their neighbors at Mpc scales, and how former satellites contribute to the signal at three different redshifts: z=0, 0.3, and 1. The time evolution of the conformity signal in the simulations presents apparent contradictory results: it decreases from z=0 to z=1 in MDPL2-SAG, while it increases in TNG300 (abridged). △ Less

Submitted 27 January, 2025; v1 submitted 18 June, 2024; originally announced June 2024.

Comments: 17 pages, 11 figures. Published in A&A

Journal ref: A&A, 693 (2025) A67

arXiv:2405.07850 [pdf, other]

Knowledge Graph Embedding in Intent-Based Networking

Authors: Kashif Mehmood, Katina Kralevska, David Palma

Abstract: This paper presents a novel approach to network management by integrating intent-based networking (IBN) with knowledge graphs (KGs), creating a more intuitive and efficient pipeline for service orchestration. By mapping high-level business intents onto network configurations using KGs, the system dynamically adapts to network changes and service demands, ensuring optimal performance and resource a… ▽ More This paper presents a novel approach to network management by integrating intent-based networking (IBN) with knowledge graphs (KGs), creating a more intuitive and efficient pipeline for service orchestration. By mapping high-level business intents onto network configurations using KGs, the system dynamically adapts to network changes and service demands, ensuring optimal performance and resource allocation. We utilize knowledge graph embedding (KGE) to acquire context information from the network and service providers. The KGE model is trained using a custom KG and Gaussian embedding model and maps intents to services via service prediction and intent validation processes. The proposed intent lifecycle enables intent translation and assurance by only deploying validated intents according to network and resource availability. We evaluate the trained model for its efficiency in service mapping and intent validation tasks using simulated environments and extensive experiments. The service prediction and intent verification accuracy greater than 80 percent is achieved for the trained KGE model on a custom service orchestration intent knowledge graph (IKG) based on TMForum's intent common model. △ Less

Submitted 13 May, 2024; originally announced May 2024.

Comments: Accepted at WIN 2024 (IEEE NetSoft24)

arXiv:2312.10231 [pdf, other]

Two simple criterion to prove the existence of patterns in reaction-diffusion models of two components

Authors: Francisco J. Vielma-Leal, Miguel A. D. R. Palma, Miguel Montenegro-Concha

Abstract: The aim of this work is to study the effect of diffusion on the stability of the equilibria in a general two-components reaction-diffusion system with Neumann boundary conditions in the space of continuous functions. As by product, we establish sufficient conditions on the diffusive coefficients and other parameters for such a reaction-diffusion model to exhibit patterns and we analyze their stabi… ▽ More The aim of this work is to study the effect of diffusion on the stability of the equilibria in a general two-components reaction-diffusion system with Neumann boundary conditions in the space of continuous functions. As by product, we establish sufficient conditions on the diffusive coefficients and other parameters for such a reaction-diffusion model to exhibit patterns and we analyze their stability. We apply the results obtained in this paper to explore under which parameters values a Turing bifurcation can occur, given rise to non uniform stationary solutions (patterns) for a reaction-diffusion predator-prey model with variable mortality and Hollyn's type II functional response. △ Less

Submitted 15 December, 2023; originally announced December 2023.

Comments: 31 pages, 12 figures

MSC Class: 35K57; 92D25

arXiv:2310.16134 [pdf]

The Evolution from Design to Verification of the Antenna System and Mechanisms in the AcubeSAT mission

Authors: Panagiotis Bountzioukas, Georgios Kikas, Christoforos Tsiolakis, Dimitrios Stoupis, Eleftheria Chatziargyriou, Alkis Hatzopoulos, Vasiliki Kourampa-Gottfroh, Ilektra Karakosta-Amarantidou, Aggelos Mavropoulos, Ioannis-Nikolaos Komis, Afroditi Kita, David Palma, Loris Franchi

Abstract: AcubeSAT is an open-source CubeSat mission aiming to explore the effects of microgravity and radiation on eukaryotic cells using a compact microfluidic LoC platform. It is developed by SpaceDot, a volunteer, interdisciplinary student team at the Aristotle University of Thessaloniki and supported by the "Fly Your Satellite! 3" program of the ESA Education Office. The scientific data of the mission… ▽ More AcubeSAT is an open-source CubeSat mission aiming to explore the effects of microgravity and radiation on eukaryotic cells using a compact microfluidic LoC platform. It is developed by SpaceDot, a volunteer, interdisciplinary student team at the Aristotle University of Thessaloniki and supported by the "Fly Your Satellite! 3" program of the ESA Education Office. The scientific data of the mission is comprised of microscope images captured through the on-board integrated camera setup. As the total size of the payload data is expected to be close to 2GB over 12 months, a fast and efficient downlink fulfilling the restrictive power, cost and complexity budgets is required. Currently, there is no open-source communications system design which fully supports these specific constraints, so we opted to develop our own solutions. The antenna system underwent multiple iterations as the design matured, a process highly aided by the feedback received from the ESA experts. The final communications system configuration consists of an S-band microstrip antenna operating at 2.4GHz and a UHF deployable antenna, for the payload data and TM&TC respectively, both in-house designed. In this paper, we will present AcubeSAT's antenna system iterations that span over 3 years, as well as the rationale and analysis results behind each. The development decisions will be highlighted throughout the paper in an effort to aid in the future development of such a low-cost CubeSat mission communications system. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Comments: 74th International Astronautical Congress

arXiv:2309.04530 [pdf, other]

Cosmological simulations of a momentum coupling between dark matter and quintessence

Authors: Daniela Palma, Graeme N. Candlish

Abstract: Dark energy is frequently modelled as an additional dynamical scalar field component in the Universe, referred to as "quintessence", which drives the late-time acceleration. Furthermore, the quintessence field may be coupled to dark matter and/or baryons, leading to a fifth force. In this paper we explore the consequences for non-linear cosmological structure formation arising from a momentum coup… ▽ More Dark energy is frequently modelled as an additional dynamical scalar field component in the Universe, referred to as "quintessence", which drives the late-time acceleration. Furthermore, the quintessence field may be coupled to dark matter and/or baryons, leading to a fifth force. In this paper we explore the consequences for non-linear cosmological structure formation arising from a momentum coupling between the quintessence field and dark matter only. The coupling leads to a modified Euler equation, which we implement in an N-body cosmological simulation. We then analyse the effects of the coupling on the non-linear power spectrum and the properties of the dark matter halos. We find that, for certain quintessence potentials, a positive coupling can lead to significantly reduced structure on small scales and somewhat enhanced structure on large scales, as well as reduced halo density profiles and increased velocity dispersions. △ Less

Submitted 8 September, 2023; originally announced September 2023.

Comments: 19 pages, 20 figures, accepted by MNRAS

arXiv:2309.03613 [pdf, other]

Evaluating ChatGPT as a Recommender System: A Rigorous Approach

Authors: Dario Di Palma, Giovanni Maria Biancofiore, Vito Walter Anelli, Fedelucio Narducci, Tommaso Di Noia, Eugenio Di Sciascio

Abstract: Large Language Models (LLMs) have recently shown impressive abilities in handling various natural language-related tasks. Among different LLMs, current studies have assessed ChatGPT's superior performance across manifold tasks, especially under the zero/few-shot prompting conditions. Given such successes, the Recommender Systems (RSs) research community have started investigating its potential app… ▽ More Large Language Models (LLMs) have recently shown impressive abilities in handling various natural language-related tasks. Among different LLMs, current studies have assessed ChatGPT's superior performance across manifold tasks, especially under the zero/few-shot prompting conditions. Given such successes, the Recommender Systems (RSs) research community have started investigating its potential applications within the recommendation scenario. However, although various methods have been proposed to integrate ChatGPT's capabilities into RSs, current research struggles to comprehensively evaluate such models while considering the peculiarities of generative models. Often, evaluations do not consider hallucinations, duplications, and out-of-the-closed domain recommendations and solely focus on accuracy metrics, neglecting the impact on beyond-accuracy facets. To bridge this gap, we propose a robust evaluation pipeline to assess ChatGPT's ability as an RS and post-process ChatGPT recommendations to account for these aspects. Through this pipeline, we investigate ChatGPT-3.5 and ChatGPT-4 performance in the recommendation task under the zero-shot condition employing the role-playing prompt. We analyze the model's functionality in three settings: the Top-N Recommendation, the cold-start recommendation, and the re-ranking of a list of recommendations, and in three domains: movies, music, and books. The experiments reveal that ChatGPT exhibits higher accuracy than the baselines on books domain. It also excels in re-ranking and cold-start scenarios while maintaining reasonable beyond-accuracy metrics. Furthermore, we measure the similarity between the ChatGPT recommendations and the other recommenders, providing insights about how ChatGPT could be categorized in the realm of recommender systems. The evaluation pipeline is publicly released for future research. △ Less

Submitted 4 June, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

arXiv:2304.01692 [pdf]

doi 10.1016/j.fusengdes.2023.113590

Lessons learned after three years of SPIDER operation and the first MITICA integrated tests

Authors: D. Marcuzzi, V. Toigo, M. Boldrin, G. Chitarin, S. Dal Bello, L. Grando, A. Luchetta, R. Pasqualotto, M. Pavei, G. Serianni, L. Zanotto, R. Agnello, P. Agostinetti, M. Agostini, D. Aprile, M. Barbisan, M. Battistella, G. Berton, M. Bigi, M. Brombin, V. Candela, V. Candeloro, A. Canton, R. Casagrande, C. Cavallini , et al. (117 additional authors not shown)

Abstract: ITER envisages the use of two heating neutral beam injectors plus an optional one as part of the auxiliary heating and current drive system. The 16.5 MW expected neutral beam power per injector is several notches higher than worldwide existing facilities. A Neutral Beam Test Facility (NBTF) was established at Consorzio RFX, exploiting the synergy of two test beds, SPIDER and MITICA. SPIDER is dedi… ▽ More ITER envisages the use of two heating neutral beam injectors plus an optional one as part of the auxiliary heating and current drive system. The 16.5 MW expected neutral beam power per injector is several notches higher than worldwide existing facilities. A Neutral Beam Test Facility (NBTF) was established at Consorzio RFX, exploiting the synergy of two test beds, SPIDER and MITICA. SPIDER is dedicated to developing and characterizing large efficient negative ion sources at relevant parameters in ITER-like conditions: source and accelerator located in the same vacuum where the beam propagates, immunity to electromagnetic interferences of multiple radio-frequency (RF) antennas, avoidance of RF-induced discharges on the outside of the source. Three years of experiments on SPIDER have addressed to the necessary design modifications to enable full performances. The source is presently under a long shut-down phase to incorporate learnings from the experimental campaign. Parallelly, developments on MITICA, the full-scale prototype of the ITER NBI featuring a 1 MV accelerator and ion neutralization, are underway including manufacturing of in-vessel components, while power supplies and auxiliary plants are already under final testing and commissioning. Integration, commissioning and tests of the 1MV power supplies are essential for this first-of-kind system, unparalleled both in research and industry field. The integrated test to confirm 1MV output by combining invertor systems, DC generators and transmission lines extracted errors/accidents in some components. To realize a concrete system for ITER, solutions for the repair and the improvement of the system were developed. Hence, NBTF is emerging as a necessary facility, due to the large gap with existing injectors, effectively dedicated to identify issues and find solutions to enable successful ITER NBI operations in a time bound fashion. △ Less

Submitted 4 April, 2023; originally announced April 2023.

Journal ref: Fusion Engineering and Design 191 (2023) 113590

arXiv:2302.08544 [pdf, other]

Knowledge-based Intent Modeling for Next Generation Cellular Networks

Authors: Kashif Mehmood, Katina Kralevska, David Palma

Abstract: Intent-based networking (IBN) facilitates the representation of consumer expectations in a declarative and domain-independent form. However, mapping intents to service and resource models remains an open challenge. IBN requires handling existing system data in a structured yet flexible structure way. Knowledge graphs provide an efficient conceptual framework for constructing contexts and organizin… ▽ More Intent-based networking (IBN) facilitates the representation of consumer expectations in a declarative and domain-independent form. However, mapping intents to service and resource models remains an open challenge. IBN requires handling existing system data in a structured yet flexible structure way. Knowledge graphs provide an efficient conceptual framework for constructing contexts and organizing known information. We utilize knowledge graphs to construct a knowledge-based for modeling of intents in the networking domain. In addition, this work also proposes a knowledge-based intent modeling and processing methodology, extending the standardized intent common model proposed by TM Forum for next-generation cellular networks and services. The proposed knowledge-based IBN approach is demonstrated for next-generation cellular services, validating its potential. △ Less

Submitted 24 July, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

Comments: Accepted at MeditCom 2023

arXiv:2205.03932 [pdf, other]

Mission-Critical Public Safety Networking: An Intent-Driven Service Orchestration Perspective

Authors: Kashif Mehmood, David Palma, Katina Kralevska

Abstract: Intent-based networking (IBN) provides a promising approach for managing networks and orchestrating services in beyond 5G (B5G) deployments using modern service-based architectures. Public safety (PS) services form the basis of keeping society functional, owing to the responsiveness and availability throughout the network. The provisioning of these services requires efficient and agile network man… ▽ More Intent-based networking (IBN) provides a promising approach for managing networks and orchestrating services in beyond 5G (B5G) deployments using modern service-based architectures. Public safety (PS) services form the basis of keeping society functional, owing to the responsiveness and availability throughout the network. The provisioning of these services requires efficient and agile network management techniques with low-overhead and embedded intelligence. IBN incorporates the service subscribers in a model-driven approach to provision different user-centric services. However, it requires domain-specific and contextual processing of intents for abstracted management of network functions. This work proposes an intent definition for PS and mission critical (MC) services in beyond B5G networks, as well as a processing and orchestration architecture on top of MC push-to-talk (PTT) use case. The simulation results show that MC PTT services adhere to the key performance indicators of access time and mouth-to-ear latency bounded by approximately 250 and 150 milliseconds, respectively, with an additional overhead experienced during the intent processing in the range of 20- 40 milliseconds. This validates the premise of IBN in providing flexible and scalable management and service orchestration solution for PS next generation networks. △ Less

Submitted 8 May, 2022; originally announced May 2022.

Comments: Accepted for Publication at WIN2022 (under IEEE NetSoft 2022 Conference)

arXiv:2204.05226 [pdf, other]

doi 10.1126/science.abm3231

A Gamma-ray Pulsar Timing Array Constrains the Nanohertz Gravitational Wave Background

Authors: M. Ajello, W. B. Atwood, L. Baldini, J. Ballet, G. Barbiellini, D. Bastieri, R. Bellazzini, A. Berretta, B. Bhattacharyya, E. Bissaldi, R. D. Blandford, E. Bloom, R. Bonino, P. Bruel, R. Buehler, E. Burns, S. Buson, R. A. Cameron, P. A. Caraveo, E. Cavazzuti, N. Cibrario, S. Ciprini, C. J. Clark, I. Cognard, J. Coronado-Blázquez , et al. (107 additional authors not shown)

Abstract: After large galaxies merge, their central supermassive black holes are expected to form binary systems whose orbital motion generates a gravitational wave background (GWB) at nanohertz frequencies. Searches for this background utilize pulsar timing arrays, which perform long-term monitoring of millisecond pulsars (MSPs) at radio wavelengths. We use 12.5 years of Fermi Large Area Telescope data to… ▽ More After large galaxies merge, their central supermassive black holes are expected to form binary systems whose orbital motion generates a gravitational wave background (GWB) at nanohertz frequencies. Searches for this background utilize pulsar timing arrays, which perform long-term monitoring of millisecond pulsars (MSPs) at radio wavelengths. We use 12.5 years of Fermi Large Area Telescope data to form a gamma-ray pulsar timing array. Results from 35 bright gamma-ray pulsars place a 95\% credible limit on the GWB characteristic strain of $1.0\times10^{-14}$ at 1 yr$^{-1}$, which scales as the observing time span $t_{\mathrm{obs}}^{-13/6}$. This direct measurement provides an independent probe of the GWB while offering a check on radio noise models. △ Less

Submitted 11 April, 2022; originally announced April 2022.

Comments: 3 figures in the main text. 3 figures and 8 tables are in the supplementary material

arXiv:2111.11807 [pdf, other]

RepoMiner: a Language-agnostic Python Framework to Mine Software Repositories for Defect Prediction

Authors: Stefano Dalla Palma, Dario Di Nucci, Damian Tamburri

Abstract: Data originating from open-source software projects provide valuable information to enhance software quality. In the scope of Software Defect Prediction, one of the most challenging parts is extracting valid data about failure-prone software components from these repositories, which can help develop more robust software. In particular, collecting data, calculating metrics, and synthesizing results… ▽ More Data originating from open-source software projects provide valuable information to enhance software quality. In the scope of Software Defect Prediction, one of the most challenging parts is extracting valid data about failure-prone software components from these repositories, which can help develop more robust software. In particular, collecting data, calculating metrics, and synthesizing results from these repositories is a tedious and error-prone task, which often requires understanding the programming languages involved in the mined repositories, eventually leading to a proliferation of language-specific data-mining software. This paper presents RepoMiner, a language-agnostic tool developed to support software engineering researchers in creating datasets to support any study on defect prediction. RepoMiner automatically collects failure data from software components, labels them as failure-prone or neutral, and calculates metrics to be used as ground truth for defect prediction models. We present its implementation and provide examples of its application. △ Less

Submitted 23 November, 2021; originally announced November 2021.

arXiv:2108.04560 [pdf]

doi 10.1016/j.comnet.2022.109477

Intent-driven autonomous network and service management in future cellular networks: A structured literature review

Authors: Kashif Mehmood, Katina Kralevska, David Palma

Abstract: Intent-driven networks are an essential stepping stone in the evolution of network and service management towards a truly autonomous paradigm. User centric intents provide an abstracted means of impacting the design, provisioning, deployment and assurance of network infrastructure and services with the help of service level agreements and minimum network capability exposure. The concept of Intent… ▽ More Intent-driven networks are an essential stepping stone in the evolution of network and service management towards a truly autonomous paradigm. User centric intents provide an abstracted means of impacting the design, provisioning, deployment and assurance of network infrastructure and services with the help of service level agreements and minimum network capability exposure. The concept of Intent Based Networking (IBN) poses several challenges in terms of the contextual definition of intents, role of different stakeholders, and a generalized architecture. In this review, we provide a comprehensive analysis of the state-of-the-art in IBN including the intent description models, intent lifecycle management, significance of IBN and a generalized architectural framework along with challenges and prospects for IBN in future cellular networks. An analytical study is performed on the data collected from relevant studies primarily focusing on the inter-working of IBN with softwarized networking based on NFV/SDN infrastructures. Critical functions required in the IBN management and service model design are explored with different abstract modeling techniques and a converged architectural framework is proposed. The key findings include: (1) benefits and role of IBN in autonomous networking, (2) improvements needed to integrate intents as fundamental policies for service modeling and network management, (3) need for appropriate representation models for intents in domain agnostic abstract manner, and (4) need to include learning as a fundamental function in autonomous networks. These observations provide the basis for in-depth investigation and standardization efforts for IBN as a fundamental network management paradigm in beyond 5G cellular networks. △ Less

Submitted 9 May, 2023; v1 submitted 10 August, 2021; originally announced August 2021.

arXiv:2101.12644 [pdf, other]

5G Network Slicing for Wi-Fi Networks

Authors: Matteo Nerini, David Palma

Abstract: Future networks will pave the way for a myriad of applications with different requirements and Wi-Fi will play an important role in local area networks. This is why network slicing is proposed by 5G networks, allowing to offer multiple logical networks tailored to the different user requirements, over a common infrastructure. However, this is not supported by current Wi-Fi networks. In this paper,… ▽ More Future networks will pave the way for a myriad of applications with different requirements and Wi-Fi will play an important role in local area networks. This is why network slicing is proposed by 5G networks, allowing to offer multiple logical networks tailored to the different user requirements, over a common infrastructure. However, this is not supported by current Wi-Fi networks. In this paper, we propose a standard-compliant network slicing approach for the radio access segment of Wi-Fi by defining multiple Service Set Identifiers (SSIDs) per Access Point (AP). We present two algorithms, one that assigns resources according to the requirements of slices in a static way, and another that dynamically configures the slices according to the network's conditions and relevant Key Performance Indicators (KPIs). The proposed algorithms were validated through extensive simulations, conducted in the ns-3 network simulator, and complemented by theoretical assessments. The obtained results reveal that the two proposed slicing approaches outperform today's Wi-Fi access technique, reaching lower error probability for bandwidth intensive slices and lower latency for time-critical slices. Simultaneously, the proposed approach is up to 32 times more energy efficient, when considering slices tailored for low-power and low-bandwidth devices, while increasing the overall spectrum efficiency. △ Less

Submitted 29 January, 2021; originally announced January 2021.

Comments: 9 pages, 8 figures, to be published in the 17th IFIP/IEEE International Symposium on Integrated Network Management (IM 2021)

arXiv:2009.10801 [pdf, ps, other]

DeepIaC: Deep Learning-Based Linguistic Anti-pattern Detection in IaC

Authors: Nemania Borovits, Indika Kumara, Parvathy Krishnan, Stefano Dalla Palma, Dario Di Nucci, Fabio Palomba, Damian A. Tamburri, Willem-Jan van den Heuvel

Abstract: Linguistic anti-patterns are recurring poor practices concerning inconsistencies among the naming, documentation, and implementation of an entity. They impede readability, understandability, and maintainability of source code. This paper attempts to detect linguistic anti-patterns in infrastructure as code (IaC) scripts used to provision and manage computing environments. In particular, we conside… ▽ More Linguistic anti-patterns are recurring poor practices concerning inconsistencies among the naming, documentation, and implementation of an entity. They impede readability, understandability, and maintainability of source code. This paper attempts to detect linguistic anti-patterns in infrastructure as code (IaC) scripts used to provision and manage computing environments. In particular, we consider inconsistencies between the logic/body of IaC code units and their names. To this end, we propose a novel automated approach that employs word embeddings and deep learning techniques. We build and use the abstract syntax tree of IaC code units to create their code embedments. Our experiments with a dataset systematically extracted from open source repositories show that our approach yields an accuracy between0.785and0.915in detecting inconsistencies △ Less

Submitted 22 September, 2020; originally announced September 2020.

Comments: 6 pages

arXiv:2007.12283 [pdf, other]

Blockchain and Cryptocurrencies: a Classification and Comparison of Architecture Drivers

Authors: Martin Garriga, Stefano Dalla Palma, Maximiliano Arias, Alan De Renzis, Remo Pareschi, Damian Andrew Tamburri

Abstract: Blockchain is a decentralized transaction and data management solution, the technological leap behind the success of Bitcoin and other cryptocurrencies. As the variety of existing blockchains and distributed ledgers continues to increase, adopters should focus on selecting the solution that best fits their needs and the requirements of their decentralized applications, rather than developing yet a… ▽ More Blockchain is a decentralized transaction and data management solution, the technological leap behind the success of Bitcoin and other cryptocurrencies. As the variety of existing blockchains and distributed ledgers continues to increase, adopters should focus on selecting the solution that best fits their needs and the requirements of their decentralized applications, rather than developing yet another blockchain from scratch. In this paper we present a conceptual framework to aid software architects, developers, and decision makers to adopt the right blockchain technology. The framework exposes the interrelation between technological decisions and architectural features, capturing the knowledge from existing academic literature, industrial products, technical forums/blogs, and experts' feedback. We empirically show the applicability of our framework by dissecting the platforms behind Bitcoin and other top 10 cryptocurrencies, aided by a focus group with researchers and industry practitioners. Then, we leverage the framework together with key notions of the Architectural Tradeoff Analysis Method (ATAM) to analyze four real-world blockchain case studies from industry and academia. Results shown that applying our framework leads to a deeper understanding of the architectural tradeoffs, allowing to assess technologies more objectively and select the one that best fit developers needs, ultimately cutting costs, reducing time-to-market and accelerating return on investment. △ Less

Submitted 23 July, 2020; originally announced July 2020.

Comments: Accepted for publication at journal Concurrency and Computation: Practice and Experience. Special Issue on distributed large scale applications and environments

arXiv:2007.08980 [pdf, other]

doi 10.1038/s41598-020-79463-z

Lightning optimizes: a threshold mechanism ensures minimum-path flow

Authors: Franco Blanchini, Daniele Casagrande, Filippo Fabiani, Giulia Giordano, David Palma, Raffaele Pesenti

Abstract: A well-known property of linear resistive electrical networks is that the current distribution minimizes the total dissipated energy. When the circuit includes resistors with nonlinear monotonic characteristic, the current distribution minimizes in general a different functional. We show that, if the nonlinear characteristic is a threshold-like function and the energy generator is concentrated in… ▽ More A well-known property of linear resistive electrical networks is that the current distribution minimizes the total dissipated energy. When the circuit includes resistors with nonlinear monotonic characteristic, the current distribution minimizes in general a different functional. We show that, if the nonlinear characteristic is a threshold-like function and the energy generator is concentrated in a single point, as in the case of lightning or dielectric discharge, then the current flow is concentrated along a single path, which is a minimum path to the ground with respect to the threshold. We also propose a dynamic model that explains and qualitatively reproduces the lightning transient behaviour: initial generation of several plasma branches and subsequent dismissal of all branches but the one reaching the ground first, which is the optimal one. △ Less

Submitted 8 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

arXiv:2005.13474 [pdf, other]

Towards a Catalogue of Software Quality Metrics for Infrastructure Code

Authors: Stefano Dalla Palma, Dario Di Nucci, Fabio Palomba, Damian A. Tamburri

Abstract: Infrastructure-as-code (IaC) is a practice to implement continuous deployment by allowing management and provisioning of infrastructure through the definition of machine-readable files and automation around them, rather than physical hardware configuration or interactive configuration tools. On the one hand, although IaC represents an ever-increasing widely adopted practice nowadays, still little… ▽ More Infrastructure-as-code (IaC) is a practice to implement continuous deployment by allowing management and provisioning of infrastructure through the definition of machine-readable files and automation around them, rather than physical hardware configuration or interactive configuration tools. On the one hand, although IaC represents an ever-increasing widely adopted practice nowadays, still little is known concerning how to best maintain, speedily evolve, and continuously improve the code behind the IaC practice in a measurable fashion. On the other hand, source code measurements are often computed and analyzed to evaluate the different quality aspects of the software developed. However, unlike general-purpose programming languages (GPLs), IaC scripts use domain-specific languages, and metrics used for GPLs may not be applicable for IaC scripts. This article proposes a catalogue consisting of 46 metrics to identify IaC properties focusing on Ansible, one of the most popular IaC language to date, and shows how they can be used to analyze IaC scripts. △ Less

Submitted 7 July, 2020; v1 submitted 27 May, 2020; originally announced May 2020.

arXiv:1410.7764 [pdf, other]

doi 10.1088/0264-9381/32/11/115012

Characterization of the LIGO detectors during their sixth science run

Authors: The LIGO Scientific Collaboration, The Virgo Collaboration, J. Aasi, J. Abadie, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, T. Accadia, F. Acernese, C. Adams, T. Adams, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, P. Ajith, B. Allen, A. Allocca, E. Amador. Ceron, D. Amariutei, R. A. Anderson, S. B. Anderson, W. G. Anderson , et al. (846 additional authors not shown)

Abstract: In 2009-2010, the Laser Interferometer Gravitational-wave Observa- tory (LIGO) operated together with international partners Virgo and GEO600 as a network to search for gravitational waves of astrophysical origin. The sensitiv- ity of these detectors was limited by a combination of noise sources inherent to the instrumental design and its environment, often localized in time or frequency, that cou… ▽ More In 2009-2010, the Laser Interferometer Gravitational-wave Observa- tory (LIGO) operated together with international partners Virgo and GEO600 as a network to search for gravitational waves of astrophysical origin. The sensitiv- ity of these detectors was limited by a combination of noise sources inherent to the instrumental design and its environment, often localized in time or frequency, that couple into the gravitational-wave readout. Here we review the performance of the LIGO instruments during this epoch, the work done to characterize the de- tectors and their data, and the effect that transient and continuous noise artefacts have on the sensitivity of LIGO to a variety of astrophysical sources. △ Less

Submitted 18 November, 2014; v1 submitted 28 October, 2014; originally announced October 2014.

Comments: 31 pages, 13 figures

arXiv:1410.6211 [pdf, ps, other]

doi 10.1103/PhysRevD.91.022003

Searching for stochastic gravitational waves using data from the two co-located LIGO Hanford detectors

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, J. Aasi, J. Abadie, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, T. Accadia, F. Acernese, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, P. Ajith, B. Allen, A. Allocca, E. Amado. Ceron, D. Amariutei, R. A. Anderson, S. B. Anderson , et al. (852 additional authors not shown)

Abstract: Searches for a stochastic gravitational-wave background (SGWB) using terrestrial detectors typically involve cross-correlating data from pairs of detectors. The sensitivity of such cross-correlation analyses depends, among other things, on the separation between the two detectors: the smaller the separation, the better the sensitivity. Hence, a co-located detector pair is more sensitive to a gravi… ▽ More Searches for a stochastic gravitational-wave background (SGWB) using terrestrial detectors typically involve cross-correlating data from pairs of detectors. The sensitivity of such cross-correlation analyses depends, among other things, on the separation between the two detectors: the smaller the separation, the better the sensitivity. Hence, a co-located detector pair is more sensitive to a gravitational-wave background than a non-co-located detector pair. However, co-located detectors are also expected to suffer from correlated noise from instrumental and environmental effects that could contaminate the measurement of the background. Hence, methods to identify and mitigate the effects of correlated noise are necessary to achieve the potential increase in sensitivity of co-located detectors. Here we report on the first SGWB analysis using the two LIGO Hanford detectors and address the complications arising from correlated environmental noise. We apply correlated noise identification and mitigation techniques to data taken by the two LIGO Hanford detectors, H1 and H2, during LIGO's fifth science run. At low frequencies, 40 - 460 Hz, we are unable to sufficiently mitigate the correlated noise to a level where we may confidently measure or bound the stochastic gravitational-wave signal. However, at high frequencies, 460-1000 Hz, these techniques are sufficient to set a $95%$ confidence level (C.L.) upper limit on the gravitational-wave energy density of Ω(f)<7.7 x 10^{-4} (f/ 900 Hz)^3, which improves on the previous upper limit by a factor of $\sim 180$. In doing so, we demonstrate techniques that will be useful for future searches using advanced detectors, where correlated noise (e.g., from global magnetic fields) may affect even widely separated detectors. △ Less

Submitted 2 December, 2014; v1 submitted 22 October, 2014; originally announced October 2014.

Comments: 21 pages, 10 figures, 5 tables

Journal ref: Phys. Rev. D 91, 022003 (2015)

arXiv:1406.1449 [pdf, other]

doi 10.1371/journal.pcbi.1004152

Predicting epidemic risk from past temporal contact data

Authors: Eugenio Valdano, Chiara Poletto, Armando Giovannini, Diana Palma, Lara Savini, Vittoria Colizza

Abstract: Understanding how epidemics spread in a system is a crucial step to prevent and control outbreaks, with broad implications on the system's functioning, health, and associated costs. This can be achieved by identifying the elements at higher risk of infection and implementing targeted surveillance and control measures. One important ingredient to consider is the pattern of disease-transmission cont… ▽ More Understanding how epidemics spread in a system is a crucial step to prevent and control outbreaks, with broad implications on the system's functioning, health, and associated costs. This can be achieved by identifying the elements at higher risk of infection and implementing targeted surveillance and control measures. One important ingredient to consider is the pattern of disease-transmission contacts among the elements, however lack of data or delays in providing updated records may hinder its use, especially for time-varying patterns. Here we explore to what extent it is possible to use past temporal data of a system's pattern of contacts to predict the risk of infection of its elements during an emerging outbreak, in absence of updated data. We focus on two real-world temporal systems; a livestock displacements trade network among animal holdings, and a network of sexual encounters in high-end prostitution. We define the node's loyalty as a local measure of its tendency to maintain contacts with the same elements over time, and uncover important non-trivial correlations with the node's epidemic risk. We show that a risk assessment analysis incorporating this knowledge and based on past structural and temporal pattern properties provides accurate predictions for both systems. Its generalizability is tested by introducing a theoretical model for generating synthetic temporal networks. High accuracy of our predictions is recovered across different settings, while the amount of possible predictions is system-specific. The proposed method can provide crucial information for the setup of targeted intervention strategies. △ Less

Submitted 13 March, 2015; v1 submitted 5 June, 2014; originally announced June 2014.

Comments: 24 pages, 5 figures + SI (18 pages, 15 figures)

Journal ref: Valdano E, Poletto C, Giovannini A, Palma D, Savini L, et al. (2015) Predicting Epidemic Risk from Past Temporal Contact Data. PLoS Comput Biol 11(3): e1004152. doi:10.1371/journal.pcbi.1004152

arXiv:1310.6866 [pdf]

doi 10.1007/978-3-319-04639-6_20

Novel Scintillating Materials Based on Phenyl-Polysiloxane for Neutron Detection and Monitoring

Authors: M. Degerlier, S. Carturan, F. Gramegna, T. Marchi, M. Dalla Palma, M. Cinausero, G. Maggioni, A. Quaranta, G. Collazuol, J. Bermudez

Abstract: Neutron detectors are extensively used at many nuclear research facilities across Europe. Their application range covers many topics in basic and applied nuclear research: in nuclear structure and reaction dynamics (reaction reconstruction and decay studies); in nuclear astrophysics (neutron emission probabilities); in nuclear technology (nuclear data measurements and in-core/off-core monitors); i… ▽ More Neutron detectors are extensively used at many nuclear research facilities across Europe. Their application range covers many topics in basic and applied nuclear research: in nuclear structure and reaction dynamics (reaction reconstruction and decay studies); in nuclear astrophysics (neutron emission probabilities); in nuclear technology (nuclear data measurements and in-core/off-core monitors); in nuclear medicine (radiation monitors, dosimeters); in materials science (neutron imaging techniques); in homeland security applications (fissile materials investigation and cargo inspection). Liquid scintillators, widely used at present, have however some drawbacks given by toxicity, flammability, volatility and sensitivity to oxygen that limit their duration and quality. Even plastic scintillators are not satisfactory because they have low radiation hardness and low thermal stability. Moreover organic solvents may affect their optical properties due to crazing. In order to overcome these problems, phenyl-polysiloxane based scintillators have been recently developed at Legnaro National Laboratory. This new solution showed very good chemical and thermal stability and high radiation hardness. The results on the different samples performance will be presented, paying special attention to a characterization comparison between synthesized phenyl containing polysiloxane resins where a Pt catalyst has been used and a scintillating material obtained by condensation reaction, where tin based compounds are used as catalysts. Different structural arrangements as a result of different substituents on the main chain have been investigated by High Resolution X-Ray Diffraction, while the effect of improved optical transmittance on the scintillation yield has been elucidated by a combination of excitation/fluorescence measurements and scintillation yield under exposure to alpha and γ-rays. △ Less

Submitted 25 October, 2013; originally announced October 2013.

Comments: InterM 2013 - International Multidisciplinary Microscopy Congress

Showing 1–26 of 26 results for author: Palma, D