-
Are the Hidden States Hiding Something? Testing the Limits of Factuality-Encoding Capabilities in LLMs
Authors:
Giovanni Servedio,
Alessandro De Bellis,
Dario Di Palma,
Vito Walter Anelli,
Tommaso Di Noia
Abstract:
Factual hallucinations are a major challenge for Large Language Models (LLMs). They undermine reliability and user trust by generating inaccurate or fabricated content. Recent studies suggest that when generating false statements, the internal states of LLMs encode information about truthfulness. However, these studies often rely on synthetic datasets that lack realism, which limits generalization…
▽ More
Factual hallucinations are a major challenge for Large Language Models (LLMs). They undermine reliability and user trust by generating inaccurate or fabricated content. Recent studies suggest that when generating false statements, the internal states of LLMs encode information about truthfulness. However, these studies often rely on synthetic datasets that lack realism, which limits generalization when evaluating the factual accuracy of text generated by the model itself. In this paper, we challenge the findings of previous work by investigating truthfulness encoding capabilities, leading to the generation of a more realistic and challenging dataset. Specifically, we extend previous work by introducing: (1) a strategy for sampling plausible true-false factoid sentences from tabular data and (2) a procedure for generating realistic, LLM-dependent true-false datasets from Question Answering collections. Our analysis of two open-source LLMs reveals that while the findings from previous studies are partially validated, generalization to LLM-generated datasets remains challenging. This study lays the groundwork for future research on factuality in LLMs and offers practical guidelines for more effective evaluation.
△ Less
Submitted 30 May, 2025; v1 submitted 22 May, 2025;
originally announced May 2025.
-
LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing
Authors:
Dario Di Palma,
Alessandro De Bellis,
Giovanni Servedio,
Vito Walter Anelli,
Fedelucio Narducci,
Tommaso Di Noia
Abstract:
Large Language Models (LLMs) have rapidly become central to NLP, demonstrating their ability to adapt to various tasks through prompting techniques, including sentiment analysis. However, we still have a limited understanding of how these models capture sentiment-related information. This study probes the hidden layers of Llama models to pinpoint where sentiment features are most represented and t…
▽ More
Large Language Models (LLMs) have rapidly become central to NLP, demonstrating their ability to adapt to various tasks through prompting techniques, including sentiment analysis. However, we still have a limited understanding of how these models capture sentiment-related information. This study probes the hidden layers of Llama models to pinpoint where sentiment features are most represented and to assess how this affects sentiment analysis.
Using probe classifiers, we analyze sentiment encoding across layers and scales, identifying the layers and pooling methods that best capture sentiment signals. Our results show that sentiment information is most concentrated in mid-layers for binary polarity tasks, with detection accuracy increasing up to 14% over prompting techniques. Additionally, we find that in decoder-only models, the last token is not consistently the most informative for sentiment encoding. Finally, this approach enables sentiment tasks to be performed with memory requirements reduced by an average of 57%.
These insights contribute to a broader understanding of sentiment in LLMs, suggesting layer-specific probing as an effective approach for sentiment tasks beyond prompting, with potential to enhance model utility and reduce memory requirements.
△ Less
Submitted 30 May, 2025; v1 submitted 22 May, 2025;
originally announced May 2025.
-
Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M
Authors:
Dario Di Palma,
Felice Antonio Merra,
Maurizio Sfilio,
Vito Walter Anelli,
Fedelucio Narducci,
Tommaso Di Noia
Abstract:
Large Language Models (LLMs) have become increasingly central to recommendation scenarios due to their remarkable natural language understanding and generation capabilities. Although significant research has explored the use of LLMs for various recommendation tasks, little effort has been dedicated to verifying whether they have memorized public recommendation dataset as part of their training dat…
▽ More
Large Language Models (LLMs) have become increasingly central to recommendation scenarios due to their remarkable natural language understanding and generation capabilities. Although significant research has explored the use of LLMs for various recommendation tasks, little effort has been dedicated to verifying whether they have memorized public recommendation dataset as part of their training data. This is undesirable because memorization reduces the generalizability of research findings, as benchmarking on memorized datasets does not guarantee generalization to unseen datasets. Furthermore, memorization can amplify biases, for example, some popular items may be recommended more frequently than others.
In this work, we investigate whether LLMs have memorized public recommendation datasets. Specifically, we examine two model families (GPT and Llama) across multiple sizes, focusing on one of the most widely used dataset in recommender systems: MovieLens-1M. First, we define dataset memorization as the extent to which item attributes, user profiles, and user-item interactions can be retrieved by prompting the LLMs. Second, we analyze the impact of memorization on recommendation performance. Lastly, we examine whether memorization varies across model families and model sizes. Our results reveal that all models exhibit some degree of memorization of MovieLens-1M, and that recommendation performance is related to the extent of memorization. We have made all the code publicly available at: https://github.com/sisinflab/LLM-MemoryInspector
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
Assessing the connection between galactic conformity and assembly-type bias
Authors:
Ivan Lacerna,
Nelson Padilla,
Daniela Palma
Abstract:
Context. Galaxies in the Universe show a conformity in the fraction of quenched galaxies out to large distances, being quite larger around quenched central galaxies than for star-forming ones. On the other hand, simulations have shown that the clustering of halos and the galaxies within them depends on secondary properties other than halo mass, a phenomenon termed assembly bias. Aims. Our aim is t…
▽ More
Context. Galaxies in the Universe show a conformity in the fraction of quenched galaxies out to large distances, being quite larger around quenched central galaxies than for star-forming ones. On the other hand, simulations have shown that the clustering of halos and the galaxies within them depends on secondary properties other than halo mass, a phenomenon termed assembly bias. Aims. Our aim is to study whether samples that show galactic conformity also show assembly bias and to see if the amplitude of these two effects is correlated. Methods. We use synthetic galaxies at $z = 0$ from the semi-analytical model SAG run on the MultiDark Planck 2 (MDPL2) cosmological simulation and measure both conformity and galaxy assembly bias for different samples of central galaxies at fixed host halo mass. We focus on central galaxies hosted by low-mass halos of 10$^{11.6}$ $\leq$ $M_{\rm h}$/$h^{-1}$ M$_{\odot}$ $<$ 10$^{11.8}$ because it is a mass range where the assembly bias has been reported to be strong. The samples of central galaxies are separated according to their specific star formation rate and stellar age. Results. We find that the level of conformity shown by our different samples is correlated with the level of assembly bias measured for them. We also find that removing galaxies around massive halos diminishes the conformity signal and lowers the amount of assembly bias. Conclusions. The high correlation in the amplitude of conformity and assembly bias for different samples with and without removing galaxies near massive halos clearly indicates the strong relationship between both phenomena.
△ Less
Submitted 6 May, 2025;
originally announced May 2025.
-
The On-Board Computer of the AcubeSAT Mission
Authors:
Konstantinos Tsoupos,
Stylianos Tzelepis,
Georgios Sklavenitis,
Dimitrios Stoupis,
Grigorios Pavlakis,
Panagiotis Bountzioukas,
Christina Athanasiadou,
Lily Ha,
David Palma,
Loris Franchi,
Alkis Hatzopoulos
Abstract:
AcubeSAT is an open-source CubeSat mission aiming to explore the effects of microgravity and radiation on eukaryotic cells using a compact microfluidic lab-on-a-chip platform. It is developed by SpaceDot, a volunteer, interdisciplinary student team at the Aristotle University of Thessaloniki and supported by the "Fly Your Satellite! 3" program of the European Space Agency (ESA) Education Office.…
▽ More
AcubeSAT is an open-source CubeSat mission aiming to explore the effects of microgravity and radiation on eukaryotic cells using a compact microfluidic lab-on-a-chip platform. It is developed by SpaceDot, a volunteer, interdisciplinary student team at the Aristotle University of Thessaloniki and supported by the "Fly Your Satellite! 3" program of the European Space Agency (ESA) Education Office.
The nanosatellite features an in-house designed on-board computer subsystem responsible for telecommand execution, telemetry fetching, onboard time synchronization, in-orbit patching, and fault recovery. The subsystem is designed on one PC/104 standard compatible Printed Circuit Board (PCB) that hosts the On-board Computer (OBC) on the one side and the Attitude and Orbit Control Subsystem (AOCS) on the other, and it is compatible with the LibreCube standard. The hosted subsystems are functionally isolated and feature an ARM Cortex-M7, radiation-tolerant microcontroller each.
Before sending anything to space thorough testing is required and specifically the on-board computer board underwent vibration and thermal cycling tests to ensure nominal operation in all conditions.
This paper aims to elucidate the decision-making process, design iterations, and development stages of the custom board and accompanying in-house software. Insights garnered from the initial partially successful environmental test campaign at the ESA CubeSat Support Facility will be shared, along with the ensuing preparations, results, and lessons learned from subsequent testing endeavors in April 2024. Furthermore, the current developmental status will be discussed alongside future electromagnetic compatibility testing, integration plan on a FlatSat, and prospects for the open-source design as a cost-effective, and modular solution that can be tailored with little effort for upcoming missions.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
The evolution of low-mass central galaxies in the vicinity of massive structures and its impact on the two-halo conformity
Authors:
Daniela Palma,
Ivan Lacerna,
M. Celeste Artale,
Antonio D. Montero-Dorta,
Andrés N. Ruiz,
Sofía A. Cora,
Facundo Rodriguez,
Diego Pallero,
Ana O'Mill,
Nelvy Choque-Challapa
Abstract:
We investigated the population of low-mass central galaxies with Mstar = $10^{9.5}-10^{10}$ Msun/h, inhabiting regions near massive groups and clusters of galaxies using the TNG300 and MDPL2-SAG simulations. We set out to study their evolutionary histories, aiming to find hints about the large-scale conformity signal they produce. We also used a control sample of central galaxies with the same ste…
▽ More
We investigated the population of low-mass central galaxies with Mstar = $10^{9.5}-10^{10}$ Msun/h, inhabiting regions near massive groups and clusters of galaxies using the TNG300 and MDPL2-SAG simulations. We set out to study their evolutionary histories, aiming to find hints about the large-scale conformity signal they produce. We also used a control sample of central galaxies with the same stellar mass range located far away from massive structures. For both samples, we find a subpopulation of galaxies accreted by another halo in the past, but now considered central galaxies; we refer to these objects as former satellites. The number of former satellites is higher for quenched central galaxies near massive systems, with fractions of 45% and 17% in TNG300 and MDPL2-SAG. Our results in TNG300 show that former satellites pollute the sample of central galaxies because they suffered environmental processes when they were satellites hosted typically by massive dark matter halos (M200 $\geq 10^{13}$ Msun/h) since z$\lesssim$0.5. After removing former satellites, the evolutionary trends for quenched central galaxies near massive structures are fairly similar to those of the quenched control galaxies, showing small differences at low redshift. For MDPL2-SAG instead, former satellites were hosted by less massive halos, with a mean halo mass around $10^{11.4}$ Msun/h, and the evolutionary trends remain equal before and after removing former satellite galaxies. We also measured the two-halo conformity, i.e, the correlation in the sSFR between low-mass central galaxies and their neighbors at Mpc scales, and how former satellites contribute to the signal at three different redshifts: z=0, 0.3, and 1. The time evolution of the conformity signal in the simulations presents apparent contradictory results: it decreases from z=0 to z=1 in MDPL2-SAG, while it increases in TNG300 (abridged).
△ Less
Submitted 27 January, 2025; v1 submitted 18 June, 2024;
originally announced June 2024.
-
Knowledge Graph Embedding in Intent-Based Networking
Authors:
Kashif Mehmood,
Katina Kralevska,
David Palma
Abstract:
This paper presents a novel approach to network management by integrating intent-based networking (IBN) with knowledge graphs (KGs), creating a more intuitive and efficient pipeline for service orchestration. By mapping high-level business intents onto network configurations using KGs, the system dynamically adapts to network changes and service demands, ensuring optimal performance and resource a…
▽ More
This paper presents a novel approach to network management by integrating intent-based networking (IBN) with knowledge graphs (KGs), creating a more intuitive and efficient pipeline for service orchestration. By mapping high-level business intents onto network configurations using KGs, the system dynamically adapts to network changes and service demands, ensuring optimal performance and resource allocation. We utilize knowledge graph embedding (KGE) to acquire context information from the network and service providers. The KGE model is trained using a custom KG and Gaussian embedding model and maps intents to services via service prediction and intent validation processes. The proposed intent lifecycle enables intent translation and assurance by only deploying validated intents according to network and resource availability. We evaluate the trained model for its efficiency in service mapping and intent validation tasks using simulated environments and extensive experiments. The service prediction and intent verification accuracy greater than 80 percent is achieved for the trained KGE model on a custom service orchestration intent knowledge graph (IKG) based on TMForum's intent common model.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Two simple criterion to prove the existence of patterns in reaction-diffusion models of two components
Authors:
Francisco J. Vielma-Leal,
Miguel A. D. R. Palma,
Miguel Montenegro-Concha
Abstract:
The aim of this work is to study the effect of diffusion on the stability of the equilibria in a general two-components reaction-diffusion system with Neumann boundary conditions in the space of continuous functions. As by product, we establish sufficient conditions on the diffusive coefficients and other parameters for such a reaction-diffusion model to exhibit patterns and we analyze their stabi…
▽ More
The aim of this work is to study the effect of diffusion on the stability of the equilibria in a general two-components reaction-diffusion system with Neumann boundary conditions in the space of continuous functions. As by product, we establish sufficient conditions on the diffusive coefficients and other parameters for such a reaction-diffusion model to exhibit patterns and we analyze their stability. We apply the results obtained in this paper to explore under which parameters values a Turing bifurcation can occur, given rise to non uniform stationary solutions (patterns) for a reaction-diffusion predator-prey model with variable mortality and Hollyn's type II functional response.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
The Evolution from Design to Verification of the Antenna System and Mechanisms in the AcubeSAT mission
Authors:
Panagiotis Bountzioukas,
Georgios Kikas,
Christoforos Tsiolakis,
Dimitrios Stoupis,
Eleftheria Chatziargyriou,
Alkis Hatzopoulos,
Vasiliki Kourampa-Gottfroh,
Ilektra Karakosta-Amarantidou,
Aggelos Mavropoulos,
Ioannis-Nikolaos Komis,
Afroditi Kita,
David Palma,
Loris Franchi
Abstract:
AcubeSAT is an open-source CubeSat mission aiming to explore the effects of microgravity and radiation on eukaryotic cells using a compact microfluidic LoC platform. It is developed by SpaceDot, a volunteer, interdisciplinary student team at the Aristotle University of Thessaloniki and supported by the "Fly Your Satellite! 3" program of the ESA Education Office. The scientific data of the mission…
▽ More
AcubeSAT is an open-source CubeSat mission aiming to explore the effects of microgravity and radiation on eukaryotic cells using a compact microfluidic LoC platform. It is developed by SpaceDot, a volunteer, interdisciplinary student team at the Aristotle University of Thessaloniki and supported by the "Fly Your Satellite! 3" program of the ESA Education Office. The scientific data of the mission is comprised of microscope images captured through the on-board integrated camera setup. As the total size of the payload data is expected to be close to 2GB over 12 months, a fast and efficient downlink fulfilling the restrictive power, cost and complexity budgets is required. Currently, there is no open-source communications system design which fully supports these specific constraints, so we opted to develop our own solutions. The antenna system underwent multiple iterations as the design matured, a process highly aided by the feedback received from the ESA experts. The final communications system configuration consists of an S-band microstrip antenna operating at 2.4GHz and a UHF deployable antenna, for the payload data and TM&TC respectively, both in-house designed. In this paper, we will present AcubeSAT's antenna system iterations that span over 3 years, as well as the rationale and analysis results behind each. The development decisions will be highlighted throughout the paper in an effort to aid in the future development of such a low-cost CubeSat mission communications system.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Cosmological simulations of a momentum coupling between dark matter and quintessence
Authors:
Daniela Palma,
Graeme N. Candlish
Abstract:
Dark energy is frequently modelled as an additional dynamical scalar field component in the Universe, referred to as "quintessence", which drives the late-time acceleration. Furthermore, the quintessence field may be coupled to dark matter and/or baryons, leading to a fifth force. In this paper we explore the consequences for non-linear cosmological structure formation arising from a momentum coup…
▽ More
Dark energy is frequently modelled as an additional dynamical scalar field component in the Universe, referred to as "quintessence", which drives the late-time acceleration. Furthermore, the quintessence field may be coupled to dark matter and/or baryons, leading to a fifth force. In this paper we explore the consequences for non-linear cosmological structure formation arising from a momentum coupling between the quintessence field and dark matter only. The coupling leads to a modified Euler equation, which we implement in an N-body cosmological simulation. We then analyse the effects of the coupling on the non-linear power spectrum and the properties of the dark matter halos. We find that, for certain quintessence potentials, a positive coupling can lead to significantly reduced structure on small scales and somewhat enhanced structure on large scales, as well as reduced halo density profiles and increased velocity dispersions.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Evaluating ChatGPT as a Recommender System: A Rigorous Approach
Authors:
Dario Di Palma,
Giovanni Maria Biancofiore,
Vito Walter Anelli,
Fedelucio Narducci,
Tommaso Di Noia,
Eugenio Di Sciascio
Abstract:
Large Language Models (LLMs) have recently shown impressive abilities in handling various natural language-related tasks. Among different LLMs, current studies have assessed ChatGPT's superior performance across manifold tasks, especially under the zero/few-shot prompting conditions. Given such successes, the Recommender Systems (RSs) research community have started investigating its potential app…
▽ More
Large Language Models (LLMs) have recently shown impressive abilities in handling various natural language-related tasks. Among different LLMs, current studies have assessed ChatGPT's superior performance across manifold tasks, especially under the zero/few-shot prompting conditions. Given such successes, the Recommender Systems (RSs) research community have started investigating its potential applications within the recommendation scenario. However, although various methods have been proposed to integrate ChatGPT's capabilities into RSs, current research struggles to comprehensively evaluate such models while considering the peculiarities of generative models. Often, evaluations do not consider hallucinations, duplications, and out-of-the-closed domain recommendations and solely focus on accuracy metrics, neglecting the impact on beyond-accuracy facets. To bridge this gap, we propose a robust evaluation pipeline to assess ChatGPT's ability as an RS and post-process ChatGPT recommendations to account for these aspects. Through this pipeline, we investigate ChatGPT-3.5 and ChatGPT-4 performance in the recommendation task under the zero-shot condition employing the role-playing prompt. We analyze the model's functionality in three settings: the Top-N Recommendation, the cold-start recommendation, and the re-ranking of a list of recommendations, and in three domains: movies, music, and books. The experiments reveal that ChatGPT exhibits higher accuracy than the baselines on books domain. It also excels in re-ranking and cold-start scenarios while maintaining reasonable beyond-accuracy metrics. Furthermore, we measure the similarity between the ChatGPT recommendations and the other recommenders, providing insights about how ChatGPT could be categorized in the realm of recommender systems. The evaluation pipeline is publicly released for future research.
△ Less
Submitted 4 June, 2024; v1 submitted 7 September, 2023;
originally announced September 2023.
-
Lessons learned after three years of SPIDER operation and the first MITICA integrated tests
Authors:
D. Marcuzzi,
V. Toigo,
M. Boldrin,
G. Chitarin,
S. Dal Bello,
L. Grando,
A. Luchetta,
R. Pasqualotto,
M. Pavei,
G. Serianni,
L. Zanotto,
R. Agnello,
P. Agostinetti,
M. Agostini,
D. Aprile,
M. Barbisan,
M. Battistella,
G. Berton,
M. Bigi,
M. Brombin,
V. Candela,
V. Candeloro,
A. Canton,
R. Casagrande,
C. Cavallini
, et al. (117 additional authors not shown)
Abstract:
ITER envisages the use of two heating neutral beam injectors plus an optional one as part of the auxiliary heating and current drive system. The 16.5 MW expected neutral beam power per injector is several notches higher than worldwide existing facilities. A Neutral Beam Test Facility (NBTF) was established at Consorzio RFX, exploiting the synergy of two test beds, SPIDER and MITICA. SPIDER is dedi…
▽ More
ITER envisages the use of two heating neutral beam injectors plus an optional one as part of the auxiliary heating and current drive system. The 16.5 MW expected neutral beam power per injector is several notches higher than worldwide existing facilities. A Neutral Beam Test Facility (NBTF) was established at Consorzio RFX, exploiting the synergy of two test beds, SPIDER and MITICA. SPIDER is dedicated to developing and characterizing large efficient negative ion sources at relevant parameters in ITER-like conditions: source and accelerator located in the same vacuum where the beam propagates, immunity to electromagnetic interferences of multiple radio-frequency (RF) antennas, avoidance of RF-induced discharges on the outside of the source. Three years of experiments on SPIDER have addressed to the necessary design modifications to enable full performances. The source is presently under a long shut-down phase to incorporate learnings from the experimental campaign. Parallelly, developments on MITICA, the full-scale prototype of the ITER NBI featuring a 1 MV accelerator and ion neutralization, are underway including manufacturing of in-vessel components, while power supplies and auxiliary plants are already under final testing and commissioning. Integration, commissioning and tests of the 1MV power supplies are essential for this first-of-kind system, unparalleled both in research and industry field. The integrated test to confirm 1MV output by combining invertor systems, DC generators and transmission lines extracted errors/accidents in some components. To realize a concrete system for ITER, solutions for the repair and the improvement of the system were developed. Hence, NBTF is emerging as a necessary facility, due to the large gap with existing injectors, effectively dedicated to identify issues and find solutions to enable successful ITER NBI operations in a time bound fashion.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Knowledge-based Intent Modeling for Next Generation Cellular Networks
Authors:
Kashif Mehmood,
Katina Kralevska,
David Palma
Abstract:
Intent-based networking (IBN) facilitates the representation of consumer expectations in a declarative and domain-independent form. However, mapping intents to service and resource models remains an open challenge. IBN requires handling existing system data in a structured yet flexible structure way. Knowledge graphs provide an efficient conceptual framework for constructing contexts and organizin…
▽ More
Intent-based networking (IBN) facilitates the representation of consumer expectations in a declarative and domain-independent form. However, mapping intents to service and resource models remains an open challenge. IBN requires handling existing system data in a structured yet flexible structure way. Knowledge graphs provide an efficient conceptual framework for constructing contexts and organizing known information. We utilize knowledge graphs to construct a knowledge-based for modeling of intents in the networking domain. In addition, this work also proposes a knowledge-based intent modeling and processing methodology, extending the standardized intent common model proposed by TM Forum for next-generation cellular networks and services. The proposed knowledge-based IBN approach is demonstrated for next-generation cellular services, validating its potential.
△ Less
Submitted 24 July, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Mission-Critical Public Safety Networking: An Intent-Driven Service Orchestration Perspective
Authors:
Kashif Mehmood,
David Palma,
Katina Kralevska
Abstract:
Intent-based networking (IBN) provides a promising approach for managing networks and orchestrating services in beyond 5G (B5G) deployments using modern service-based architectures. Public safety (PS) services form the basis of keeping society functional, owing to the responsiveness and availability throughout the network. The provisioning of these services requires efficient and agile network man…
▽ More
Intent-based networking (IBN) provides a promising approach for managing networks and orchestrating services in beyond 5G (B5G) deployments using modern service-based architectures. Public safety (PS) services form the basis of keeping society functional, owing to the responsiveness and availability throughout the network. The provisioning of these services requires efficient and agile network management techniques with low-overhead and embedded intelligence. IBN incorporates the service subscribers in a model-driven approach to provision different user-centric services. However, it requires domain-specific and contextual processing of intents for abstracted management of network functions. This work proposes an intent definition for PS and mission critical (MC) services in beyond B5G networks, as well as a processing and orchestration architecture on top of MC push-to-talk (PTT) use case. The simulation results show that MC PTT services adhere to the key performance indicators of access time and mouth-to-ear latency bounded by approximately 250 and 150 milliseconds, respectively, with an additional overhead experienced during the intent processing in the range of 20- 40 milliseconds. This validates the premise of IBN in providing flexible and scalable management and service orchestration solution for PS next generation networks.
△ Less
Submitted 8 May, 2022;
originally announced May 2022.
-
A Gamma-ray Pulsar Timing Array Constrains the Nanohertz Gravitational Wave Background
Authors:
M. Ajello,
W. B. Atwood,
L. Baldini,
J. Ballet,
G. Barbiellini,
D. Bastieri,
R. Bellazzini,
A. Berretta,
B. Bhattacharyya,
E. Bissaldi,
R. D. Blandford,
E. Bloom,
R. Bonino,
P. Bruel,
R. Buehler,
E. Burns,
S. Buson,
R. A. Cameron,
P. A. Caraveo,
E. Cavazzuti,
N. Cibrario,
S. Ciprini,
C. J. Clark,
I. Cognard,
J. Coronado-Blázquez
, et al. (107 additional authors not shown)
Abstract:
After large galaxies merge, their central supermassive black holes are expected to form binary systems whose orbital motion generates a gravitational wave background (GWB) at nanohertz frequencies. Searches for this background utilize pulsar timing arrays, which perform long-term monitoring of millisecond pulsars (MSPs) at radio wavelengths. We use 12.5 years of Fermi Large Area Telescope data to…
▽ More
After large galaxies merge, their central supermassive black holes are expected to form binary systems whose orbital motion generates a gravitational wave background (GWB) at nanohertz frequencies. Searches for this background utilize pulsar timing arrays, which perform long-term monitoring of millisecond pulsars (MSPs) at radio wavelengths. We use 12.5 years of Fermi Large Area Telescope data to form a gamma-ray pulsar timing array. Results from 35 bright gamma-ray pulsars place a 95\% credible limit on the GWB characteristic strain of $1.0\times10^{-14}$ at 1 yr$^{-1}$, which scales as the observing time span $t_{\mathrm{obs}}^{-13/6}$. This direct measurement provides an independent probe of the GWB while offering a check on radio noise models.
△ Less
Submitted 11 April, 2022;
originally announced April 2022.
-
RepoMiner: a Language-agnostic Python Framework to Mine Software Repositories for Defect Prediction
Authors:
Stefano Dalla Palma,
Dario Di Nucci,
Damian Tamburri
Abstract:
Data originating from open-source software projects provide valuable information to enhance software quality. In the scope of Software Defect Prediction, one of the most challenging parts is extracting valid data about failure-prone software components from these repositories, which can help develop more robust software. In particular, collecting data, calculating metrics, and synthesizing results…
▽ More
Data originating from open-source software projects provide valuable information to enhance software quality. In the scope of Software Defect Prediction, one of the most challenging parts is extracting valid data about failure-prone software components from these repositories, which can help develop more robust software. In particular, collecting data, calculating metrics, and synthesizing results from these repositories is a tedious and error-prone task, which often requires understanding the programming languages involved in the mined repositories, eventually leading to a proliferation of language-specific data-mining software. This paper presents RepoMiner, a language-agnostic tool developed to support software engineering researchers in creating datasets to support any study on defect prediction. RepoMiner automatically collects failure data from software components, labels them as failure-prone or neutral, and calculates metrics to be used as ground truth for defect prediction models. We present its implementation and provide examples of its application.
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
Intent-driven autonomous network and service management in future cellular networks: A structured literature review
Authors:
Kashif Mehmood,
Katina Kralevska,
David Palma
Abstract:
Intent-driven networks are an essential stepping stone in the evolution of network and service management towards a truly autonomous paradigm. User centric intents provide an abstracted means of impacting the design, provisioning, deployment and assurance of network infrastructure and services with the help of service level agreements and minimum network capability exposure. The concept of Intent…
▽ More
Intent-driven networks are an essential stepping stone in the evolution of network and service management towards a truly autonomous paradigm. User centric intents provide an abstracted means of impacting the design, provisioning, deployment and assurance of network infrastructure and services with the help of service level agreements and minimum network capability exposure. The concept of Intent Based Networking (IBN) poses several challenges in terms of the contextual definition of intents, role of different stakeholders, and a generalized architecture. In this review, we provide a comprehensive analysis of the state-of-the-art in IBN including the intent description models, intent lifecycle management, significance of IBN and a generalized architectural framework along with challenges and prospects for IBN in future cellular networks. An analytical study is performed on the data collected from relevant studies primarily focusing on the inter-working of IBN with softwarized networking based on NFV/SDN infrastructures. Critical functions required in the IBN management and service model design are explored with different abstract modeling techniques and a converged architectural framework is proposed. The key findings include: (1) benefits and role of IBN in autonomous networking, (2) improvements needed to integrate intents as fundamental policies for service modeling and network management, (3) need for appropriate representation models for intents in domain agnostic abstract manner, and (4) need to include learning as a fundamental function in autonomous networks. These observations provide the basis for in-depth investigation and standardization efforts for IBN as a fundamental network management paradigm in beyond 5G cellular networks.
△ Less
Submitted 9 May, 2023; v1 submitted 10 August, 2021;
originally announced August 2021.
-
5G Network Slicing for Wi-Fi Networks
Authors:
Matteo Nerini,
David Palma
Abstract:
Future networks will pave the way for a myriad of applications with different requirements and Wi-Fi will play an important role in local area networks. This is why network slicing is proposed by 5G networks, allowing to offer multiple logical networks tailored to the different user requirements, over a common infrastructure. However, this is not supported by current Wi-Fi networks. In this paper,…
▽ More
Future networks will pave the way for a myriad of applications with different requirements and Wi-Fi will play an important role in local area networks. This is why network slicing is proposed by 5G networks, allowing to offer multiple logical networks tailored to the different user requirements, over a common infrastructure. However, this is not supported by current Wi-Fi networks. In this paper, we propose a standard-compliant network slicing approach for the radio access segment of Wi-Fi by defining multiple Service Set Identifiers (SSIDs) per Access Point (AP). We present two algorithms, one that assigns resources according to the requirements of slices in a static way, and another that dynamically configures the slices according to the network's conditions and relevant Key Performance Indicators (KPIs). The proposed algorithms were validated through extensive simulations, conducted in the ns-3 network simulator, and complemented by theoretical assessments. The obtained results reveal that the two proposed slicing approaches outperform today's Wi-Fi access technique, reaching lower error probability for bandwidth intensive slices and lower latency for time-critical slices. Simultaneously, the proposed approach is up to 32 times more energy efficient, when considering slices tailored for low-power and low-bandwidth devices, while increasing the overall spectrum efficiency.
△ Less
Submitted 29 January, 2021;
originally announced January 2021.
-
DeepIaC: Deep Learning-Based Linguistic Anti-pattern Detection in IaC
Authors:
Nemania Borovits,
Indika Kumara,
Parvathy Krishnan,
Stefano Dalla Palma,
Dario Di Nucci,
Fabio Palomba,
Damian A. Tamburri,
Willem-Jan van den Heuvel
Abstract:
Linguistic anti-patterns are recurring poor practices concerning inconsistencies among the naming, documentation, and implementation of an entity. They impede readability, understandability, and maintainability of source code. This paper attempts to detect linguistic anti-patterns in infrastructure as code (IaC) scripts used to provision and manage computing environments. In particular, we conside…
▽ More
Linguistic anti-patterns are recurring poor practices concerning inconsistencies among the naming, documentation, and implementation of an entity. They impede readability, understandability, and maintainability of source code. This paper attempts to detect linguistic anti-patterns in infrastructure as code (IaC) scripts used to provision and manage computing environments. In particular, we consider inconsistencies between the logic/body of IaC code units and their names. To this end, we propose a novel automated approach that employs word embeddings and deep learning techniques. We build and use the abstract syntax tree of IaC code units to create their code embedments. Our experiments with a dataset systematically extracted from open source repositories show that our approach yields an accuracy between0.785and0.915in detecting inconsistencies
△ Less
Submitted 22 September, 2020;
originally announced September 2020.
-
Blockchain and Cryptocurrencies: a Classification and Comparison of Architecture Drivers
Authors:
Martin Garriga,
Stefano Dalla Palma,
Maximiliano Arias,
Alan De Renzis,
Remo Pareschi,
Damian Andrew Tamburri
Abstract:
Blockchain is a decentralized transaction and data management solution, the technological leap behind the success of Bitcoin and other cryptocurrencies. As the variety of existing blockchains and distributed ledgers continues to increase, adopters should focus on selecting the solution that best fits their needs and the requirements of their decentralized applications, rather than developing yet a…
▽ More
Blockchain is a decentralized transaction and data management solution, the technological leap behind the success of Bitcoin and other cryptocurrencies. As the variety of existing blockchains and distributed ledgers continues to increase, adopters should focus on selecting the solution that best fits their needs and the requirements of their decentralized applications, rather than developing yet another blockchain from scratch. In this paper we present a conceptual framework to aid software architects, developers, and decision makers to adopt the right blockchain technology. The framework exposes the interrelation between technological decisions and architectural features, capturing the knowledge from existing academic literature, industrial products, technical forums/blogs, and experts' feedback. We empirically show the applicability of our framework by dissecting the platforms behind Bitcoin and other top 10 cryptocurrencies, aided by a focus group with researchers and industry practitioners. Then, we leverage the framework together with key notions of the Architectural Tradeoff Analysis Method (ATAM) to analyze four real-world blockchain case studies from industry and academia. Results shown that applying our framework leads to a deeper understanding of the architectural tradeoffs, allowing to assess technologies more objectively and select the one that best fit developers needs, ultimately cutting costs, reducing time-to-market and accelerating return on investment.
△ Less
Submitted 23 July, 2020;
originally announced July 2020.
-
Lightning optimizes: a threshold mechanism ensures minimum-path flow
Authors:
Franco Blanchini,
Daniele Casagrande,
Filippo Fabiani,
Giulia Giordano,
David Palma,
Raffaele Pesenti
Abstract:
A well-known property of linear resistive electrical networks is that the current distribution minimizes the total dissipated energy. When the circuit includes resistors with nonlinear monotonic characteristic, the current distribution minimizes in general a different functional. We show that, if the nonlinear characteristic is a threshold-like function and the energy generator is concentrated in…
▽ More
A well-known property of linear resistive electrical networks is that the current distribution minimizes the total dissipated energy. When the circuit includes resistors with nonlinear monotonic characteristic, the current distribution minimizes in general a different functional. We show that, if the nonlinear characteristic is a threshold-like function and the energy generator is concentrated in a single point, as in the case of lightning or dielectric discharge, then the current flow is concentrated along a single path, which is a minimum path to the ground with respect to the threshold. We also propose a dynamic model that explains and qualitatively reproduces the lightning transient behaviour: initial generation of several plasma branches and subsequent dismissal of all branches but the one reaching the ground first, which is the optimal one.
△ Less
Submitted 8 October, 2020; v1 submitted 17 July, 2020;
originally announced July 2020.
-
Towards a Catalogue of Software Quality Metrics for Infrastructure Code
Authors:
Stefano Dalla Palma,
Dario Di Nucci,
Fabio Palomba,
Damian A. Tamburri
Abstract:
Infrastructure-as-code (IaC) is a practice to implement continuous deployment by allowing management and provisioning of infrastructure through the definition of machine-readable files and automation around them, rather than physical hardware configuration or interactive configuration tools. On the one hand, although IaC represents an ever-increasing widely adopted practice nowadays, still little…
▽ More
Infrastructure-as-code (IaC) is a practice to implement continuous deployment by allowing management and provisioning of infrastructure through the definition of machine-readable files and automation around them, rather than physical hardware configuration or interactive configuration tools. On the one hand, although IaC represents an ever-increasing widely adopted practice nowadays, still little is known concerning how to best maintain, speedily evolve, and continuously improve the code behind the IaC practice in a measurable fashion. On the other hand, source code measurements are often computed and analyzed to evaluate the different quality aspects of the software developed. However, unlike general-purpose programming languages (GPLs), IaC scripts use domain-specific languages, and metrics used for GPLs may not be applicable for IaC scripts. This article proposes a catalogue consisting of 46 metrics to identify IaC properties focusing on Ansible, one of the most popular IaC language to date, and shows how they can be used to analyze IaC scripts.
△ Less
Submitted 7 July, 2020; v1 submitted 27 May, 2020;
originally announced May 2020.
-
Characterization of the LIGO detectors during their sixth science run
Authors:
The LIGO Scientific Collaboration,
The Virgo Collaboration,
J. Aasi,
J. Abadie,
B. P. Abbott,
R. Abbott,
T. Abbott,
M. R. Abernathy,
T. Accadia,
F. Acernese,
C. Adams,
T. Adams,
R. X. Adhikari,
C. Affeldt,
M. Agathos,
N. Aggarwal,
O. D. Aguiar,
P. Ajith,
B. Allen,
A. Allocca,
E. Amador. Ceron,
D. Amariutei,
R. A. Anderson,
S. B. Anderson,
W. G. Anderson
, et al. (846 additional authors not shown)
Abstract:
In 2009-2010, the Laser Interferometer Gravitational-wave Observa- tory (LIGO) operated together with international partners Virgo and GEO600 as a network to search for gravitational waves of astrophysical origin. The sensitiv- ity of these detectors was limited by a combination of noise sources inherent to the instrumental design and its environment, often localized in time or frequency, that cou…
▽ More
In 2009-2010, the Laser Interferometer Gravitational-wave Observa- tory (LIGO) operated together with international partners Virgo and GEO600 as a network to search for gravitational waves of astrophysical origin. The sensitiv- ity of these detectors was limited by a combination of noise sources inherent to the instrumental design and its environment, often localized in time or frequency, that couple into the gravitational-wave readout. Here we review the performance of the LIGO instruments during this epoch, the work done to characterize the de- tectors and their data, and the effect that transient and continuous noise artefacts have on the sensitivity of LIGO to a variety of astrophysical sources.
△ Less
Submitted 18 November, 2014; v1 submitted 28 October, 2014;
originally announced October 2014.
-
Searching for stochastic gravitational waves using data from the two co-located LIGO Hanford detectors
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
J. Aasi,
J. Abadie,
B. P. Abbott,
R. Abbott,
T. Abbott,
M. R. Abernathy,
T. Accadia,
F. Acernese,
C. Adams,
T. Adams,
P. Addesso,
R. X. Adhikari,
C. Affeldt,
M. Agathos,
N. Aggarwal,
O. D. Aguiar,
P. Ajith,
B. Allen,
A. Allocca,
E. Amado. Ceron,
D. Amariutei,
R. A. Anderson,
S. B. Anderson
, et al. (852 additional authors not shown)
Abstract:
Searches for a stochastic gravitational-wave background (SGWB) using terrestrial detectors typically involve cross-correlating data from pairs of detectors. The sensitivity of such cross-correlation analyses depends, among other things, on the separation between the two detectors: the smaller the separation, the better the sensitivity. Hence, a co-located detector pair is more sensitive to a gravi…
▽ More
Searches for a stochastic gravitational-wave background (SGWB) using terrestrial detectors typically involve cross-correlating data from pairs of detectors. The sensitivity of such cross-correlation analyses depends, among other things, on the separation between the two detectors: the smaller the separation, the better the sensitivity. Hence, a co-located detector pair is more sensitive to a gravitational-wave background than a non-co-located detector pair. However, co-located detectors are also expected to suffer from correlated noise from instrumental and environmental effects that could contaminate the measurement of the background. Hence, methods to identify and mitigate the effects of correlated noise are necessary to achieve the potential increase in sensitivity of co-located detectors. Here we report on the first SGWB analysis using the two LIGO Hanford detectors and address the complications arising from correlated environmental noise. We apply correlated noise identification and mitigation techniques to data taken by the two LIGO Hanford detectors, H1 and H2, during LIGO's fifth science run. At low frequencies, 40 - 460 Hz, we are unable to sufficiently mitigate the correlated noise to a level where we may confidently measure or bound the stochastic gravitational-wave signal. However, at high frequencies, 460-1000 Hz, these techniques are sufficient to set a $95%$ confidence level (C.L.) upper limit on the gravitational-wave energy density of Ω(f)<7.7 x 10^{-4} (f/ 900 Hz)^3, which improves on the previous upper limit by a factor of $\sim 180$. In doing so, we demonstrate techniques that will be useful for future searches using advanced detectors, where correlated noise (e.g., from global magnetic fields) may affect even widely separated detectors.
△ Less
Submitted 2 December, 2014; v1 submitted 22 October, 2014;
originally announced October 2014.
-
Predicting epidemic risk from past temporal contact data
Authors:
Eugenio Valdano,
Chiara Poletto,
Armando Giovannini,
Diana Palma,
Lara Savini,
Vittoria Colizza
Abstract:
Understanding how epidemics spread in a system is a crucial step to prevent and control outbreaks, with broad implications on the system's functioning, health, and associated costs. This can be achieved by identifying the elements at higher risk of infection and implementing targeted surveillance and control measures. One important ingredient to consider is the pattern of disease-transmission cont…
▽ More
Understanding how epidemics spread in a system is a crucial step to prevent and control outbreaks, with broad implications on the system's functioning, health, and associated costs. This can be achieved by identifying the elements at higher risk of infection and implementing targeted surveillance and control measures. One important ingredient to consider is the pattern of disease-transmission contacts among the elements, however lack of data or delays in providing updated records may hinder its use, especially for time-varying patterns. Here we explore to what extent it is possible to use past temporal data of a system's pattern of contacts to predict the risk of infection of its elements during an emerging outbreak, in absence of updated data. We focus on two real-world temporal systems; a livestock displacements trade network among animal holdings, and a network of sexual encounters in high-end prostitution. We define the node's loyalty as a local measure of its tendency to maintain contacts with the same elements over time, and uncover important non-trivial correlations with the node's epidemic risk. We show that a risk assessment analysis incorporating this knowledge and based on past structural and temporal pattern properties provides accurate predictions for both systems. Its generalizability is tested by introducing a theoretical model for generating synthetic temporal networks. High accuracy of our predictions is recovered across different settings, while the amount of possible predictions is system-specific. The proposed method can provide crucial information for the setup of targeted intervention strategies.
△ Less
Submitted 13 March, 2015; v1 submitted 5 June, 2014;
originally announced June 2014.
-
Novel Scintillating Materials Based on Phenyl-Polysiloxane for Neutron Detection and Monitoring
Authors:
M. Degerlier,
S. Carturan,
F. Gramegna,
T. Marchi,
M. Dalla Palma,
M. Cinausero,
G. Maggioni,
A. Quaranta,
G. Collazuol,
J. Bermudez
Abstract:
Neutron detectors are extensively used at many nuclear research facilities across Europe. Their application range covers many topics in basic and applied nuclear research: in nuclear structure and reaction dynamics (reaction reconstruction and decay studies); in nuclear astrophysics (neutron emission probabilities); in nuclear technology (nuclear data measurements and in-core/off-core monitors); i…
▽ More
Neutron detectors are extensively used at many nuclear research facilities across Europe. Their application range covers many topics in basic and applied nuclear research: in nuclear structure and reaction dynamics (reaction reconstruction and decay studies); in nuclear astrophysics (neutron emission probabilities); in nuclear technology (nuclear data measurements and in-core/off-core monitors); in nuclear medicine (radiation monitors, dosimeters); in materials science (neutron imaging techniques); in homeland security applications (fissile materials investigation and cargo inspection). Liquid scintillators, widely used at present, have however some drawbacks given by toxicity, flammability, volatility and sensitivity to oxygen that limit their duration and quality. Even plastic scintillators are not satisfactory because they have low radiation hardness and low thermal stability. Moreover organic solvents may affect their optical properties due to crazing. In order to overcome these problems, phenyl-polysiloxane based scintillators have been recently developed at Legnaro National Laboratory. This new solution showed very good chemical and thermal stability and high radiation hardness. The results on the different samples performance will be presented, paying special attention to a characterization comparison between synthesized phenyl containing polysiloxane resins where a Pt catalyst has been used and a scintillating material obtained by condensation reaction, where tin based compounds are used as catalysts. Different structural arrangements as a result of different substituents on the main chain have been investigated by High Resolution X-Ray Diffraction, while the effect of improved optical transmittance on the scintillation yield has been elucidated by a combination of excitation/fluorescence measurements and scintillation yield under exposure to alpha and γ-rays.
△ Less
Submitted 25 October, 2013;
originally announced October 2013.