Showing 1–16 of 16 results for author: Ceravolo, P

  1. arXiv:2503.24062  [pdf, other]

    cs.CL cs.AI cs.LG

    Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data

    Authors: Fatemeh Mohammadi, Tommaso Romano, Samira Maghool, Paolo Ceravolo

    Abstract: Collecting high-quality training data is essential for fine-tuning Large Language Models (LLMs). However, acquiring such data is often costly and time-consuming, especially for non-English languages such as Italian. Recently, researchers have begun to explore the use of LLMs to generate synthetic datasets as a viable alternative. This study proposes a pipeline for generating synthetic data and a c…

    Submitted 31 March, 2025; originally announced March 2025.

  2. arXiv:2503.18994  [pdf, other]

    cs.CY cs.AI cs.LG

    HH4AI: A methodological Framework for AI Human Rights impact assessment under the EUAI ACT

    Authors: Paolo Ceravolo, Ernesto Damiani, Maria Elisa D'Amico, Bianca de Teffe Erb, Simone Favaro, Nannerel Fiano, Paolo Gambatesa, Simone La Porta, Samira Maghool, Lara Mauri, Niccolo Panigada, Lorenzo Maria Ratto Vaquer, Marta A. Tamborini

    Abstract: This paper introduces the HH4AI Methodology, a structured approach to assessing the impact of AI systems on human rights, focusing on compliance with the EU AI Act and addressing technical, ethical, and regulatory challenges. The paper highlights AI's transformative nature, driven by autonomy, data, and goal-oriented design, and how the EU AI Act promotes transparency, accountability, and safety. A…

    Submitted 23 March, 2025; originally announced March 2025.

    Comments: 19 pages, 7 figures, 1 table

  3. arXiv:2502.11611  [pdf, other]

    cs.CL cs.AI

    Identifying Gender Stereotypes and Biases in Automated Translation from English to Italian using Similarity Networks

    Authors: Fatemeh Mohammadi, Marta Annamaria Tamborini, Paolo Ceravolo, Costanza Nardocci, Samira Maghool

    Abstract: This paper is a collaborative effort between Linguistics, Law, and Computer Science to evaluate stereotypes and biases in automated translation systems. We advocate gender-neutral translation as a means to promote gender inclusion and improve the objectivity of machine translation. Our approach focuses on identifying gender bias in English-to-Italian translations. First, we define gender bias foll…

    Submitted 17 February, 2025; originally announced February 2025.

  4. arXiv:2502.06918  [pdf, other]

    cs.LG cs.AI

    Leveraging GPT-4o Efficiency for Detecting Rework Anomaly in Business Processes

    Authors: Mohammad Derakhshan, Paolo Ceravolo, Fatemeh Mohammadi

    Abstract: This paper investigates the effectiveness of GPT-4o-2024-08-06, one of the Large Language Models (LLM) from OpenAI, in detecting business process anomalies, with a focus on rework anomalies. In our study, we developed a GPT-4o-based tool capable of transforming event logs into a structured format and identifying reworked activities within business event logs. The analysis was performed on a synthe…

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: 14 pages, 5 images, 4 tables

  5. arXiv:2411.05648  [pdf, other]

    cs.LG

    Enhancing Model Fairness and Accuracy with Similarity Networks: A Methodological Approach

    Authors: Samira Maghool, Paolo Ceravolo

    Abstract: In this paper, we propose an innovative approach to thoroughly explore dataset features that introduce bias in downstream machine-learning tasks. Depending on the data format, we use different techniques to map instances into a similarity feature space. Our method's ability to adjust the resolution of pairwise similarity provides clear insights into the relationship between the dataset classificat…

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: 7 pages, 4 figures

  6. arXiv:2406.06596  [pdf, other]

    cs.CL cs.AI cs.DB

    Are Large Language Models the New Interface for Data Pipelines?

    Authors: Sylvio Barbon Junior, Paolo Ceravolo, Sven Groppe, Mustafa Jarrar, Samira Maghool, Florence Sèdes, Soror Sahri, Maurice Van Keulen

    Abstract: A Language Model is a term that encompasses various types of models designed to understand and generate human communication. Large Language Models (LLMs) have gained significant attention due to their ability to process text with human-like fluency and coherence, making them valuable for a wide range of data-related tasks fashioned as pipelines. The capabilities of LLMs in natural language underst…

    Submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2403.05556  [pdf]

    cs.CY cs.LG

    Modeling and predicting students' engagement behaviors using mixture Markov models

    Authors: R. Maqsood, P. Ceravolo, C. Romero, S. Ventura

    Abstract: Students' engagements reflect their level of involvement in an ongoing learning process which can be estimated through their interactions with a computer-based learning or assessment system. A pre-requirement for stimulating student engagement lies in the capability to have an approximate representation model for comprehending students' varied (dis)engagement behaviors. In this paper, we utilized…

    Submitted 10 February, 2024; originally announced March 2024.

    Journal ref: Knowledge and Information Systems (2022); 64:1349-1384

  8. arXiv:2310.11196  [pdf]

    astro-ph.EP astro-ph.SR

    Kilometer-precise (UII) Umbriel physical properties from the multichord stellar occultation on 2020 September 21

    Authors: M. Assafin, S. Santos-Filho, B. E. Morgado, A. R. Gomes-Júnior, B. Sicardy, G. Margoti, G. Benedetti-Rossi, F. Braga-Ribas, T. Laidler, J. I. B. Camargo, R. Vieira-Martins, T. Swift, D. Dunham, T. George, J. Bardecker, C. Anderson, R. Nolthenius, K. Bender, G. Viscome, D. Oesper, R. Dunford, K. Getrost, C. Kitting, K. Green, R. Bria, et al. (17 additional authors not shown)

    Abstract: We report the results of the stellar occultation by (UII) Umbriel on September 21st, 2020. The shadow crossed the USA and Canada, and 19 positive chords were obtained. A limb parameter accounted for putative topographic features in the limb fittings. Ellipse fittings were not robust - only upper limits were derived for the true size/shape of a putative Umbriel ellipsoid. The adopted spherical solu…

    Submitted 17 October, 2023; originally announced October 2023.

  9. Scaling slowly rotating asteroids by stellar occultations

    Authors: A. Marciniak, J. Ďurech, A. Choukroun, J. Hanuš, W. Ogłoza, R. Szakáts, L. Molnár, A. Pál, F. Monteiro, E. Frappa, W. Beisker, H. Pavlov, J. Moore, R. Adomavičienė, R. Aikawa, S. Andersson, P. Antonini, Y. Argentin, A. Asai, P. Assoignon, J. Barton, P. Baruffetti, K. L. Bath, R. Behrend, L. Benedyktowicz, et al. (154 additional authors not shown)

    Abstract: As evidenced by recent survey results, majority of asteroids are slow rotators (P>12 h), but lack spin and shape models due to selection bias. This bias is skewing our overall understanding of the spins, shapes, and sizes of asteroids, as well as of their other properties. Also, diameter determinations for large (>60km) and medium-sized asteroids (between 30 and 60 km) often vary by over 30% for m…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted to Astronomy & Astrophysics. 12 pages + appendices

    Journal ref: A&A 679, A60 (2023)

  10. A large topographic feature on the surface of the trans-Neptunian object (307261) 2002 MS$_4$ measured from stellar occultations

    Authors: F. L. Rommel, F. Braga-Ribas, J. L. Ortiz, B. Sicardy, P. Santos-Sanz, J. Desmars, J. I. B. Camargo, R. Vieira-Martins, M. Assafin, B. E. Morgado, R. C. Boufleur, G. Benedetti-Rossi, A. R. Gomes-Júnior, E. Fernández-Valenzuela, B. J. Holler, D. Souami, R. Duffard, G. Margoti, M. Vara-Lubiano, J. Lecacheux, J. L. Plouvier, N. Morales, A. Maury, J. Fabrega, P. Ceravolo, et al. (179 additional authors not shown)

    Abstract: This work aims at constraining the size, shape, and geometric albedo of the dwarf planet candidate 2002 MS4 through the analysis of nine stellar occultation events. Using multichord detection, we also studied the object's topography by analyzing the obtained limb and the residuals between observed chords and the best-fitted ellipse. We predicted and organized the observational campaigns of nine st…

    Submitted 23 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Journal ref: A&A 678, A167 (2023)

  11. Tailoring Machine Learning for Process Mining

    Authors: Paolo Ceravolo, Sylvio Barbon Junior, Ernesto Damiani, Wil van der Aalst

    Abstract: Machine learning models are routinely integrated into process mining pipelines to carry out tasks like data transformation, noise reduction, anomaly detection, classification, and prediction. Often, the design of such models is based on some ad-hoc assumptions about the corresponding data distributions, which are not necessarily in accordance with the non-parametric distributions typically observe…

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: 16 pages

    MSC Class: 68 ACM Class: I.2.6

  12. arXiv:2303.17879  [pdf, other]

    cs.AI cs.LG

    CoSMo: a Framework to Instantiate Conditioned Process Simulation Models

    Authors: Rafael S. Oyamada, Gabriel M. Tavares, Sylvio Barbon Junior, Paolo Ceravolo

    Abstract: Process simulation is gaining attention for its ability to assess potential performance improvements and risks associated with business process changes. The existing literature presents various techniques, generally grounded in process models discovered from event log data or built upon deep learning algorithms. These techniques have specific strengths and limitations. Traditional data-driven appr…

    Submitted 25 June, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

  13. arXiv:2301.02167  [pdf, other]

    cs.LG cs.DB

    Trace Encoding in Process Mining: a survey and benchmarking

    Authors: Sylvio Barbon Jr., Paolo Ceravolo, Rafael S. Oyamada, Gabriel M. Tavares

    Abstract: Encoding methods are employed across several process mining tasks, including predictive process monitoring, anomalous case detection, trace clustering, etc. These methods are usually performed as preprocessing steps and are responsible for transforming complex information into a numerical feature space. Most papers choose existing encoding methods arbitrarily or employ a strategy based on a specif…

    Submitted 5 January, 2023; originally announced January 2023.

  14. arXiv:2109.00635  [pdf, other]

    cs.LG cs.IR cs.SE

    Selecting Optimal Trace Clustering Pipelines with AutoML

    Authors: Sylvio Barbon Jr, Paolo Ceravolo, Ernesto Damiani, Gabriel Marques Tavares

    Abstract: Trace clustering has been extensively used to preprocess event logs. By grouping similar behavior, these techniques guide the identification of sub-logs, producing more understandable models and conformance analytics. Nevertheless, little attention has been posed to the relationship between event log properties and clustering quality. In this work, we propose an Automatic Machine Learning (AutoML)…

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: 17 pages, 7 figures

  15. arXiv:2103.12874  [pdf, other]

    cs.LG cs.IR cs.SE

    Using Meta-learning to Recommend Process Discovery Methods

    Authors: Sylvio Barbon Jr, Paolo Ceravolo, Ernesto Damiani, Gabriel Marques Tavares

    Abstract: Process discovery methods have obtained remarkable achievements in Process Mining, delivering comprehensible process models to enhance management capabilities. However, selecting the suitable method for a specific event log highly relies on human expertise, hindering its broad application. Solutions based on Meta-learning (MtL) have been promising for creating systems with reduced human assistance…

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: 16 pages, 6 figures

  16. arXiv:1708.03529  [pdf, other]

    cs.CY

    Quantify resilience enhancement of UTS through exploiting connect community and internet of everything emerging technologies

    Authors: Emanuele Bellini, Paolo Ceravolo, Paolo Besi

    Abstract: This work aims at investigating and quantifying the Urban Transport System (UTS) resilience enhancement enabled by the adoption of emerging technology such as Internet of Everything (IoE) and the new trend of the Connected Community (CC). A conceptual extension of Functional Resonance Analysis Method (FRAM) and its formalization have been proposed and used to model UTS complexity. The scope is to…

    Submitted 11 August, 2017; originally announced August 2017.