Search | arXiv e-print repository

AI Governance through Markets

Authors: Philip Moreira Tomei, Rupal Jain, Matija Franklin

Abstract: This paper argues that market governance mechanisms should be considered a key approach in the governance of artificial intelligence (AI), alongside traditional regulatory frameworks. While current governance approaches have predominantly focused on regulation, we contend that market-based mechanisms offer effective incentives for responsible AI development. We examine four emerging vectors of mar… ▽ More This paper argues that market governance mechanisms should be considered a key approach in the governance of artificial intelligence (AI), alongside traditional regulatory frameworks. While current governance approaches have predominantly focused on regulation, we contend that market-based mechanisms offer effective incentives for responsible AI development. We examine four emerging vectors of market governance: insurance, auditing, procurement, and due diligence, demonstrating how these mechanisms can affirm the relationship between AI risk and financial risk while addressing capital allocation inefficiencies. While we do not claim that market forces alone can adequately protect societal interests, we maintain that standardised AI disclosures and market mechanisms can create powerful incentives for safe and responsible AI development. This paper urges regulators, economists, and machine learning researchers to investigate and implement market-based approaches to AI governance. △ Less

Submitted 5 March, 2025; v1 submitted 29 January, 2025; originally announced January 2025.

arXiv:2409.05407 [pdf, other]

KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction

Authors: Davide Di Nucci, Alessandro Simoni, Matteo Tomei, Luca Ciuffreda, Roberto Vezzani, Rita Cucchiara

Abstract: The three-dimensional representation of objects or scenes starting from a set of images has been a widely discussed topic for years and has gained additional attention after the diffusion of NeRF-based approaches. However, an underestimated prerequisite is the knowledge of camera poses or, more specifically, the estimation of the extrinsic calibration parameters. Although excellent general-purpose… ▽ More The three-dimensional representation of objects or scenes starting from a set of images has been a widely discussed topic for years and has gained additional attention after the diffusion of NeRF-based approaches. However, an underestimated prerequisite is the knowledge of camera poses or, more specifically, the estimation of the extrinsic calibration parameters. Although excellent general-purpose Structure-from-Motion methods are available as a pre-processing step, their computational load is high and they require a lot of frames to guarantee sufficient overlapping among the views. This paper introduces KRONC, a novel approach aimed at inferring view poses by leveraging prior knowledge about the object to reconstruct and its representation through semantic keypoints. With a focus on vehicle scenes, KRONC is able to estimate the position of the views as a solution to a light optimization problem targeting the convergence of keypoints' back-projections to a singular point. To validate the method, a specific dataset of real-world car scenes has been collected. Experiments confirm KRONC's ability to generate excellent estimates of camera poses starting from very coarse initialization. Results are comparable with Structure-from-Motion methods with huge savings in computation. Code and data will be made publicly available. △ Less

Submitted 9 September, 2024; originally announced September 2024.

Comments: Accepted at ECCVW

arXiv:2308.16364 [pdf, ps, other]

Strengthening the EU AI Act: Defining Key Terms on AI Manipulation

Authors: Matija Franklin, Philip Moreira Tomei, Rebecca Gorman

Abstract: The European Union's Artificial Intelligence Act aims to regulate manipulative and harmful uses of AI, but lacks precise definitions for key concepts. This paper provides technical recommendations to improve the Act's conceptual clarity and enforceability. We review psychological models to define "personality traits," arguing the Act should protect full "psychometric profiles." We urge expanding "… ▽ More The European Union's Artificial Intelligence Act aims to regulate manipulative and harmful uses of AI, but lacks precise definitions for key concepts. This paper provides technical recommendations to improve the Act's conceptual clarity and enforceability. We review psychological models to define "personality traits," arguing the Act should protect full "psychometric profiles." We urge expanding "behavior" to include "preferences" since preferences causally influence and are influenced by behavior. Clear definitions are provided for "subliminal," "manipulative," and "deceptive" techniques, considering incentives, intent, and covertness. We distinguish "exploiting individuals" from "exploiting groups," emphasising different policy needs. An "informed decision" is defined by four facets: comprehension, accurate information, no manipulation, and understanding AI's influence. We caution the Act's therapeutic use exemption given the lack of regulation of digital therapeutics by the EMA. Overall, the recommendations strengthen definitions of vague concepts in the EU AI Act, enhancing precise applicability to regulate harmful AI manipulation. △ Less

Submitted 30 August, 2023; originally announced August 2023.

Comments: 10 pages

arXiv:2307.12718 [pdf, other]

CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components

Authors: Davide Di Nucci, Alessandro Simoni, Matteo Tomei, Luca Ciuffreda, Roberto Vezzani, Rita Cucchiara

Abstract: Neural Radiance Fields (NeRFs) have gained widespread recognition as a highly effective technique for representing 3D reconstructions of objects and scenes derived from sets of images. Despite their efficiency, NeRF models can pose challenges in certain scenarios such as vehicle inspection, where the lack of sufficient data or the presence of challenging elements (e.g. reflections) strongly impact… ▽ More Neural Radiance Fields (NeRFs) have gained widespread recognition as a highly effective technique for representing 3D reconstructions of objects and scenes derived from sets of images. Despite their efficiency, NeRF models can pose challenges in certain scenarios such as vehicle inspection, where the lack of sufficient data or the presence of challenging elements (e.g. reflections) strongly impact the accuracy of the reconstruction. To this aim, we introduce CarPatch, a novel synthetic benchmark of vehicles. In addition to a set of images annotated with their intrinsic and extrinsic camera parameters, the corresponding depth maps and semantic segmentation masks have been generated for each view. Global and part-based metrics have been defined and used to evaluate, compare, and better characterize some state-of-the-art techniques. The dataset is publicly released at https://aimagelab.ing.unimore.it/go/carpatch and can be used as an evaluation guide and as a baseline for future work on this challenging topic. △ Less

Submitted 24 July, 2023; originally announced July 2023.

Comments: Accepted at ICIAP2023

arXiv:2102.07624 [pdf, other]

RMS-Net: Regression and Masking for Soccer Event Spotting

Authors: Matteo Tomei, Lorenzo Baraldi, Simone Calderara, Simone Bronzin, Rita Cucchiara

Abstract: The recently proposed action spotting task consists in finding the exact timestamp in which an event occurs. This task fits particularly well for soccer videos, where events correspond to salient actions strictly defined by soccer rules (a goal occurs when the ball crosses the goal line). In this paper, we devise a lightweight and modular network for action spotting, which can simultaneously predi… ▽ More The recently proposed action spotting task consists in finding the exact timestamp in which an event occurs. This task fits particularly well for soccer videos, where events correspond to salient actions strictly defined by soccer rules (a goal occurs when the ball crosses the goal line). In this paper, we devise a lightweight and modular network for action spotting, which can simultaneously predict the event label and its temporal offset using the same underlying features. We enrich our model with two training strategies: the first one for data balancing and uniform sampling, the second for masking ambiguous frames and keeping the most discriminative visual cues. When tested on the SoccerNet dataset and using standard features, our full proposal exceeds the current state of the art by 3 Average-mAP points. Additionally, it reaches a gain of more than 10 Average-mAP points on the test set when fine-tuned in combination with a strong 2D backbone. △ Less

Submitted 15 February, 2021; originally announced February 2021.

arXiv:2011.04069 [pdf, ps, other]

The Twelvefold Way of Non-Sequential Lossless Compression

Authors: Taha Ameen ur Rahman, Alton S. Barbehenn, Xinan Chen, Hassan Dbouk, James A. Douglas, Yuncong Geng, Ian George, John B. Harvill, Sung Woo Jeon, Kartik K. Kansal, Kiwook Lee, Kelly A. Levick, Bochao Li, Ziyue Li, Yashaswini Murthy, Adarsh Muthuveeru-Subramaniam, S. Yagiz Olmez, Matthew J. Tomei, Tanya Veeravalli, Xuechao Wang, Eric A. Wayman, Fan Wu, Peng Xu, Shen Yan, Heling Zhang , et al. (5 additional authors not shown)

Abstract: Many information sources are not just sequences of distinguishable symbols but rather have invariances governed by alternative counting paradigms such as permutations, combinations, and partitions. We consider an entire classification of these invariances called the twelvefold way in enumerative combinatorics and develop a method to characterize lossless compression limits. Explicit computations f… ▽ More Many information sources are not just sequences of distinguishable symbols but rather have invariances governed by alternative counting paradigms such as permutations, combinations, and partitions. We consider an entire classification of these invariances called the twelvefold way in enumerative combinatorics and develop a method to characterize lossless compression limits. Explicit computations for all twelve settings are carried out for i.i.d. uniform and Bernoulli distributions. Comparisons among settings provide quantitative insight. △ Less

Submitted 20 January, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

Comments: DCC 2021

arXiv:1912.04316 [pdf, other]

doi 10.1016/j.cviu.2021.103187

Video action detection by learning graph-based spatio-temporal interactions

Authors: Matteo Tomei, Lorenzo Baraldi, Simone Calderara, Simone Bronzin, Rita Cucchiara

Abstract: Action Detection is a complex task that aims to detect and classify human actions in video clips. Typically, it has been addressed by processing fine-grained features extracted from a video classification backbone. Recently, thanks to the robustness of object and people detectors, a deeper focus has been added on relationship modelling. Following this line, we propose a graph-based framework to le… ▽ More Action Detection is a complex task that aims to detect and classify human actions in video clips. Typically, it has been addressed by processing fine-grained features extracted from a video classification backbone. Recently, thanks to the robustness of object and people detectors, a deeper focus has been added on relationship modelling. Following this line, we propose a graph-based framework to learn high-level interactions between people and objects, in both space and time. In our formulation, spatio-temporal relationships are learned through self-attention on a multi-layer graph structure which can connect entities from consecutive clips, thus considering long-range spatial and temporal dependencies. The proposed module is backbone independent by design and does not require end-to-end training. Extensive experiments are conducted on the AVA dataset, where our model demonstrates state-of-the-art results and consistent improvements over baselines built with different backbones. Code is publicly available at https://github.com/aimagelab/STAGE_action_detection. △ Less

Submitted 1 March, 2021; v1 submitted 9 December, 2019; originally announced December 2019.

Comments: This is the authors version of an article accepted for publication in Computer Vision and Image Understanding (CVIU), available online February 2021

Journal ref: Computer Vision and Image Understanding (CVIU), 2021

arXiv:1811.10666 [pdf, other]

Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation

Authors: Matteo Tomei, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

Abstract: The applicability of computer vision to real paintings and artworks has been rarely investigated, even though a vast heritage would greatly benefit from techniques which can understand and process data from the artistic domain. This is partially due to the small amount of annotated artistic data, which is not even comparable to that of natural images captured by cameras. In this paper, we propose… ▽ More The applicability of computer vision to real paintings and artworks has been rarely investigated, even though a vast heritage would greatly benefit from techniques which can understand and process data from the artistic domain. This is partially due to the small amount of annotated artistic data, which is not even comparable to that of natural images captured by cameras. In this paper, we propose a semantic-aware architecture which can translate artworks to photo-realistic visualizations, thus reducing the gap between visual features of artistic and realistic data. Our architecture can generate natural images by retrieving and learning details from real photos through a similarity matching strategy which leverages a weakly-supervised semantic understanding of the scene. Experimental results show that the proposed technique leads to increased realism and to a reduction in domain shift, which improves the performance of pre-trained architectures for classification, detection, and segmentation. Code is publicly available at: https://github.com/aimagelab/art2real. △ Less

Submitted 17 May, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

Comments: CVPR 2019

arXiv:0704.3552 [pdf, other]

doi 10.1051/0004-6361:20077359

An X-ray Survey in SA 57 with XMM-Newton

Authors: D. Trevese, F. Vagnetti, S. Puccetti, F. Fiore, M. Tomei, M. A. Bershady

Abstract: The maximum number density of Active Galactic Nuclei (AGNs), as deduced from X-ray studies, occurs at z<~1, with lower luminosity objects peaking at smaller redshifts. Optical studies lead to a different evolutionary behaviour, with a number density peaking at z~2 independently of the intrinsic luminosity, but this result is limited to active nuclei brighter than the host galaxy. A selection bas… ▽ More The maximum number density of Active Galactic Nuclei (AGNs), as deduced from X-ray studies, occurs at z<~1, with lower luminosity objects peaking at smaller redshifts. Optical studies lead to a different evolutionary behaviour, with a number density peaking at z~2 independently of the intrinsic luminosity, but this result is limited to active nuclei brighter than the host galaxy. A selection based on optical variability can detect low luminosity AGNs (LLAGNs), where the host galaxy light prevents the identification by non-stellar colours. We want to collect X-ray data in a field where it exists an optically-selected sample of "variable galaxies'', i.e. variable objects with diffuse appearance, to investigate the X-ray and optical properties of the population of AGNs, particularly of low luminosity ones, where the host galaxy is visible. We observed a field of 0.2 deg^2 in the Selected Area 57, for 67ks with XMM-Newton. We detected X-ray sources, and we correlated the list with a photographic survey of SA 57, complete to B_J~23 and with available spectroscopic data. We obtained a catalogue of 140 X-ray sources to limiting fluxes 5x10^-16, 2x10^-15 erg/cm^2/s in the 0.5-2 keV and 2-10 keV respectively, 98 of which are identified in the optical bands. The X-ray detection of part of the variability-selected candidates confirms their AGN nature. Diffuse variable objects populate the low luminosity side of the sample. Only 25/44 optically-selected QSOs are detected in X-rays. 15% of all QSOs in the field have X/O<0.1. △ Less

Submitted 26 April, 2007; originally announced April 2007.

Comments: 13 pages, 6 figures, 4 tables, A&A in press

Showing 1–9 of 9 results for author: Tomei, M