Skip to main content

Showing 1–4 of 4 results for author: Pittaras, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.16425  [pdf, ps, other

    cs.CY cs.AI cs.LG

    Lessons for Editors of AI Incidents from the AI Incident Database

    Authors: Kevin Paeth, Daniel Atherton, Nikiforos Pittaras, Heather Frase, Sean McGregor

    Abstract: As artificial intelligence (AI) systems become increasingly deployed across the world, they are also increasingly implicated in AI incidents - harm events to individuals and society. As a result, industry, civil society, and governments worldwide are developing best practices and regulations for monitoring and analyzing AI incidents. The AI Incident Database (AIID) is a project that catalogs AI in… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 8 pages, 0 figures

  2. arXiv:2211.07280  [pdf, other

    cs.AI cs.CY

    A taxonomic system for failure cause analysis of open source AI incidents

    Authors: Nikiforos Pittaras, Sean McGregor

    Abstract: While certain industrial sectors (e.g., aviation) have a long history of mandatory incident reporting complete with analytical findings, the practice of artificial intelligence (AI) safety benefits from no such mandate and thus analyses must be performed on publicly known ``open source'' AI incidents. Although the exact causes of AI incidents are seldom known by outsiders, this work demonstrates h… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

  3. arXiv:2210.12841  [pdf, other

    cs.LG cs.GT cs.MA

    A Cooperative Reinforcement Learning Environment for Detecting and Penalizing Betrayal

    Authors: Nikiforos Pittaras

    Abstract: In this paper we present a Reinforcement Learning environment that leverages agent cooperation and communication, aimed at detection, learning and ultimately penalizing betrayal patterns that emerge in the behavior of self-interested agents. We provide a description of game rules, along with interesting cases of betrayal and trade-offs that arise. Preliminary experimental investigations illustrate… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

  4. arXiv:2102.04521  [pdf, ps, other

    cs.CL

    A study of text representations in Hate Speech Detection

    Authors: Chrysoula Themeli, George Giannakopoulos, Nikiforos Pittaras

    Abstract: The pervasiveness of the Internet and social media have enabled the rapid and anonymous spread of Hate Speech content on microblogging platforms such as Twitter. Current EU and US legislation against hateful language, in conjunction with the large amount of data produced in these platforms has led to automatic tools being a necessary component of the Hate Speech detection task and pipeline. In thi… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: 14 pages, CICLing2019