Showing 1–8 of 8 results for author: Foorthuis, R

  1. A Typology of Data Anomalies

    Authors: Ralph Foorthuis

    Abstract: Anomalies are cases that are in some way unusual and do not appear to fit the general patterns present in the dataset. Several conceptualizations exist to distinguish between different types of anomalies. However, these are either too specific to be generally applicable or so abstract that they neither provide concrete insight into the nature of anomaly types nor facilitate the functional evaluati…

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: 13 pages, 5 figures. Presented at the 17th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2018). Note: for a fully developed and more detailed typology of anomalies, see the follow-up publication 'On the Nature and Types of Anomalies: A Review of Deviations in Data'. arXiv admin note: text overlap with arXiv:2007.15634

    MSC Class: 62G07 ACM Class: G.3; I.2.6; I.5

  2. Algorithmic Frameworks for the Detection of High Density Anomalies

    Authors: Ralph Foorthuis

    Abstract: This study explores the concept of high-density anomalies. As opposed to the traditional concept of anomalies as isolated occurrences, high-density anomalies are deviant cases positioned in the most normal regions of the data space. Such anomalies are relevant for various practical use cases, such as misbehavior detection and data quality analysis. Effective methods for identifying them are partic…

    Submitted 4 April, 2021; v1 submitted 9 October, 2020; originally announced October 2020.

    Comments: 10 pages, 9 figures, 6 tables. Accepted for presentation at IEEE SSCI CIDM 2020 (Symposium on Computational Intelligence in Data Mining)

    MSC Class: 62G07 ACM Class: G.3; I.2.6; I.5

  3. arXiv:2008.12330

    cs.DB cs.AI cs.LG stat.ML

    The Impact of Discretization Method on the Detection of Six Types of Anomalies in Datasets

    Authors: Ralph Foorthuis

    Abstract: Anomaly detection is the process of identifying cases, or groups of cases, that are in some way unusual and do not fit the general patterns present in the dataset. Numerous algorithms use discretization of numerical data in their detection processes. This study investigates the effect of the discretization method on the unsupervised detection of each of the six anomaly types acknowledged in a rece…

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: 16 pages, 5 figures, 2 tables. Presented at the 30th Benelux Conference on Artificial Intelligence (BNAIC 2018)

    MSC Class: 62G07 ACM Class: G.3; I.2.6; I.5

  4. arXiv:2008.11026

    cs.CY

    On Course, But Not There Yet: Enterprise Architecture Conformance and Benefits in Systems Development

    Authors: Ralph Foorthuis, Marlies van Steenbergen, Nino Mushkudiani, Wiel Bruls, Sjaak Brinkkemper, Rik Bos

    Abstract: Various claims have been made regarding the benefits that Enterprise Architecture (EA) delivers for both individual systems development projects and the organization as a whole. This paper presents the statistical findings of a survey study (n=293) carried out to empirically test these claims. First, we investigated which techniques are used in practice to stimulate conformance to EA. Secondly, we…

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: 19 pages (excluding cover pages), 2 figures, 11 tables. Proceedings of the Thirty First International Conference on Information Systems (ICIS 2010), St. Louis, Missouri, USA. arXiv admin note: text overlap with arXiv:2008.08112

    ACM Class: K.4.3; K.5.2

  5. A Theory Building Study of Enterprise Architecture Practices and Benefits

    Authors: Ralph Foorthuis, Marlies van Steenbergen, Sjaak Brinkkemper, Wiel Bruls

    Abstract: Academics and practitioners have made various claims regarding the benefits that Enterprise Architecture (EA) delivers for both individual projects and the organization as a whole. At the same time, there is a lack of explanatory theory regarding how EA delivers these benefits. Moreover, EA practices and benefits have not been extensively investigated by empirical research, with especially quantit…

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: 28 pages, 4 figures, 12 tables

    ACM Class: K.4.3; K.5.2

    Journal ref: Information Systems Frontiers, Vol. 18, No. 3, 2016, pp. 541-564

  6. arXiv:2008.06869

    cs.DB cs.AI cs.LG stat.OT

    SECODA: Segmentation- and Combination-Based Detection of Anomalies

    Authors: Ralph Foorthuis

    Abstract: This study introduces SECODA, a novel general-purpose unsupervised non-parametric anomaly detection algorithm for datasets containing continuous and categorical attributes. The method is guaranteed to identify cases with unique or sparse combinations of attribute values. Continuous attributes are discretized repeatedly in order to correctly determine the frequency of such value combinations. The c…

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: 12 pages (including DSAA conference poster), 9 figures, 3 tables. Presented at DSAA 2017, the IEEE International Conference on Data Science and Advanced Analytics

    MSC Class: 62G07 ACM Class: G.3; I.2.6; I.5

  7. arXiv:2008.03775

    cs.CY

    Tactics for Internal Compliance: A Literature Review

    Authors: Ralph Foorthuis

    Abstract: Compliance of organizations with internal and external norms is a highly relevant topic for both practitioners and academics nowadays. However, the substantive, elementary compliance tactics that organizations can use for achieving internal compliance have been described in a fragmented manner and in the literatures of distinct academic disciplines. Using a multidisciplinary structured literature…

    Submitted 9 August, 2020; originally announced August 2020.

    Comments: 47 pages (excl. references), 4 figures, 4 tables. Chapter of 'Project Compliance with Enterprise Architecture' (ISBN 978-90-393-5834-4)

    ACM Class: K.4; K.5

  8. arXiv:2007.15634

    cs.DB cs.AI cs.LG stat.OT

    On the Nature and Types of Anomalies: A Review of Deviations in Data

    Authors: Ralph Foorthuis

    Abstract: Anomalies are occurrences in a dataset that are in some way unusual and do not fit the general patterns. The concept of the anomaly is typically ill-defined and perceived as vague and domain-dependent. Moreover, despite some 250 years of publications on the topic, no comprehensive and concrete overviews of the different types of anomalies have hitherto been published. By means of an extensive lite…

    Submitted 29 May, 2023; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: 39 pages (30 pages content), 10 figures and 3 tables. Preprint; comments will be appreciated. Improvements in version 4: Small textual updates, added publication details on JDSA journal. International Journal of Data Science and Analytics, Springer (2021)

    MSC Class: 62A01 ACM Class: G.3; I.2.6; I.5