Showing 1–6 of 6 results for author: Brad, F

  1. arXiv:2311.01452  [pdf, other]

    cs.LG cs.AI

    Time Series Anomaly Detection using Diffusion-based Models

    Authors: Ioana Pintilie, Andrei Manolache, Florin Brad

    Abstract: Diffusion models have been recently used for anomaly detection (AD) in images. In this paper we investigate whether they can also be leveraged for AD on multivariate time series (MTS). We test two diffusion-based models and compare them to several strong neural baselines. We also extend the PA%K protocol by computing a ROCK-AUC metric, which is agnostic to both the detection threshold and the rat…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted at the AI4TS workshop of the 23rd IEEE International Conference on Data Mining (ICDM 2023), 9 pages, 7 figures, 2 tables

  2. arXiv:2207.03477  [pdf, other]

    cs.CL

    VeriDark: A Large-Scale Benchmark for Authorship Verification on the Dark Web

    Authors: Andrei Manolache, Florin Brad, Antonio Barbalau, Radu Tudor Ionescu, Marius Popescu

    Abstract: The DarkWeb represents a hotbed for illicit activity, where users communicate on different market forums in order to exchange goods and services. Law enforcement agencies benefit from forensic tools that perform authorship analysis, in order to identify and profile users based on their textual content. However, authorship analysis has been traditionally studied using corpora featuring literary tex…

    Submitted 1 November, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks. 21 pages, 4 figures, 11 tables

  3. arXiv:2206.15476  [pdf, other]

    cs.LG

    AnoShift: A Distribution Shift Benchmark for Unsupervised Anomaly Detection

    Authors: Marius Dragoi, Elena Burceanu, Emanuela Haller, Andrei Manolache, Florin Brad

    Abstract: Analyzing the distribution shift of data is a growing research direction in today's Machine Learning (ML), leading to the emergence of new benchmarks that focus on providing a suitable scenario for studying the generalization properties of ML models. The existing benchmarks are focused on supervised learning, and to the best of our knowledge, there is none for unsupervised learning. Therefore, we introdu…

    Submitted 3 April, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

  4. arXiv:2112.05125  [pdf, other]

    cs.CL

    Rethinking the Authorship Verification Experimental Setups

    Authors: Florin Brad, Andrei Manolache, Elena Burceanu, Antonio Barbalau, Radu Ionescu, Marius Popescu

    Abstract: One of the main drivers of the recent advances in authorship verification is the PAN large-scale authorship dataset. Despite generating significant progress in the field, inconsistent performance differences between the closed and open test sets have been reported. To this end, we improve the experimental setup by proposing five new public splits over the PAN dataset, specifically designed to isol…

    Submitted 1 November, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted as a short paper at the EMNLP 2022 conference. 10 pages, 5 figures, 9 tables

  5. arXiv:2104.05591  [pdf, other]

    cs.CL

    DATE: Detecting Anomalies in Text via Self-Supervision of Transformers

    Authors: Andrei Manolache, Florin Brad, Elena Burceanu

    Abstract: Leveraging deep learning models for Anomaly Detection (AD) has seen widespread use in recent years due to superior performance over traditional methods. Recent deep methods for anomalies in images learn better features of normality in an end-to-end self-supervised setting. These methods train a model to discriminate between different transformations applied to visual data and then use the output…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: conference paper at NAACL-HLT 2021, 11 pages, 6 figures, 3 tables

  6. arXiv:1707.03172  [pdf, other]

    cs.CL

    Dataset for a Neural Natural Language Interface for Databases (NNLIDB)

    Authors: Florin Brad, Radu Iacob, Ionel Hosu, Traian Rebedea

    Abstract: Progress in natural language interfaces to databases (NLIDB) has been slow mainly due to linguistic issues (such as language ambiguity) and domain portability. Moreover, the lack of a large corpus to be used as a standard benchmark has made data-driven approaches difficult to develop and compare. In this paper, we revisit the problem of NLIDBs and recast it as a sequence translation problem. To th…

    Submitted 11 July, 2017; originally announced July 2017.

    Comments: 13 pages, 2 figures

    Journal ref: Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2017