-
Douglas-Quaid -- Open Source Image Matching Library
Authors:
Vincent Falconieri
Abstract:
Security analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. However, no open-source and turnkey library was found able to reach this goal. The present paper introduces an Open-Source modular library for the specific cases of visual correlation and Image Matching named Douglas-Quaid. The design of the library, chosen…
▽ More
Security analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. However, no open-source and turnkey library was found able to reach this goal. The present paper introduces an Open-Source modular library for the specific cases of visual correlation and Image Matching named Douglas-Quaid. The design of the library, chosen tradeoffs, encountered challenges, envisioned solutions as well as quality and speed results are presented in this paper. We also explore researches directions and future potential developments of the library. Our claim is that even partial automation of screenshots classification would reduce the burden on security teams and that Douglas-Quaid is a step forward in this direction.
△ Less
Submitted 12 August, 2019;
originally announced August 2019.
-
Carl-Hauser -- Open Source Image Matching Algorithms Benchmarking Framework
Authors:
Vincent Falconieri
Abstract:
Security analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. Many Image-Matching algorithms are presented in the litterature. The present paper introduces and provides a Open-Source benchmarking and evaluation tool for these algorithms. Is this paper, the framework evaluates algorithms on illustrative datasets, which…
▽ More
Security analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. Many Image-Matching algorithms are presented in the litterature. The present paper introduces and provides a Open-Source benchmarking and evaluation tool for these algorithms. Is this paper, the framework evaluates algorithms on illustrative datasets, which are constituted of phishing and onion websites. Datasets are provided as Open-Data.
△ Less
Submitted 9 August, 2019;
originally announced August 2019.
-
VisJSClassificator -- Manual Visual Collaborative Classification Graph-based Tool
Authors:
Vincent Falconieri
Abstract:
Analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. However, classified data is a prerequisite to develop these tools. Labelling tools are of great use in case of already known classes, but seemed limited for Open Set Classification. This paper presents a manual and collaborative classification tool, which uses graph…
▽ More
Analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. However, classified data is a prerequisite to develop these tools. Labelling tools are of great use in case of already known classes, but seemed limited for Open Set Classification. This paper presents a manual and collaborative classification tool, which uses graph representation.
△ Less
Submitted 8 August, 2019;
originally announced August 2019.
-
Open Dataset of Phishing and Tor Hidden Services Screen-captures
Authors:
Vincent Falconieri
Abstract:
Security analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. However, the main resources to develop these tools are datasets, which are introduced and provided by the present paper, for the specific cases of visual correlation of phishing and onion websites. CIRCL's Open-Source tools are the sources of these screensh…
▽ More
Security analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. However, the main resources to develop these tools are datasets, which are introduced and provided by the present paper, for the specific cases of visual correlation of phishing and onion websites. CIRCL's Open-Source tools are the sources of these screenshots, which had been manually verified against personal information leaks. Usage examples of these datasets are proposed in the current paper. These researches directions are, however, not the main contribution of the paper. The main contribution is the availability of the two datasets.
△ Less
Submitted 7 August, 2019;
originally announced August 2019.