Skip to main content

Showing 1–4 of 4 results for author: Marciniak, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.19224  [pdf, ps, other

    cs.DB

    Representing and querying data tensors in RDF and SPARQL

    Authors: Piotr Marciniak, Piotr Sowinski, Maria Ganzha

    Abstract: Embedding tensors in databases has recently gained in significance, due to the rapid proliferation of machine learning methods (including LLMs) which produce embeddings in the form of tensors. To support emerging use cases hybridizing machine learning with knowledge graphs, a robust and efficient tensor representation scheme is needed. We introduce a novel approach for representing data tensors as… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: Accepted at ESWC 2025 Posters and Demos. For publication in: The Semantic Web: ESWC 2025 Satellite Events, Portoroz, Slovenia, June 1 - 5, 2025, Proceedings

  2. arXiv:2503.01388  [pdf, other

    cs.DS

    Faster ED-String Matching with $k$ Mismatches

    Authors: Paweł Gawrychowski, Adam Górkiewicz, Pola Marciniak, Solon P. Pissis, Karol Pokorski

    Abstract: We revisit the complexity of approximate pattern matching in an elastic-degenerate string. Such a string is a sequence of $n$ finite sets of strings of total length $N$, and compactly describes a collection of strings obtained by first choosing exactly one string in every set, and then concatenating them together. This is motivated by the need of storing a collection of highly similar DNA sequence… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  3. arXiv:2406.16489  [pdf, other

    cs.CL

    Deepfake tweets automatic detection

    Authors: Adam Frej, Adrian Kaminski, Piotr Marciniak, Szymon Szmajdzinski, Soveatin Kuntur, Anna Wroblewska

    Abstract: This study addresses the critical challenge of detecting DeepFake tweets by leveraging advanced natural language processing (NLP) techniques to distinguish between genuine and AI-generated texts. Given the increasing prevalence of misinformation, our research utilizes the TweepFake dataset to train and evaluate various machine learning models. The objective is to identify effective strategies for… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:0802.1015  [pdf, ps, other

    cs.NI

    Small Is Not Always Beautiful

    Authors: Pawel Marciniak, Nikitas Liogkas, Arnaud Legout, Eddie Kohler

    Abstract: Peer-to-peer content distribution systems have been enjoying great popularity, and are now gaining momentum as a means of disseminating video streams over the Internet. In many of these protocols, including the popular BitTorrent, content is split into mostly fixed-size pieces, allowing a client to download data from many peers simultaneously. This makes piece size potentially critical for perfo… ▽ More

    Submitted 7 February, 2008; originally announced February 2008.

    Journal ref: Dans IPTPS'2008 (2008)