Showing 1–12 of 12 results for author: Kutzkov, K

  1. arXiv:2501.12884

    cs.LG cs.AI

    Learning Graph Node Embeddings by Smooth Pair Sampling

    Authors: Konstantin Kutzkov

    Abstract: Random walk-based node embedding algorithms have attracted a lot of attention due to their scalability and ease of implementation. Previous research has focused on different walk strategies, optimization objectives, and embedding learning models. Inspired by observations on real data, we take a different approach and propose a new regularization technique. More precisely, the frequencies of node p…

    Submitted 22 January, 2025; originally announced January 2025.

    Comments: Accepted for oral presentation at AISTATS 2025

  2. arXiv:2211.15114

    cs.LG cs.DS

    LoNe Sampler: Graph node embeddings by coordinated local neighborhood sampling

    Authors: Konstantin Kutzkov

    Abstract: Local graph neighborhood sampling is a fundamental computational problem that is at the heart of algorithms for node representation learning. Several works have presented algorithms for learning discrete node embeddings where graph nodes are represented by discrete features such as attributes of neighborhood nodes. Discrete embeddings offer several advantages compared to continuous word2vec-like n…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted to AAAI 2023. arXiv admin note: substantial text overlap with arXiv:2102.04770

  3. arXiv:2102.04770

    cs.LG cs.AI cs.DS

    COLOGNE: Coordinated Local Graph Neighborhood Sampling

    Authors: Konstantin Kutzkov

    Abstract: Representation learning for graphs enables the application of standard machine learning algorithms and data analysis tools to graph data. Replacing discrete unordered objects such as graph nodes by real-valued vectors is at the heart of many approaches to learning from graph data. Such vector representations, or embeddings, capture the discrete relationships in the original data by representing no…

    Submitted 9 February, 2021; originally announced February 2021.

  4. Query-Efficient Correlation Clustering

    Authors: David García-Soriano, Konstantin Kutzkov, Francesco Bonchi, Charalampos Tsourakakis

    Abstract: Correlation clustering is arguably the most natural formulation of clustering. Given n objects and a pairwise similarity measure, the goal is to cluster the objects so that, to the best possible extent, similar objects are put in the same cluster and dissimilar objects are put in different clusters. A main drawback of correlation clustering is that it requires as input the $Θ(n^2)$ pairwise simi…

    Submitted 26 February, 2020; originally announced February 2020.

    Comments: To appear in WWW 2020

  5. arXiv:1805.10014

    cs.LG stat.ML

    KONG: Kernels for ordered-neighborhood graphs

    Authors: Moez Draief, Konstantin Kutzkov, Kevin Scaman, Milan Vojnovic

    Abstract: We present novel graph kernels for graphs with node and edge labels that have ordered neighborhoods, i.e. when neighbor nodes follow an order. Graphs with ordered neighborhoods are a natural data representation for evolving graphs where edges are created over time, which induces an order. Combining convolutional subgraph kernels and string kernels, we design new scalable algorithms for generation…

    Submitted 29 May, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

  6. arXiv:1605.05273

    cs.LG cs.AI stat.ML

    Learning Convolutional Neural Networks for Graphs

    Authors: Mathias Niepert, Mohamed Ahmed, Konstantin Kutzkov

    Abstract: Numerous important problems can be framed as learning from graph data. We propose a framework for learning convolutional neural networks for arbitrary graphs. These graphs may be undirected, directed, and with both discrete and continuous node and edge attributes. Analogous to image-based convolutional networks that operate on locally connected regions of the input, we present a general approach t…

    Submitted 8 June, 2016; v1 submitted 17 May, 2016; originally announced May 2016.

    Comments: To be presented at ICML 2016

  7. arXiv:1404.4696

    cs.DS

    Triangle counting in dynamic graph streams

    Authors: Laurent Bulteau, Vincent Froese, Konstantin Kutzkov, Rasmus Pagh

    Abstract: Estimating the number of triangles in graph streams using a limited amount of memory has become a popular topic in the last decade. Different variations of the problem have been studied, depending on whether the graph edges are provided in an arbitrary order or as incidence lists. However, with a few exceptions, the algorithms have considered insert-only streams. We present a new algorithm e…

    Submitted 14 July, 2015; v1 submitted 18 April, 2014; originally announced April 2014.

    Comments: New version of a SWAT 2014 paper with improved results

  8. arXiv:1404.4693

    cs.DS

    Consistent Subset Sampling

    Authors: Konstantin Kutzkov, Rasmus Pagh

    Abstract: Consistent sampling is a technique for specifying, in small space, a subset $S$ of a potentially large universe $U$ such that the elements in $S$ satisfy a suitably chosen sampling condition. Given a subset $\mathcal{I}\subseteq U$ it should be possible to quickly compute $\mathcal{I}\cap S$, i.e., the elements in $\mathcal{I}$ satisfying the sampling condition. Consistent sampling has important a…

    Submitted 18 April, 2014; originally announced April 2014.

    Comments: To appear in SWAT 2014

  9. arXiv:1312.5105

    cs.DS

    Local correlation clustering

    Authors: Francesco Bonchi, David García-Soriano, Konstantin Kutzkov

    Abstract: Correlation clustering is perhaps the most natural formulation of clustering. Given $n$ objects and a pairwise similarity measure, the goal is to cluster the objects so that, to the best possible extent, similar objects are put in the same cluster and dissimilar objects are put in different clusters. Despite its theoretical appeal, the practical relevance of correlation clustering still remains la…

    Submitted 18 December, 2013; originally announced December 2013.

  10. arXiv:1210.0461

    cs.DS

    On Parallelizing Matrix Multiplication by the Column-Row Method

    Authors: Andrea Campagna, Konstantin Kutzkov, Rasmus Pagh

    Abstract: We consider the problem of sparse matrix multiplication by the column row method in a distributed setting where the matrix product is not necessarily sparse. We present a surprisingly simple method for "consistent" parallel processing of sparse outer products (column-row vector products) over several processors, in a communication-avoiding setting where each processor has a copy of the input. The…

    Submitted 19 November, 2012; v1 submitted 1 October, 2012; originally announced October 2012.

    Comments: To appear in ALENEX 2013

  11. arXiv:1209.4508

    cs.DS math.NA

    Deterministic algorithms for skewed matrix products

    Authors: Konstantin Kutzkov

    Abstract: Recently, Pagh presented a randomized approximation algorithm for the multiplication of real-valued matrices building upon work for detecting the most frequent items in data streams. We continue this line of research and present new deterministic matrix multiplication algorithms. Motivated by applications in data mining, we first consider the case of real-valued, nonnegative $n$-by-$n$ inp…

    Submitted 20 September, 2012; originally announced September 2012.

    ACM Class: F.2.0

  12. arXiv:1007.1166

    cs.DS

    Using CSP To Improve Deterministic 3-SAT

    Authors: Konstantin Kutzkov, Dominik Scheder

    Abstract: We show how one can use certain deterministic algorithms for higher-value constraint satisfaction problems (CSPs) to speed up deterministic local search for 3-SAT. This way, we improve the deterministic worst-case running time for 3-SAT to O(1.439^n).

    Submitted 26 July, 2010; v1 submitted 7 July, 2010; originally announced July 2010.

    Comments: Corrected typos, extended the introduction, and added a notation section to make the paper self-contained