Skip to main content

Showing 1–12 of 12 results for author: Ordozgoiti, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.05502  [pdf, ps, other

    cs.DS cs.AI cs.CC cs.LG

    Diversity-aware clustering: Computational Complexity and Approximation Algorithms

    Authors: Suhas Thejaswi, Ameet Gadekar, Bruno Ordozgoiti, Aristides Gionis

    Abstract: In this work, we study diversity-aware clustering problems where the data points are associated with multiple attributes resulting in intersecting groups. A clustering solution needs to ensure that the number of chosen cluster centers from each group should be within the range defined by a lower and upper bound threshold for each group, while simultaneously minimizing the clustering objective, whi… ▽ More

    Submitted 20 May, 2025; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Algorithmic Fairness, Fair Clustering, Diversity-aware Clustering, Intersectionaly, Subgroup fairness

  2. arXiv:2306.04489  [pdf, other

    cs.LG

    Fair Column Subset Selection

    Authors: Antonis Matakos, Bruno Ordozgoiti, Suhas Thejaswi

    Abstract: The problem of column subset selection asks for a subset of columns from an input matrix such that the matrix can be reconstructed as accurately as possible within the span of the selected columns. A natural extension is to consider a setting where the matrix rows are partitioned into two groups, and the goal is to choose a subset of columns that minimizes the maximum reconstruction error of both… ▽ More

    Submitted 12 August, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: KDD 2024

  3. arXiv:2206.08054  [pdf, other

    cs.LG cs.DS

    Generalized Leverage Scores: Geometric Interpretation and Applications

    Authors: Bruno Ordozgoiti, Antonis Matakos, Aristides Gionis

    Abstract: In problems involving matrix computations, the concept of leverage has found a large number of applications. In particular, leverage scores, which relate the columns of a matrix to the subspaces spanned by its leading singular vectors, are helpful in revealing column subsets to approximately factorize a matrix with quality guarantees. As such, they provide a solid foundation for a variety of machi… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: ICML 2022

  4. arXiv:2112.07030  [pdf, other

    cs.DS math.CO

    Clustering with fair-center representation: parameterized approximation algorithms and heuristics

    Authors: Suhas Thejaswi, Ameet Gadekar, Bruno Ordozgoiti, Michal Osadnik

    Abstract: We study a variant of classical clustering formulations in the context of algorithmic fairness, known as diversity-aware clustering. In this variant we are given a collection of facility subsets, and a solution must contain at least a specified number of facilities from each subset while simultaneously minimizing the clustering objective ($k$-median or $k$-means). We investigate the fixed-paramete… ▽ More

    Submitted 24 October, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    ACM Class: G.2.1; F.2.0; F.1.3

  5. arXiv:2106.11696  [pdf, other

    cs.DS

    Diversity-aware $k$-median : Clustering with fair center representation

    Authors: Suhas Thejaswi, Bruno Ordozgoiti, Aristides Gionis

    Abstract: We introduce a novel problem for diversity-aware clustering. We assume that the potential cluster centers belong to a set of groups defined by protected attributes, such as ethnicity, gender, etc. We then ask to find a minimum-cost clustering of the data into $k$ clusters so that a specified minimum number of cluster centers are chosen from each group. We thus require that all groups are represent… ▽ More

    Submitted 24 October, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: To appear in ECML-PKDD 2021

  6. arXiv:2006.13567  [pdf, other

    cs.LG stat.ML

    Off-the-grid: Fast and Effective Hyperparameter Search for Kernel Clustering

    Authors: Bruno Ordozgoiti, Lluís A. Belanche Muñoz

    Abstract: Kernel functions are a powerful tool to enhance the $k$-means clustering algorithm via the kernel trick. It is known that the parameters of the chosen kernel function can have a dramatic impact on the result. In supervised settings, these can be tuned via cross-validation, but for clustering this is not straightforward and heuristics are usually employed. In this paper we study the impact of kerne… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: ECML-PKDD 2020

  7. arXiv:2002.00775  [pdf, other

    cs.SI

    Finding large balanced subgraphs in signed networks

    Authors: Bruno Ordozgoiti, Antonis Matakos, Aristides Gionis

    Abstract: Signed networks are graphs whose edges are labelled with either a positive or a negative sign, and can be used to capture nuances in interactions that are missed by their unsigned counterparts. The concept of balance in signed graph theory determines whether a network can be partitioned into two perfectly opposing subsets, and is therefore useful for modelling phenomena such as the existence of po… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: 11 pages, 6 figures, The Web Conference 2020

  8. arXiv:2001.09410  [pdf, other

    cs.SI cs.CY cs.IR

    Searching for polarization in signed graphs: a local spectral approach

    Authors: Han Xiao, Bruno Ordozgoiti, Aristides Gionis

    Abstract: Signed graphs have been used to model interactions in social net-works, which can be either positive (friendly) or negative (antagonistic). The model has been used to study polarization and other related phenomena in social networks, which can be harmful to the process of democratic deliberation in our society. An interesting and challenging task in this application domain is to detect polarized c… ▽ More

    Submitted 26 January, 2020; originally announced January 2020.

    Comments: 11 pages, 6 figures, accepted by WWW 2020, April 20-24, 2020, Taipei, Taiwan

  9. arXiv:1910.02438  [pdf, other

    cs.DS cs.SI

    Discovering Polarized Communities in Signed Networks

    Authors: Francesco Bonchi, Edoardo Galimberti, Aristides Gionis, Bruno Ordozgoiti, Giancarlo Ruffo

    Abstract: Signed networks contain edge annotations to indicate whether each interaction is friendly (positive edge) or antagonistic (negative edge). The model is simple but powerful and it can capture novel and interesting structural properties of real-world phenomena. The analysis of signed networks has many applications from modeling discussions in social media, to mining user reviews, and to recommending… ▽ More

    Submitted 6 October, 2019; originally announced October 2019.

    Journal ref: CIKM 2019, November 3-7, 2019, Beijing, China

  10. Reconciliation k-median: Clustering with Non-Polarized Representatives

    Authors: Bruno Ordozgoiti, Aristides Gionis

    Abstract: We propose a new variant of the k-median problem, where the objective function models not only the cost of assigning data points to cluster representatives, but also a penalty term for disagreement among the representatives. We motivate this novel problem by applications where we are interested in clustering data while avoiding selecting representatives that are too far from each other. For exampl… ▽ More

    Submitted 28 July, 2021; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: The Web Conference 2019

  11. arXiv:1804.04421  [pdf, other

    cs.LG cs.AI stat.ML

    Regularized Greedy Column Subset Selection

    Authors: Bruno Ordozgoiti, Alberto Mozo, Jesús García López de Lacalle

    Abstract: The Column Subset Selection Problem provides a natural framework for unsupervised feature selection. Despite being a hard combinatorial optimization problem, there exist efficient algorithms that provide good approximations. The drawback of the problem formulation is that it incorporates no form of regularization, and is therefore very sensitive to noise when presented with scarce data. In this pa… ▽ More

    Submitted 12 April, 2018; originally announced April 2018.

  12. arXiv:1610.07419  [pdf, other

    cs.NI cs.LG

    Using Machine Learning to Detect Noisy Neighbors in 5G Networks

    Authors: Udi Margolin, Alberto Mozo, Bruno Ordozgoiti, Danny Raz, Elisha Rosensweig, Itai Segall

    Abstract: 5G networks are expected to be more dynamic and chaotic in their structure than current networks. With the advent of Network Function Virtualization (NFV), Network Functions (NF) will no longer be tightly coupled with the hardware they are running on, which poses new challenges in network management. Noisy neighbor is a term commonly used to describe situations in NFV infrastructure where an appli… ▽ More

    Submitted 24 October, 2016; originally announced October 2016.