Skip to main content

Showing 1–9 of 9 results for author: Gürbüz, Y Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.04341  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Efficient Perplexity Bound and Ratio Matching in Discrete Diffusion Language Models

    Authors: Etrit Haxholli, Yeti Z. Gürbüz, Oğul Can, Eli Waxman

    Abstract: While continuous diffusion models excel in modeling continuous distributions, their application to categorical data has been less effective. Recent work has shown that ratio-matching through score-entropy within a continuous-time discrete Markov chain (CTMC) framework serves as a competitive alternative to autoregressive models in language modeling. To enhance this framework, we first introduce th… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  2. arXiv:2411.00759  [pdf, ps, other

    cs.LG stat.ML

    Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching

    Authors: Etrit Haxholli, Yeti Z. Gürbüz, Oğul Can, Eli Waxman

    Abstract: Outperforming autoregressive models on categorical data distributions, such as textual data, remains challenging for continuous diffusion and flow models. Discrete flow matching, a recent framework for modeling categorical data, has shown competitive performance with autoregressive models. Despite its similarities with continuous flow matching, the rectification strategy applied in the continuous… ▽ More

    Submitted 13 November, 2024; v1 submitted 1 November, 2024; originally announced November 2024.

  3. arXiv:2309.02843  [pdf, other

    cs.CV cs.LG stat.ML

    Knowledge Distillation Layer that Lets the Student Decide

    Authors: Ada Gorgun, Yeti Z. Gurbuz, A. Aydin Alatan

    Abstract: Typical technique in knowledge distillation (KD) is regularizing the learning of a limited capacity model (student) by pushing its responses to match a powerful model's (teacher). Albeit useful especially in the penultimate layer and beyond, its action on student's feature transform is rather implicit, limiting its practice in the intermediate layers. To explicitly embed the teacher's knowledge in… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at the British Machine Vision Conference 2023 (BMVC 2023)

  4. arXiv:2308.09228  [pdf, other

    cs.CV cs.LG stat.ML

    Generalized Sum Pooling for Metric Learning

    Authors: Yeti Z. Gurbuz, Ozan Sener, A. Aydın Alatan

    Abstract: A common architectural choice for deep metric learning is a convolutional neural network followed by global average pooling (GAP). Albeit simple, GAP is a highly effective way to aggregate information. One possible explanation for the effectiveness of GAP is considering each feature vector as representing a different semantic entity and GAP as a convex combination of them. Following this perspecti… ▽ More

    Submitted 21 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted as a conference paper at International Conference on Computer Vision (ICCV) 2023

  5. arXiv:2307.07620  [pdf, other

    cs.LG cs.CV

    Generalizable Embeddings with Cross-batch Metric Learning

    Authors: Yeti Z. Gurbuz, A. Aydin Alatan

    Abstract: Global average pooling (GAP) is a popular component in deep metric learning (DML) for aggregating features. Its effectiveness is often attributed to treating each feature vector as a distinct semantic entity and GAP as a combination of them. Albeit substantiated, such an explanation's algorithmic implications to learn generalizable entities to represent unseen classes, a crucial DML goal, remain u… ▽ More

    Submitted 24 July, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: \c{opyright} 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  6. arXiv:2210.00992  [pdf, other

    cs.CV

    Feature Embedding by Template Matching as a ResNet Block

    Authors: Ada Gorgun, Yeti Z. Gurbuz, A. Aydin Alatan

    Abstract: Convolution blocks serve as local feature extractors and are the key to success of the neural networks. To make local semantic feature embedding rather explicit, we reformulate convolution blocks as feature selection according to the best matching kernel. In this manner, we show that typical ResNet blocks indeed perform local feature embedding via template matching once batch normalization (BN) fo… ▽ More

    Submitted 15 August, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted at the British Machine Vision Conference 2022 (BMVC 2022)

  7. arXiv:2209.09060  [pdf, other

    cs.CV cs.LG stat.ML

    Deep Metric Learning with Chance Constraints

    Authors: Yeti Z. Gurbuz, Ogul Can, A. Aydin Alatan

    Abstract: Deep metric learning (DML) aims to minimize empirical expected loss of the pairwise intra-/inter- class proximity violations in the embedding space. We relate DML to feasibility problem of finite chance constraints. We show that minimizer of proxy-based DML satisfies certain chance constraints, and that the worst case generalization performance of the proxy-based methods can be characterized by th… ▽ More

    Submitted 6 September, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted as a conference paper at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  8. Blind Deinterleaving of Signals in Time Series with Self-attention Based Soft Min-cost Flow Learning

    Authors: Oğul Can, Yeti Z. Gürbüz, Berkin Yıldırım, A. Aydın Alatan

    Abstract: We propose an end-to-end learning approach to address deinterleaving of patterns in time series, in particular, radar signals. We link signal clustering problem to min-cost flow as an equivalent problem once the proper costs exist. We formulate a bi-level optimization problem involving min-cost flow as a sub-problem to learn such costs from the supervised training data. We then approximate the low… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 4 pages, 2 figures, 1 table

  9. Deep Metric Learning with Alternating Projections onto Feasible Sets

    Authors: Oğul Can, Yeti Ziya Gürbüz, A. Aydın Alatan

    Abstract: During the training of networks for distance metric learning, minimizers of the typical loss functions can be considered as "feasible points" satisfying a set of constraints imposed by the training data. To this end, we reformulate distance metric learning problem as finding a feasible point of a constraint set where the embedding vectors of the training data satisfy desired intra-class and inter-… ▽ More

    Submitted 15 December, 2021; v1 submitted 17 July, 2019; originally announced July 2019.

    Comments: 10 pages, 3 figures, 2 tables