Showing 1–9 of 9 results for author: Akan, A K

Searching in archive cs.
  1. arXiv:2501.15878  [pdf, other]

    cs.CV cs.LG

    Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation

    Authors: Adil Kaan Akan, Yucel Yemez

    Abstract: We present SlotAdapt, an object-centric learning method that combines slot attention with pretrained diffusion models by introducing adapters for slot-based conditioning. Our method preserves the generative power of pretrained diffusion models, while avoiding their text-centric conditioning bias. We also incorporate an additional guidance loss into our architecture to align cross-attention from ad…

    Submitted 1 March, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

    Comments: Accepted to ICLR2025. Project page: https://kaanakan.github.io/SlotAdapt/

  2. arXiv:2307.14187  [pdf, other]

    cs.CV cs.RO

    ADAPT: Efficient Multi-Agent Trajectory Prediction with Adaptation

    Authors: Görkay Aydemir, Adil Kaan Akan, Fatma Güney

    Abstract: Forecasting future trajectories of agents in complex traffic scenes requires reliable and efficient predictions for all agents in the scene. However, existing methods for trajectory prediction are either inefficient or sacrifice accuracy. To address this challenge, we propose ADAPT, a novel approach for jointly predicting the trajectories of all agents in the scene with dynamic weight learning. Ou…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: ICCV 2023

  3. arXiv:2209.10693  [pdf, other]

    cs.CV cs.LG

    Stochastic Future Prediction in Real World Driving Scenarios

    Authors: Adil Kaan Akan

    Abstract: Uncertainty plays a key role in future prediction. The future is uncertain. That means there might be many possible futures. A future prediction method should cover the whole possibilities to be robust. In autonomous driving, covering multiple modes in the prediction part is crucially important to make safety-critical decisions. Although computer vision systems have advanced tremendously in recent…

    Submitted 27 September, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: MS thesis, overlap with arXiv:2203.13641, arXiv:2203.10528, arXiv:2108.02760

  4. arXiv:2207.00255  [pdf, other]

    cs.CV cs.RO

    Trajectory Forecasting on Temporal Graphs

    Authors: Görkay Aydemir, Adil Kaan Akan, Fatma Güney

    Abstract: Predicting future locations of agents in the scene is an important problem in self-driving. In recent years, there has been a significant progress in representing the scene and the agents in it. The interactions of agents with the scene and with each other are typically modeled with a Graph Neural Network. However, the graph structure is mostly static and fails to represent the temporal changes in…

    Submitted 1 July, 2022; originally announced July 2022.

  5. arXiv:2203.13641  [pdf, other]

    cs.CV cs.LG

    StretchBEV: Stretching Future Instance Prediction Spatially and Temporally

    Authors: Adil Kaan Akan, Fatma Güney

    Abstract: In self-driving, predicting future in terms of location and motion of all the agents around the vehicle is a crucial requirement for planning. Recently, a new joint formulation of perception and prediction has emerged by fusing rich sensory information perceived from multiple cameras into a compact bird's-eye view representation to perform prediction. However, the quality of future predictions deg…

    Submitted 10 August, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: ECCV 2022

  6. arXiv:2203.10528  [pdf, other]

    cs.CV cs.LG

    Stochastic Video Prediction with Structure and Motion

    Authors: Adil Kaan Akan, Sadra Safadoust, Fatma Güney

    Abstract: While stochastic video prediction models enable future prediction under uncertainty, they mostly fail to model the complex dynamics of real-world scenes. For example, they cannot provide reliable predictions for scenes with a moving camera and independently moving foreground objects in driving scenarios. The existing methods fail to fully capture the dynamics of the structured world by only focusi…

    Submitted 29 April, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Under review at TPAMI

  7. arXiv:2108.02760  [pdf, other]

    cs.CV

    SLAMP: Stochastic Latent Appearance and Motion Prediction

    Authors: Adil Kaan Akan, Erkut Erdem, Aykut Erdem, Fatma Güney

    Abstract: Motion is an important cue for video prediction and often utilized by separating video content into static and dynamic components. Most of the previous work utilizing motion is deterministic but there are stochastic methods that can model the inherent uncertainty of the future. Existing stochastic models either do not reason about motion explicitly or make limiting assumptions about the static par…

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  8. arXiv:2102.08079  [pdf, other]

    cs.CV eess.IV

    Just Noticeable Difference for Machine Perception and Generation of Regularized Adversarial Images with Minimal Perturbation

    Authors: Adil Kaan Akan, Emre Akbas, Fatos T. Yarman Vural

    Abstract: In this study, we introduce a measure for machine perception, inspired by the concept of Just Noticeable Difference (JND) of human perception. Based on this measure, we suggest an adversarial image generation algorithm, which iteratively distorts an image by an additive noise until the model detects the change in the image by outputting a false label. The noise added to the original image is defin…

    Submitted 29 November, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: Accepted to Signal, Image and Video Processing

  9. arXiv:2001.11064  [pdf, other]

    eess.IV cs.CV

    Just Noticeable Difference for Machines to Generate Adversarial Images

    Authors: Adil Kaan Akan, Mehmet Ali Genc, Fatos T. Yarman Vural

    Abstract: One way of designing a robust machine learning algorithm is to generate authentic adversarial images which can trick the algorithms as much as possible. In this study, we propose a new method to generate adversarial images which are very similar to true images, yet, these images are discriminated from the original ones and are assigned into another category by the model. The proposed method is bas…

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: 5 pages, 4 figures, submitted to ICIP2020