Skip to main content

Showing 1–23 of 23 results for author: Ure, N K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.13474  [pdf, other

    cs.RO

    Iterative Active-Inactive Obstacle Classification for Time-Optimal Collision Avoidance

    Authors: Mehmetcan Kaymaz, Nazim Kemal Ure

    Abstract: Time-optimal obstacle avoidance is a prevalent problem encountered in various fields, including robotics and autonomous vehicles, where the task involves determining a path for a moving vehicle to reach its goal while navigating around obstacles within its environment. This problem becomes increasingly challenging as the number of obstacles in the environment rises. We propose an iterative active-… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: This paper is under review in IROS24

  2. arXiv:2401.08663  [pdf, other

    cs.AI cs.LG cs.RO eess.SY

    An Integrated Imitation and Reinforcement Learning Methodology for Robust Agile Aircraft Control with Limited Pilot Demonstration Data

    Authors: Gulay Goktas Sever, Umut Demir, Abdullah Sadik Satir, Mustafa Cagatay Sahin, Nazim Kemal Ure

    Abstract: In this paper, we present a methodology for constructing data-driven maneuver generation models for agile aircraft that can generalize across a wide range of trim conditions and aircraft model parameters. Maneuver generation models play a crucial role in the testing and evaluation of aircraft prototypes, providing insights into the maneuverability and agility of the aircraft. However, constructing… ▽ More

    Submitted 27 December, 2023; originally announced January 2024.

    Comments: Preprint submitted to Aerospace Science and Technology

  3. arXiv:2310.08198  [pdf, other

    cs.LG cs.AI

    Beyond Traditional DoE: Deep Reinforcement Learning for Optimizing Experiments in Model Identification of Battery Dynamics

    Authors: Gokhan Budan, Francesca Damiani, Can Kurtulus, N. Kemal Ure

    Abstract: Model identification of battery dynamics is a central problem in energy research; many energy management systems and design processes rely on accurate battery models for efficiency optimization. The standard methodology for battery modelling is traditional design of experiments (DoE), where the battery dynamics are excited with many different current profiles and the measured outputs are used to e… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  4. Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment

    Authors: Ahmet Semih Tasbas, Safa Onur Sahin, Nazim Kemal Ure

    Abstract: Reinforcement learning (RL) has recently proven itself as a powerful instrument for solving complex problems and even surpassed human performance in several challenging applications. This signifies that RL algorithms can be used in the autonomous air combat problem, which has been studied for many years. The complexity of air combat arises from aggressive close-range maneuvers and agile enemy beha… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 10 pages, 4 figures

  5. arXiv:2302.14604  [pdf, other

    cs.MA cs.AI cs.GT cs.LG

    IQ-Flow: Mechanism Design for Inducing Cooperative Behavior to Self-Interested Agents in Sequential Social Dilemmas

    Authors: Bengisu Guresti, Abdullah Vanlioglu, Nazim Kemal Ure

    Abstract: Achieving and maintaining cooperation between agents to accomplish a common objective is one of the central goals of Multi-Agent Reinforcement Learning (MARL). Nevertheless in many real-world scenarios, separately trained and specialized agents are deployed into a shared environment, or the environment requires multiple objectives to be achieved by different coexisting parties. These variations am… ▽ More

    Submitted 4 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: AAMAS 2023

  6. arXiv:2212.02909  [pdf, other

    cs.AI cs.MA cs.RO

    Scalable Planning and Learning Framework Development for Swarm-to-Swarm Engagement Problems

    Authors: Umut Demir, A. Sadik Satir, Gulay Goktas Sever, Cansu Yikilmaz, Nazim Kemal Ure

    Abstract: Development of guidance, navigation and control frameworks/algorithms for swarms attracted significant attention in recent years. That being said, algorithms for planning swarm allocations/trajectories for engaging with enemy swarms is largely an understudied problem. Although small-scale scenarios can be addressed with tools from differential game theory, existing approaches fail to scale for lar… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Accepted to SciTech2023

  7. Self-Improving Safety Performance of Reinforcement Learning Based Driving with Black-Box Verification Algorithms

    Authors: Resul Dagdanov, Halil Durmus, Nazim Kemal Ure

    Abstract: In this work, we propose a self-improving artificial intelligence system to enhance the safety performance of reinforcement learning (RL)-based autonomous driving (AD) agents using black-box verification methods. RL algorithms have become popular in AD applications in recent years. However, the performance of existing RL algorithms heavily depends on the diversity of training scenarios. A lack of… ▽ More

    Submitted 9 July, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: 7 pages, 7 figures, 2 tables, published in IEEE International Conference on Robotics and Automation (ICRA), June 2, 2023, London, UK

  8. arXiv:2210.16567  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving

    Authors: Resul Dagdanov, Feyza Eksen, Halil Durmus, Ferhat Yurdakul, Nazim Kemal Ure

    Abstract: Safely navigating through an urban environment without violating any traffic rules is a crucial performance target for reliable autonomous driving. In this paper, we present a Reinforcement Learning (RL) based methodology to DEtect and FIX (DeFIX) failures of an Imitation Learning (IL) agent by extracting infraction spots and re-constructing mini-scenarios on these infraction areas to train an RL… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

    Comments: 6 pages, 4 figures, 2 tables, published in IEEE International Conference on Intelligent Transportation Systems (ITSC), October 12, 2022, Macau, China

    Journal ref: 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), 2022, pp. 4215-4220

  9. arXiv:2210.08319  [pdf, other

    cs.RO cs.AI cs.LG

    A Scalable Reinforcement Learning Approach for Attack Allocation in Swarm to Swarm Engagement Problems

    Authors: Umut Demir, Nazim Kemal Ure

    Abstract: In this work we propose a reinforcement learning (RL) framework that controls the density of a large-scale swarm for engaging with adversarial swarm attacks. Although there is a significant amount of existing work in applying artificial intelligence methods to swarm control, analysis of interactions between two adversarial swarms is a rather understudied area. Most of the existing work in this sub… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: submitted to ICRA 2023

  10. Obstacle Identification and Ellipsoidal Decomposition for Fast Motion Planning in Unknown Dynamic Environments

    Authors: Mehmetcan Kaymaz, Nazim Kemal Ure

    Abstract: Collision avoidance in the presence of dynamic obstacles in unknown environments is one of the most critical challenges for unmanned systems. In this paper, we present a method that identifies obstacles in terms of ellipsoids to estimate linear and angular obstacle velocities. Our proposed method is based on the idea of any object can be approximately expressed by ellipsoids. To achieve this, we p… ▽ More

    Submitted 9 July, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: accepted to IEEE International Conference on Robotics and Automation (ICRA), 2023, London, UK

  11. arXiv:2206.14256  [pdf, other

    cs.LG cs.AI cs.CV

    GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning

    Authors: Doğay Kamar, Nazım Kemal Üre, Gözde Ünal

    Abstract: In this study, we address the problem of efficient exploration in reinforcement learning. Most common exploration approaches depend on random action selection, however these approaches do not work well in environments with sparse or no rewards. We propose Generative Adversarial Network-based Intrinsic Reward Module that learns the distribution of the observed states and sends an intrinsic reward t… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    MSC Class: 68T20 (Primary) 68T05; 68T07 (Secondary) ACM Class: I.2.8

    Journal ref: International Conference on Agents and Artificial Intelligence - ICAART, Volume 2, 264-272 (2022)

  12. arXiv:2205.15767  [pdf, other

    cs.SE

    Quality Characteristics of a Software Platform for Human-AI Teaming in Smart Manufacturing

    Authors: Philipp Haindl, Thomas Hoch, Javier Dominguez, Julen Aperribai, Nazim Kemal Ure, Mehmet Tunçel

    Abstract: As AI-enabled software systems become more prevalent in smart manufacturing, their role shifts from a reactive to a proactive one that provides context-specific support to machine operators. In the context of an international research project, we develop an AI-based software platform that shall facilitate the collaboration between human operators and manufacturing machines. We conducted 14 structu… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: Preprint: to appear in QUATIC'22 International Conference on the Quality of Information and Communications Technology

  13. arXiv:2111.14177  [pdf, other

    cs.MA cs.AI cs.LG

    Evaluating Generalization and Transfer Capacity of Multi-Agent Reinforcement Learning Across Variable Number of Agents

    Authors: Bengisu Guresti, Nazim Kemal Ure

    Abstract: Multi-agent Reinforcement Learning (MARL) problems often require cooperation among agents in order to solve a task. Centralization and decentralization are two approaches used for cooperation in MARL. While fully decentralized methods are prone to converge to suboptimal solutions due to partial observability and nonstationarity, the methods involving centralization suffer from scalability limitati… ▽ More

    Submitted 28 November, 2021; originally announced November 2021.

    Comments: accepted to COMARL AAAI 2021

  14. arXiv:2104.02491  [pdf, other

    eess.SY cs.AI math.OC

    Nonlinear Model Based Guidance with Deep Learning Based Target Trajectory Prediction Against Aerial Agile Attack Patterns

    Authors: A. Sadik Satir, Umut Demir, Gulay Goktas Sever, N. Kemal Ure

    Abstract: In this work, we propose a novel missile guidance algorithm that combines deep learning based trajectory prediction with nonlinear model predictive control. Although missile guidance and threat interception is a well-studied problem, existing algorithms' performance degrades significantly when the target is pulling high acceleration attack maneuvers while rapidly changing its direction. We argue t… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: Accepted for the 2021 American Control Conference (ACC)

    MSC Class: 93-10 ACM Class: I.2.6

  15. arXiv:2103.07903  [pdf, other

    cs.AI

    Investigating Value of Curriculum Reinforcement Learning in Autonomous Driving Under Diverse Road and Weather Conditions

    Authors: Anil Ozturk, Mustafa Burak Gunel, Resul Dagdanov, Mirac Ekim Vural, Ferhat Yurdakul, Melih Dal, Nazim Kemal Ure

    Abstract: Applications of reinforcement learning (RL) are popular in autonomous driving tasks. That being said, tuning the performance of an RL agent and guaranteeing the generalization performance across variety of different driving scenarios is still largely an open problem. In particular, getting good performance on complex road and weather conditions require exhaustive tuning and computation time. Curri… ▽ More

    Submitted 2 August, 2021; v1 submitted 14 March, 2021; originally announced March 2021.

    Comments: 6 pages, IV2021 Workshop

  16. PURSUhInT: In Search of Informative Hint Points Based on Layer Clustering for Knowledge Distillation

    Authors: Reyhan Kevser Keser, Aydin Ayanzadeh, Omid Abdollahi Aghdam, Caglar Kilcioglu, Behcet Ugur Toreyin, Nazim Kemal Ure

    Abstract: One of the most efficient methods for model compression is hint distillation, where the student model is injected with information (hints) from several different layers of the teacher model. Although the selection of hint points can drastically alter the compression performance, conventional distillation approaches overlook this fact and use the same hint points as in the early studies. Therefore,… ▽ More

    Submitted 3 November, 2022; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: Our codes are published on Code Ocean, where the link to our codes is: https://codeocean.com/capsule/4245746/tree/v1

    Journal ref: Expert Systems with Applications, Volume 213, Part B, March 2023, 119040

  17. arXiv:2012.06410  [pdf, ps, other

    cs.RO eess.SY

    Learning How to Trade-Off Safety with Agility Using Deep Covariance Estimation for Perception Driven UAV Motion Planning

    Authors: Onur Akgun, Kamil Canberk Atik, Mustafa Erdem, Mehmetcan Kaymaz, Bugrahan Yamak, N. Kemal Ure

    Abstract: We investigate how to utilize predictive models for selecting appropriate motion planning strategies based on perception uncertainty estimation for agile unmanned aerial vehicle (UAV) navigation tasks. Although there are variety of motion planning and perception algorithms for such tasks, the impact of perception uncertainty is not explicitly handled in many of the current motion algorithms, which… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: A paper on intelligent motion planning for agile drones. It is currently being reviewed for ICRA 2021

  18. arXiv:2012.02303  [pdf, other

    math.OC cs.MA math.DS math.PR

    Decentralized State-Dependent Markov Chain Synthesis with an Application to Swarm Guidance

    Authors: Samet Uzun, Nazim Kemal Ure, Behcet Acikmese

    Abstract: This paper introduces a decentralized state-dependent Markov chain synthesis (DSMC) algorithm for finite-state Markov chains. We present a state-dependent consensus protocol that achieves exponential convergence under mild technical conditions, without relying on any connectivity assumptions regarding the dynamic network topology. Utilizing the proposed consensus protocol, we develop the DSMC algo… ▽ More

    Submitted 26 April, 2024; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: text overlap with arXiv:2012.01928

  19. arXiv:2012.01928  [pdf, other

    math.OC cs.MA math.DS math.PR

    A Probabilistic Guidance Approach to Swarm-to-Swarm Engagement Problem

    Authors: Samet Uzun, Nazim Kemal Ure

    Abstract: This paper introduces a probabilistic guidance approach for the swarm-to-swarm engagement problem. The idea is based on driving the controlled swarm towards an adversary swarm, where the adversary swarm aims to converge to a stationary distribution that corresponds to a defended base location. The probabilistic approach is based on designing a Markov chain for the distribution of the swarm to conv… ▽ More

    Submitted 28 November, 2020; originally announced December 2020.

  20. arXiv:2009.11905  [pdf, other

    cs.AI cs.LG cs.RO

    A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward

    Authors: M. Ugur Yavas, N. Kemal Ure, Tufan Kumbasar

    Abstract: Automated lane change is one of the most challenging task to be solved of highly automated vehicles due to its safety-critical, uncertain and multi-agent nature. This paper presents the novel deployment of the state of art Q learning method, namely Rainbow DQN, that uses a new safety driven rewarding scheme to tackle the issues in an dynamic and uncertain simulation environment. We present various… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

  21. arXiv:2007.14671  [pdf, other

    cs.RO cs.CV cs.LG eess.SY

    Sample Efficient Interactive End-to-End Deep Learning for Self-Driving Cars with Selective Multi-Class Safe Dataset Aggregation

    Authors: Yunus Bicer, Ali Alizadeh, Nazim Kemal Ure, Ahmetcan Erdogan, Orkun Kizilirmak

    Abstract: The objective of this paper is to develop a sample efficient end-to-end deep learning method for self-driving cars, where we attempt to increase the value of the information extracted from samples, through careful analysis obtained from each call to expert driverś policy. End-to-end imitation learning is a popular method for computing self-driving car policies. The standard approach relies on coll… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: 6 pages, 6 figures, IROS2019 conference

    Journal ref: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019, pp. 2629-2634

  22. arXiv:2006.05821  [pdf

    cs.RO cs.AI eess.SP

    Development of A Stochastic Traffic Environment with Generative Time-Series Models for Improving Generalization Capabilities of Autonomous Driving Agents

    Authors: Anil Ozturk, Mustafa Burak Gunel, Melih Dal, Ugur Yavas, Nazim Kemal Ure

    Abstract: Automated lane changing is a critical feature for advanced autonomous driving systems. In recent years, reinforcement learning (RL) algorithms trained on traffic simulators yielded successful results in computing lane changing policies that strike a balance between safety, agility and compensating for traffic uncertainty. However, many RL algorithms exhibit simulator bias and policies trained on s… ▽ More

    Submitted 10 June, 2020; originally announced June 2020.

    Comments: 7 pages, 4 figures, 7 tables, IV2020

  23. arXiv:1909.11538  [pdf, other

    cs.RO cs.AI cs.LG eess.SY stat.ML

    Automated Lane Change Decision Making using Deep Reinforcement Learning in Dynamic and Uncertain Highway Environment

    Authors: Ali Alizadeh, Majid Moghadam, Yunus Bicer, Nazim Kemal Ure, Ugur Yavas, Can Kurtulus

    Abstract: Autonomous lane changing is a critical feature for advanced autonomous driving systems, that involves several challenges such as uncertainty in other driver's behaviors and the trade-off between safety and agility. In this work, we develop a novel simulation environment that emulates these challenges and train a deep reinforcement learning agent that yields consistent performance in a variety of d… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted to IEEE Intelligent Transportation Systems Conference - ITSC 2019