Search | arXiv e-print repository

PEAKS: Selecting Key Training Examples Incrementally via Prediction Error Anchored by Kernel Similarity

Authors: Mustafa Burak Gurbuz, Xingyu Zheng, Constantine Dovrolis

Abstract: As deep learning continues to be driven by ever-larger datasets, understanding which examples are most important for generalization has become a critical question. While progress in data selection continues, emerging applications require studying this problem in dynamic contexts. To bridge this gap, we pose the Incremental Data Selection (IDS) problem, where examples arrive as a continuous stream,… ▽ More As deep learning continues to be driven by ever-larger datasets, understanding which examples are most important for generalization has become a critical question. While progress in data selection continues, emerging applications require studying this problem in dynamic contexts. To bridge this gap, we pose the Incremental Data Selection (IDS) problem, where examples arrive as a continuous stream, and need to be selected without access to the full data source. In this setting, the learner must incrementally build a training dataset of predefined size while simultaneously learning the underlying task. We find that in IDS, the impact of a new sample on the model state depends fundamentally on both its geometric relationship in the feature space and its prediction error. Leveraging this insight, we propose PEAKS (Prediction Error Anchored by Kernel Similarity), an efficient data selection method tailored for IDS. Our comprehensive evaluations demonstrate that PEAKS consistently outperforms existing selection strategies. Furthermore, PEAKS yields increasingly better performance returns than random selection as training data size grows on real-world datasets. The code is available at https://github.com/BurakGurbuz97/PEAKS. △ Less

Submitted 30 June, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

arXiv:2311.04301 [pdf, other]

Class-Incremental Continual Learning for General Purpose Healthcare Models

Authors: Amritpal Singh, Mustafa Burak Gurbuz, Shiva Souhith Gantha, Prahlad Jasti

Abstract: Healthcare clinics regularly encounter dynamic data that changes due to variations in patient populations, treatment policies, medical devices, and emerging disease patterns. Deep learning models can suffer from catastrophic forgetting when fine-tuned in such scenarios, causing poor performance on previously learned tasks. Continual learning allows learning on new tasks without performance drop on… ▽ More Healthcare clinics regularly encounter dynamic data that changes due to variations in patient populations, treatment policies, medical devices, and emerging disease patterns. Deep learning models can suffer from catastrophic forgetting when fine-tuned in such scenarios, causing poor performance on previously learned tasks. Continual learning allows learning on new tasks without performance drop on previous tasks. In this work, we investigate the performance of continual learning models on four different medical imaging scenarios involving ten classification datasets from diverse modalities, clinical specialties, and hospitals. We implement various continual learning approaches and evaluate their performance in these scenarios. Our results demonstrate that a single model can sequentially learn new tasks from different specialties and achieve comparable performance to naive methods. These findings indicate the feasibility of recycling or sharing models across the same or different medical specialties, offering another step towards the development of general-purpose medical imaging AI that can be shared across institutions. △ Less

Submitted 7 November, 2023; originally announced November 2023.

Comments: 4 pages, 1 Figure. Accepted in NeurIPS 2023 (Medical Imaging meets NeurIPS Workshop)

arXiv:2305.18563 [pdf, other]

SHARP: Sparsity and Hidden Activation RePlay for Neuro-Inspired Continual Learning

Authors: Mustafa Burak Gurbuz, Jean Michael Moorman, Constantine Dovrolis

Abstract: Deep neural networks (DNNs) struggle to learn in dynamic environments since they rely on fixed datasets or stationary environments. Continual learning (CL) aims to address this limitation and enable DNNs to accumulate knowledge incrementally, similar to human learning. Inspired by how our brain consolidates memories, a powerful strategy in CL is replay, which involves training the DNN on a mixture… ▽ More Deep neural networks (DNNs) struggle to learn in dynamic environments since they rely on fixed datasets or stationary environments. Continual learning (CL) aims to address this limitation and enable DNNs to accumulate knowledge incrementally, similar to human learning. Inspired by how our brain consolidates memories, a powerful strategy in CL is replay, which involves training the DNN on a mixture of new and all seen classes. However, existing replay methods overlook two crucial aspects of biological replay: 1) the brain replays processed neural patterns instead of raw input, and 2) it prioritizes the replay of recently learned information rather than revisiting all past experiences. To address these differences, we propose SHARP, an efficient neuro-inspired CL method that leverages sparse dynamic connectivity and activation replay. Unlike other activation replay methods, which assume layers not subjected to replay have been pretrained and fixed, SHARP can continually update all layers. Also, SHARP is unique in that it only needs to replay few recently seen classes instead of all past classes. Our experiments on five datasets demonstrate that SHARP outperforms state-of-the-art replay methods in class incremental learning. Furthermore, we showcase SHARP's flexibility in a novel CL scenario where the boundaries between learning episodes are blurry. The SHARP code is available at \url{https://github.com/BurakGurbuz97/SHARP-Continual-Learning}. △ Less

Submitted 29 May, 2023; originally announced May 2023.

arXiv:2212.04603 [pdf, other]

doi 10.1145/3564121.3565236

System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games

Authors: Indranil Sur, Zachary Daniels, Abrar Rahman, Kamil Faber, Gianmarco J. Gallardo, Tyler L. Hayes, Cameron E. Taylor, Mustafa Burak Gurbuz, James Smith, Sahana Joshi, Nathalie Japkowicz, Michael Baron, Zsolt Kira, Christopher Kanan, Roberto Corizzo, Ajay Divakaran, Michael Piacentino, Jesse Hostetler, Aswin Raghavan

Abstract: As Artificial and Robotic Systems are increasingly deployed and relied upon for real-world applications, it is important that they exhibit the ability to continually learn and adapt in dynamically-changing environments, becoming Lifelong Learning Machines. Continual/lifelong learning (LL) involves minimizing catastrophic forgetting of old tasks while maximizing a model's capability to learn new ta… ▽ More As Artificial and Robotic Systems are increasingly deployed and relied upon for real-world applications, it is important that they exhibit the ability to continually learn and adapt in dynamically-changing environments, becoming Lifelong Learning Machines. Continual/lifelong learning (LL) involves minimizing catastrophic forgetting of old tasks while maximizing a model's capability to learn new tasks. This paper addresses the challenging lifelong reinforcement learning (L2RL) setting. Pushing the state-of-the-art forward in L2RL and making L2RL useful for practical applications requires more than developing individual L2RL algorithms; it requires making progress at the systems-level, especially research into the non-trivial problem of how to integrate multiple L2RL algorithms into a common framework. In this paper, we introduce the Lifelong Reinforcement Learning Components Framework (L2RLCF), which standardizes L2RL systems and assimilates different continual learning components (each addressing different aspects of the lifelong learning problem) into a unified system. As an instantiation of L2RLCF, we develop a standard API allowing easy integration of novel lifelong learning components. We describe a case study that demonstrates how multiple independently-developed LL components can be integrated into a single realized system. We also introduce an evaluation environment in order to measure the effect of combining various system components. Our evaluation environment employs different LL scenarios (sequences of tasks) consisting of Starcraft-2 minigames and allows for the fair, comprehensive, and quantitative comparison of different combinations of components within a challenging common evaluation environment. △ Less

Submitted 8 December, 2022; originally announced December 2022.

Comments: The Second International Conference on AIML Systems, October 12--15, 2022, Bangalore, India

arXiv:2207.11782 [pdf, other]

Enhancements to the BOUN Treebank Reflecting the Agglutinative Nature of Turkish

Authors: Büşra Marşan, Salih Furkan Akkurt, Muhammet Şen, Merve Gürbüz, Onur Güngör, Şaziye Betül Özateş, Suzan Üsküdarlı, Arzucan Özgür, Tunga Güngör, Balkız Öztürk

Abstract: In this study, we aim to offer linguistically motivated solutions to resolve the issues of the lack of representation of null morphemes, highly productive derivational processes, and syncretic morphemes of Turkish in the BOUN Treebank without diverging from the Universal Dependencies framework. In order to tackle these issues, new annotation conventions were introduced by splitting certain lemma… ▽ More In this study, we aim to offer linguistically motivated solutions to resolve the issues of the lack of representation of null morphemes, highly productive derivational processes, and syncretic morphemes of Turkish in the BOUN Treebank without diverging from the Universal Dependencies framework. In order to tackle these issues, new annotation conventions were introduced by splitting certain lemmas and employing the MISC (miscellaneous) tab in the UD framework to denote derivation. Representational capabilities of the re-annotated treebank were tested on a LSTM-based dependency parser and an updated version of the BoAT Tool is introduced. △ Less

Submitted 24 July, 2022; originally announced July 2022.

Comments: This is a peer reviewed article that has been presented in The International Conference on Agglutinative Language Technologies as a challenge of Natural Language Processing (ALTNLP) 2022

arXiv:2206.09117 [pdf, other]

NISPA: Neuro-Inspired Stability-Plasticity Adaptation for Continual Learning in Sparse Networks

Authors: Mustafa Burak Gurbuz, Constantine Dovrolis

Abstract: The goal of continual learning (CL) is to learn different tasks over time. The main desiderata associated with CL are to maintain performance on older tasks, leverage the latter to improve learning of future tasks, and to introduce minimal overhead in the training process (for instance, to not require a growing model or retraining). We propose the Neuro-Inspired Stability-Plasticity Adaptation (NI… ▽ More The goal of continual learning (CL) is to learn different tasks over time. The main desiderata associated with CL are to maintain performance on older tasks, leverage the latter to improve learning of future tasks, and to introduce minimal overhead in the training process (for instance, to not require a growing model or retraining). We propose the Neuro-Inspired Stability-Plasticity Adaptation (NISPA) architecture that addresses these desiderata through a sparse neural network with fixed density. NISPA forms stable paths to preserve learned knowledge from older tasks. Also, NISPA uses connection rewiring to create new plastic paths that reuse existing knowledge on novel tasks. Our extensive evaluation on EMNIST, FashionMNIST, CIFAR10, and CIFAR100 datasets shows that NISPA significantly outperforms representative state-of-the-art continual learning baselines, and it uses up to ten times fewer learnable parameters compared to baselines. We also make the case that sparsity is an essential ingredient for continual learning. The NISPA code is available at https://github.com/BurakGurbuz97/NISPA. △ Less

Submitted 18 June, 2022; originally announced June 2022.

Comments: International Conference on Machine Learning 2022

arXiv:2104.03895 [pdf, other]

MGN-Net: a multi-view graph normalizer for integrating heterogeneous biological network populations

Authors: Islem Rekik, Mustafa Burak Gurbuz

Abstract: With the recent technological advances, biological datasets, often represented by networks (i.e., graphs) of interacting entities, proliferate with unprecedented complexity and heterogeneity. Although modern network science opens new frontiers of analyzing connectivity patterns in such datasets, we still lack data-driven methods for extracting an integral connectional fingerprint of a multi-view g… ▽ More With the recent technological advances, biological datasets, often represented by networks (i.e., graphs) of interacting entities, proliferate with unprecedented complexity and heterogeneity. Although modern network science opens new frontiers of analyzing connectivity patterns in such datasets, we still lack data-driven methods for extracting an integral connectional fingerprint of a multi-view graph population, let alone disentangling the typical from the atypical variations across the population samples. We present the multi-view graph normalizer network (MGN-Net; https://github.com/basiralab/MGN-Net), a graph neural network based method to normalize and integrate a set of multi-view biological networks into a single connectional template that is centered, representative, and topologically sound. We demonstrate the use of MGN-Net by discovering the connectional fingerprints of healthy and neurologically disordered brain network populations including Alzheimer's disease and Autism spectrum disorder patients. Additionally, by comparing the learned templates of healthy and disordered populations, we show that MGN-Net significantly outperforms conventional network integration methods across extensive experiments in terms of producing the most centered templates, recapitulating unique traits of populations, and preserving the complex topology of biological networks. Our evaluations showed that MGN-Net is powerfully generic and easily adaptable in design to different graph-based problems such as identification of relevant connections, normalization and integration. △ Less

Submitted 4 April, 2021; originally announced April 2021.

arXiv:2012.14131 [pdf, other]

doi 10.1007/978-3-030-59728-3_16

Deep Graph Normalizer: A Geometric Deep Learning Approach for Estimating Connectional Brain Templates

Authors: Mustafa Burak Gurbuz, Islem Rekik

Abstract: A connectional brain template (CBT) is a normalized graph-based representation of a population of brain networks also regarded as an average connectome. CBTs are powerful tools for creating representative maps of brain connectivity in typical and atypical populations. Particularly, estimating a well-centered and representative CBT for populations of multi-view brain networks (MVBN) is more challen… ▽ More A connectional brain template (CBT) is a normalized graph-based representation of a population of brain networks also regarded as an average connectome. CBTs are powerful tools for creating representative maps of brain connectivity in typical and atypical populations. Particularly, estimating a well-centered and representative CBT for populations of multi-view brain networks (MVBN) is more challenging since these networks sit on complex manifolds and there is no easy way to fuse different heterogeneous network views. This problem remains unexplored with the exception of a few recent works rooted in the assumption that the relationship between connectomes are mostly linear. However, such an assumption fails to capture complex patterns and non-linear variation across individuals. Besides, existing methods are simply composed of sequential MVBN processing blocks without any feedback mechanism, leading to error accumulation. To address these issues, we propose Deep Graph Normalizer (DGN), the first geometric deep learning (GDL) architecture for normalizing a population of MVBNs by integrating them into a single connectional brain template. Our end-to-end DGN learns how to fuse multi-view brain networks while capturing non-linear patterns across subjects and preserving brain graph topological properties by capitalizing on graph convolutional neural networks. We also introduce a randomized weighted loss function which also acts as a regularizer to minimize the distance between the population of MVBNs and the estimated CBT, thereby enforcing its centeredness. We demonstrate that DGN significantly outperforms existing state-of-the-art methods on estimating CBTs on both small-scale and large-scale connectomic datasets in terms of both representativeness and discriminability (i.e., identifying distinctive connectivities fingerprinting each brain network population). △ Less

Submitted 28 December, 2020; originally announced December 2020.

Comments: 11 pages, 2 figures

Journal ref: International Conference on Medical Image Computing and Computer-Assisted Intervention 2020

arXiv:1503.02500 [pdf, ps, other]

Some Generalizations of Integral Inequalities and Their Applications

Authors: Mustafa Gurbuz, Abdullah Yaradilmis

Abstract: In this paper, an integral identity for twice differentiable functions is generalized. Then, by using convexity of |f''| or q-th power of |f''| and with the aid of power mean and Holder's inequalities we achieved some new results. We also gave some applications to quadrature formulas and some special means. Therewithal, by choosing (alpha=1/2) in our main results, we obtained some findings in [13]… ▽ More In this paper, an integral identity for twice differentiable functions is generalized. Then, by using convexity of |f''| or q-th power of |f''| and with the aid of power mean and Holder's inequalities we achieved some new results. We also gave some applications to quadrature formulas and some special means. Therewithal, by choosing (alpha=1/2) in our main results, we obtained some findings in [13]. △ Less

Submitted 9 March, 2015; originally announced March 2015.

Comments: 15 pages. arXiv admin note: text overlap with arXiv:1005.2879 by other authors

MSC Class: 26A51; 26D10; 26D15

arXiv:1212.1420 [pdf, ps, other]

Integral Inequalities for functions whose 3rd derivatives belong to Q(I)

Authors: M. E. Ozdemir, M. Avci Ardic, M. Gurbuz

Abstract: In this paper, we obtain some new inequalities of Hermite-Hadamard type and Simpson type for functions whose third derivatives belong to Godunova-Levin class. In this paper, we obtain some new inequalities of Hermite-Hadamard type and Simpson type for functions whose third derivatives belong to Godunova-Levin class. △ Less

Submitted 6 December, 2012; originally announced December 2012.

arXiv:1211.2750 [pdf, ps, other]

Definitions of h-logaritmic, h-geometric and h-multi convex functions and some inequalities related to them

Authors: M. Emin Ozdemir, Mevlut Tunc, Mustafa Gurbuz

Abstract: In this papaer, we put forward some new definitions and integral inequalities by using fairly elementary analysis. In this papaer, we put forward some new definitions and integral inequalities by using fairly elementary analysis. △ Less

Submitted 12 November, 2012; originally announced November 2012.

Comments: 8 pages

MSC Class: 26D15 (Primary) 26D10; 26A51 (Secondary)

arXiv:1208.1033 [pdf, ps, other]

Hermite-Hadamard-type inequalities for (g,Φ_{h})- convex dominated functions

Authors: M. Emin Ozdemir, Mustafa Gurbuz, Havva Kavurmaci

Abstract: In this paper, we introduce the notion of (g,Φ_{h})-convex dominated function and present some properties of them. Finally, we present a version of Hermite-Hadamard-type inequalities for (g,Φ_{h})-convex dominated functions. Our results generalize the Hermite-Hadamard-type inequalities in [2], [4] and [6]. In this paper, we introduce the notion of (g,Φ_{h})-convex dominated function and present some properties of them. Finally, we present a version of Hermite-Hadamard-type inequalities for (g,Φ_{h})-convex dominated functions. Our results generalize the Hermite-Hadamard-type inequalities in [2], [4] and [6]. △ Less

Submitted 5 August, 2012; originally announced August 2012.

Comments: 7 pages

arXiv:1207.2436 [pdf, ps, other]

New Estimates on Integral Inequalities and Their Applications

Authors: M. Emin Ozdemir, Mustafa Gurbuz, Mevlut Tunc

Abstract: In this paper, we obtain some inequalities by using a kernel and an inequality which is a result of Young inequality. Besides we give some applications to special means. In this paper, we obtain some inequalities by using a kernel and an inequality which is a result of Young inequality. Besides we give some applications to special means. △ Less

Submitted 2 December, 2012; v1 submitted 7 July, 2012; originally announced July 2012.

Comments: 7 pages

MSC Class: 26D15; 26D10

arXiv:1207.2435 [pdf, ps, other]

Simpson Type Inequalities for Q-Class Functions

Authors: M. Emin Ozdemir, Alper Ekinci, Mustafa Gurbuz, Ahmet Ocak Akdemir

Abstract: In this paper, we obtain some Simpson type inequalities for functions whose second derivatives absolute value or q-th power of them are Q-class functions. Also we give applications to numerical integration. In this paper, we obtain some Simpson type inequalities for functions whose second derivatives absolute value or q-th power of them are Q-class functions. Also we give applications to numerical integration. △ Less

Submitted 7 July, 2012; originally announced July 2012.

Comments: 4 pages

arXiv:1011.1543 [pdf, ps, other]

The Hadamard Type Inequalities for M-Convex Functions

Authors: Cetin Yildiz, Mustafa Gurbuz, Ahmet Ocak Akdemir

Abstract: In this paper we obtained some new Hadamard-Type inequalities for functions whose derivatives absolute values m-convex. Some applications to special means of real numbers are given. In this paper we obtained some new Hadamard-Type inequalities for functions whose derivatives absolute values m-convex. Some applications to special means of real numbers are given. △ Less

Submitted 6 November, 2010; originally announced November 2010.

Showing 1–15 of 15 results for author: Gürbüz, M