Showing 1–16 of 16 results for author: Krompaß, D

  1. arXiv:2505.23270  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Does Machine Unlearning Truly Remove Model Knowledge? A Framework for Auditing Unlearning in LLMs

    Authors: Haokun Chen, Yueqi Zhang, Yuan Bi, Yao Zhang, Tong Liu, Jinhe Bi, Jian Lan, Jindong Gu, Claudia Grosser, Denis Krompass, Nassir Navab, Volker Tresp

    Abstract: In recent years, Large Language Models (LLMs) have achieved remarkable advancements, drawing significant attention from the research community. Their capabilities are largely attributed to large-scale architectures, which require extensive training on massive datasets. However, such datasets often contain sensitive or copyrighted content sourced from the public internet, raising concerns about dat…

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2410.04810  [pdf, other]

    cs.LG cs.CV cs.DC cs.MM

    FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models

    Authors: Haokun Chen, Hang Li, Yao Zhang, Jinhe Bi, Gengyuan Zhang, Yueqi Zhang, Philip Torr, Jindong Gu, Denis Krompass, Volker Tresp

    Abstract: One-Shot Federated Learning (OSFL), a special decentralized machine learning paradigm, has recently gained significant attention. OSFL requires only a single round of client data or model upload, which reduces communication costs and mitigates privacy threats compared to traditional FL. Despite these promising prospects, existing methods face challenges due to client data heterogeneity and limited…

    Submitted 2 March, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: CVPR 2025

  3. arXiv:2402.07570  [pdf, other]

    cs.LG cs.AI

    Only the Curve Shape Matters: Training Foundation Models for Zero-Shot Multivariate Time Series Forecasting through Next Curve Shape Prediction

    Authors: Cheng Feng, Long Huang, Denis Krompass

    Abstract: We present General Time Transformer (GTT), an encoder-only style foundation model for zero-shot multivariate time series forecasting. GTT is pretrained on a large dataset of 200M high-quality time series samples spanning diverse domains. In our proposed framework, the task of multivariate time series forecasting is formulated as a channel-wise next curve shape prediction problem, where each time s…

    Submitted 18 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  4. arXiv:2308.12305  [pdf, other]

    cs.LG cs.AI cs.MM

    FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning

    Authors: Haokun Chen, Yao Zhang, Denis Krompass, Jindong Gu, Volker Tresp

    Abstract: Recently, foundation models have exhibited remarkable advancements in multi-modal learning. These models, equipped with millions (or billions) of parameters, typically require a substantial amount of data for finetuning. However, collecting and centralizing training data from diverse sectors becomes challenging due to distinct privacy regulations. Federated Learning (FL) emerges as a promising sol…

    Submitted 21 August, 2023; originally announced August 2023.

  5. arXiv:2308.08634  [pdf, other]

    cs.LG cs.AI cs.DC

    FedPop: Federated Population-based Hyperparameter Tuning

    Authors: Haokun Chen, Denis Krompass, Jindong Gu, Volker Tresp

    Abstract: Federated Learning (FL) is a distributed machine learning (ML) paradigm, in which multiple clients collaboratively train ML models without centralizing their local data. Similar to conventional ML pipelines, the client local optimization and server aggregation procedure in FL are sensitive to the hyperparameter (HP) selection. Despite extensive research on tuning HPs for centralized ML, these meth…

    Submitted 23 July, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Code: https://github.com/HaokunChen245/FedPop

  6. arXiv:2211.10567  [pdf, other]

    cs.CV

    CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering

    Authors: Yao Zhang, Haokun Chen, Ahmed Frikha, Yezi Yang, Denis Krompass, Gengyuan Zhang, Jindong Gu, Volker Tresp

    Abstract: Visual Question Answering (VQA) is a multi-discipline research task. To produce the right answer, it requires an understanding of the visual content of images, the natural language questions, as well as commonsense reasoning over the information contained in the image and world knowledge. Recently, large-scale Vision-and-Language Pre-trained Models (VLPMs) have been the mainstream approach to VQA…

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 10 pages, 6 figures

  7. arXiv:2205.14900  [pdf, other]

    cs.LG cs.AI

    FRAug: Tackling Federated Learning with Non-IID Features via Representation Augmentation

    Authors: Haokun Chen, Ahmed Frikha, Denis Krompass, Jindong Gu, Volker Tresp

    Abstract: Federated Learning (FL) is a decentralized learning paradigm, in which multiple clients collaboratively train deep learning models without centralizing their local data, and hence preserve data privacy. Real-world applications usually involve a distribution shift across the datasets of the different clients, which hurts the generalization ability of the clients to unseen samples from their respect…

    Submitted 22 August, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: ICCV 2023

  8. arXiv:2110.04545  [pdf, other]

    cs.LG cs.CV

    Towards Data-Free Domain Generalization

    Authors: Ahmed Frikha, Haokun Chen, Denis Krompaß, Thomas Runkler, Volker Tresp

    Abstract: In this work, we investigate the unexplored intersection of domain generalization (DG) and data-free learning. In particular, we address the question: How can knowledge contained in models trained on different source domains be merged into a single model that generalizes well to unseen target domains, in the absence of source and target domain data? Machine learning models that can cope with domai…

    Submitted 14 November, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021 (DistShift Workshop) and ACML 2022

  9. arXiv:2109.04320  [pdf, other]

    cs.LG stat.ML

    Discovery of New Multi-Level Features for Domain Generalization via Knowledge Corruption

    Authors: Ahmed Frikha, Denis Krompaß, Volker Tresp

    Abstract: Machine learning models that can generalize to unseen domains are essential when applied in real-world scenarios involving strong domain shifts. We address the challenging domain generalization (DG) problem, where a model trained on a set of source domains is expected to generalize well in unseen domains without any exposure to their data. The main challenge of DG is that the features learned from…

    Submitted 3 October, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted at AAAI 2022 (AIBSD Workshop) and ICPR 2022

  10. ARCADe: A Rapid Continual Anomaly Detector

    Authors: Ahmed Frikha, Denis Krompaß, Volker Tresp

    Abstract: Although continual learning and anomaly detection have separately been well-studied in previous works, their intersection remains rather unexplored. The present work addresses a learning scenario where a model has to incrementally learn a sequence of anomaly detection tasks, i.e. tasks from which only examples from the normal (majority) class are available for training. We define this novel learni…

    Submitted 18 October, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: Accepted at ICPR 2020

  11. arXiv:2007.04146  [pdf, other]

    cs.LG stat.ML

    Few-Shot One-Class Classification via Meta-Learning

    Authors: Ahmed Frikha, Denis Krompaß, Hans-Georg Köpken, Volker Tresp

    Abstract: Although few-shot learning and one-class classification (OCC), i.e., learning a binary classifier with data from only one class, have been separately well studied, their intersection remains rather unexplored. Our work addresses the few-shot OCC problem and presents a method to modify the episodic data sampling strategy of the model-agnostic meta-learning (MAML) algorithm to learn a model initiali…

    Submitted 11 February, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Accepted at AAAI 2021

  12. arXiv:1707.01786  [pdf, other]

    cs.CV

    Tensor-Train Recurrent Neural Networks for Video Classification

    Authors: Yinchong Yang, Denis Krompass, Volker Tresp

    Abstract: The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very high-dimensional inputs due to the large input-to-hidden weight matrix. This may have prevented RNNs' large-scale application in tasks that involve very high i…

    Submitted 6 July, 2017; originally announced July 2017.

  13. arXiv:1512.06900  [pdf, other]

    cs.LG

    Predicting the Co-Evolution of Event and Knowledge Graphs

    Authors: Cristóbal Esteban, Volker Tresp, Yinchong Yang, Stephan Baier, Denis Krompaß

    Abstract: Embedding learning, a.k.a. representation learning, has been shown to be able to model large-scale semantic knowledge graphs. A key concept is a mapping of the knowledge graph to a tensor representation whose entries are predicted by models using latent representations of generalized entities. Knowledge graphs are typically treated as static: A knowledge graph grows more links when more facts beco…

    Submitted 21 December, 2015; originally announced December 2015.

  14. arXiv:1511.07972  [pdf, other]

    cs.AI cs.CL cs.LG

    Learning with Memory Embeddings

    Authors: Volker Tresp, Cristóbal Esteban, Yinchong Yang, Stephan Baier, Denis Krompaß

    Abstract: Embedding learning, a.k.a. representation learning, has been shown to be able to model large-scale semantic knowledge graphs. A key concept is a mapping of the knowledge graph to a tensor representation whose entries are predicted by models using latent representations of generalized entities. Latent variable models are well suited to deal with the high dimensionality and sparsity of typical knowl…

    Submitted 7 May, 2016; v1 submitted 25 November, 2015; originally announced November 2015.

    Comments: 29 pages, NIPS 2015 Workshop on Nonparametric Methods for Large Scale Representation Learning

  15. arXiv:1508.02593  [pdf, ps, other]

    cs.AI cs.LG

    Type-Constrained Representation Learning in Knowledge Graphs

    Authors: Denis Krompaß, Stephan Baier, Volker Tresp

    Abstract: Large knowledge graphs increasingly add value to various applications that require machines to recognize and understand queries and their semantics, as in search or question answering systems. Latent variable models have increasingly gained attention for the statistical modeling of knowledge graphs, showing promising results in tasks related to knowledge graph completion and cleaning. Besides stor…

    Submitted 28 August, 2015; v1 submitted 11 August, 2015; originally announced August 2015.

  16. arXiv:1311.4180  [pdf, other]

    cs.CY cs.AI

    Towards a New Science of a Clinical Data Intelligence

    Authors: Volker Tresp, Sonja Zillner, Maria J. Costa, Yi Huang, Alexander Cavallaro, Peter A. Fasching, Andre Reis, Martin Sedlmayr, Thomas Ganslandt, Klemens Budde, Carl Hinrichs, Danilo Schmidt, Philipp Daumke, Daniel Sonntag, Thomas Wittenberg, Patricia G. Oppelt, Denis Krompass

    Abstract: In this paper we define Clinical Data Intelligence as the analysis of data generated in the clinical routine with the goal of improving patient care. We define a science of a Clinical Data Intelligence as a data analysis that permits the derivation of scientific, i.e., generalizable and reliable results. We argue that a science of a Clinical Data Intelligence is sensible in the context of a Big Da…

    Submitted 30 December, 2013; v1 submitted 17 November, 2013; originally announced November 2013.

    Comments: NIPS 2013 Workshop: Machine Learning for Clinical Data Analysis and Healthcare, 2013