Showing 1–14 of 14 results for author: Ackermann, J

Searching in archive cs.

  1. arXiv:2507.08537  [pdf, ps, other]

    cs.LG math.CT

    Recursive Reward Aggregation

    Authors: Yuting Tang, Yivan Zhang, Johannes Ackermann, Yu-Jie Zhang, Soichiro Nishimori, Masashi Sugiyama

    Abstract: In reinforcement learning (RL), aligning agent behavior with specific objectives typically requires careful design of the reward function, which can be challenging when the desired objectives are complex. In this work, we propose an alternative approach for flexible behavior alignment that eliminates the need to modify the reward function by selecting appropriate reward aggregation functions. By i…

    Submitted 11 July, 2025; originally announced July 2025.

    Comments: Reinforcement Learning Conference 2025

  2. arXiv:2506.21117  [pdf, ps, other]

    cs.CV

    CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization

    Authors: Jan Ackermann, Jonas Kulhanek, Shengqu Cai, Haofei Xu, Marc Pollefeys, Gordon Wetzstein, Leonidas Guibas, Songyou Peng

    Abstract: In dynamic 3D environments, accurately updating scene representations over time is crucial for applications in robotics, mixed reality, and embodied AI. As scenes evolve, efficient methods to incorporate changes are needed to maintain up-to-date, high-quality reconstructions without the computational overhead of re-optimizing the entire scene. This paper introduces CL-Splats, which incrementally u…

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: ICCV 2025, Project Page: https://cl-splats.github.io

  3. arXiv:2506.05210  [pdf, ps, other]

    cs.CV

    Towards Vision-Language-Garment Models for Web Knowledge Garment Understanding and Generation

    Authors: Jan Ackermann, Kiyohiro Nakayama, Guandao Yang, Tong Wu, Gordon Wetzstein

    Abstract: Multimodal foundation models have demonstrated strong generalization, yet their ability to transfer knowledge to specialized domains such as garment generation remains underexplored. We introduce VLG, a vision-language-garment model that synthesizes garments from textual descriptions and visual imagery. Our experiments assess VLG's zero-shot generalization, investigating its ability to transfer we…

    Submitted 30 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

    Comments: Presented at MMFM CVPRW'25, Project Page: https://www.computationalimaging.org/publications/vision-language-garment-models/

  4. arXiv:2412.03937  [pdf, other]

    cs.CV

    AIpparel: A Multimodal Foundation Model for Digital Garments

    Authors: Kiyohiro Nakayama, Jan Ackermann, Timur Levent Kesdogan, Yang Zheng, Maria Korosteleva, Olga Sorkine-Hornung, Leonidas J. Guibas, Guandao Yang, Gordon Wetzstein

    Abstract: Apparel is essential to human life, offering protection, mirroring cultural identities, and showcasing personal style. Yet, the creation of garments remains a time-consuming process, largely due to the manual work involved in designing them. To simplify this process, we introduce AIpparel, a multimodal foundation model for generating and editing sewing patterns. Our model fine-tunes state-of-the-a…

    Submitted 5 April, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: The project website is at https://georgenakayama.github.io/AIpparel/

  5. arXiv:2406.10876  [pdf, ps, other]

    cs.LG math.NA math.PR

    Deep neural networks with ReLU, leaky ReLU, and softplus activation provably overcome the curse of dimensionality for space-time solutions of semilinear partial differential equations

    Authors: Julia Ackermann, Arnulf Jentzen, Benno Kuckuck, Joshua Lee Padgett

    Abstract: It is a challenging topic in applied mathematics to solve high-dimensional nonlinear partial differential equations (PDEs). Standard approximation methods for nonlinear PDEs suffer under the curse of dimensionality (COD) in the sense that the number of computational operations of the approximation method grows at least exponentially in the PDE dimension and with such methods it is essentially impo…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 64 pages. arXiv admin note: text overlap with arXiv:2309.13722, arXiv:2310.20360

    MSC Class: 65M15; 65C05; 68T07 (Primary) 60H35 (Secondary)

  6. arXiv:2405.14114  [pdf, other]

    cs.LG cs.AI

    Offline Reinforcement Learning from Datasets with Structured Non-Stationarity

    Authors: Johannes Ackermann, Takayuki Osa, Masashi Sugiyama

    Abstract: Current Reinforcement Learning (RL) is often limited by the large amount of data needed to learn a successful policy. Offline RL aims to solve this issue by using transitions collected by a different behavior policy. We address a novel Offline RL problem setting in which, while collecting the dataset, the transition and reward functions gradually change between episodes but stay constant within ea…

    Submitted 27 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted for Reinforcement Learning Conference (RLC) 2024

  7. arXiv:2404.07465  [pdf, other]

    cs.LG

    Offline Reinforcement Learning with Domain-Unlabeled Data

    Authors: Soichiro Nishimori, Xin-Qiang Cai, Johannes Ackermann, Masashi Sugiyama

    Abstract: Offline reinforcement learning (RL) is vital in areas where active data collection is expensive or infeasible, such as robotics or healthcare. In the real world, offline datasets often involve multiple domains that share the same state and action spaces but have distinct dynamics, and only a small fraction of samples are clearly labeled as belonging to the target domain we are interested in. For e…

    Submitted 28 February, 2025; v1 submitted 11 April, 2024; originally announced April 2024.

  8. arXiv:2402.13934  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Do Efficient Transformers Really Save Computation?

    Authors: Kai Yang, Jan Ackermann, Zhenyu He, Guhao Feng, Bohang Zhang, Yunzhen Feng, Qiwei Ye, Di He, Liwei Wang

    Abstract: As transformer-based language models are trained on increasingly large datasets and with vast numbers of parameters, finding more efficient alternatives to the standard Transformer has become very valuable. While many efficient Transformers and Transformer alternatives have been proposed, none provide theoretical guarantees that they are a suitable replacement for the standard Transformer. This ma…

    Submitted 8 November, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 20 pages, ICML 2024 Camera Ready Version

  9. arXiv:2309.13722  [pdf, ps, other]

    math.NA cs.LG math.PR

    Deep neural networks with ReLU, leaky ReLU, and softplus activation provably overcome the curse of dimensionality for Kolmogorov partial differential equations with Lipschitz nonlinearities in the $L^p$-sense

    Authors: Julia Ackermann, Arnulf Jentzen, Thomas Kruse, Benno Kuckuck, Joshua Lee Padgett

    Abstract: Recently, several deep learning (DL) methods for approximating high-dimensional partial differential equations (PDEs) have been proposed. The interest that these methods have generated in the literature is in large part due to simulations which appear to demonstrate that such DL methods have the capacity to overcome the curse of dimensionality (COD) for PDEs in the sense that the number of computa…

    Submitted 24 June, 2025; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: 52 pages

    MSC Class: 65M15; 65C05; 68T07 (Primary) 60H35 (Secondary)

  10. arXiv:2305.16972  [pdf, other]

    cs.CV

    Maskomaly: Zero-Shot Mask Anomaly Segmentation

    Authors: Jan Ackermann, Christos Sakaridis, Fisher Yu

    Abstract: We present a simple and practical framework for anomaly segmentation called Maskomaly. It builds upon mask-based standard semantic segmentation networks by adding a simple inference-time post-processing step which leverages the raw mask outputs of such networks. Maskomaly does not require additional training and only adds a small computational overhead to inference. Most importantly, it does not r…

    Submitted 25 August, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: BMVC 2023

  11. arXiv:2303.15595  [pdf, other]

    cs.IR

    Bi-Encoder Cascades for Efficient Image Search

    Authors: Robert Hönig, Jan Ackermann, Mingyuan Chi

    Abstract: Modern neural encoders offer unprecedented text-image retrieval (TIR) accuracy, but their high computational cost impedes an adoption to large-scale image searches. To lower this cost, model cascades use an expensive encoder to refine the ranking of a cheap encoder. However, existing cascading algorithms focus on cross-encoders, which jointly process text-image pairs, but do not consider cascades…

    Submitted 4 August, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: Under review as a short paper at the ICCV '23 RCV workshop

  12. arXiv:2210.12965  [pdf, other]

    cs.CV cs.LG

    High-Resolution Image Editing via Multi-Stage Blended Diffusion

    Authors: Johannes Ackermann, Minjun Li

    Abstract: Diffusion models have shown great results in image generation and in image editing. However, current approaches are limited to low resolutions due to the computational cost of training diffusion models for high-resolution generation. We propose an approach that uses a pre-trained low-resolution diffusion model to edit images in the megapixel range. We first use Blended Diffusion to edit the image…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Machine Learning for Creativity and Design Workshop at NeurIPS 2022

  13. arXiv:2207.14650  [pdf]

    eess.IV cs.CV cs.LG

    SYNTA: A novel approach for deep learning-based image analysis in muscle histopathology using photo-realistic synthetic data

    Authors: Leonid Mill, Oliver Aust, Jochen A. Ackermann, Philipp Burger, Monica Pascual, Katrin Palumbo-Zerr, Gerhard Krönke, Stefan Uderhardt, Georg Schett, Christoph S. Clemen, Rolf Schröder, Christian Holtzhausen, Samir Jabari, Andreas Maier, Anika Grüneboom

    Abstract: Artificial intelligence (AI), machine learning, and deep learning (DL) methods are becoming increasingly important in the field of biomedical image analysis. However, to exploit the full potential of such methods, a representative number of experimentally acquired images containing a significant number of manually annotated objects is needed as training data. Here we introduce SYNTA (synthetic dat…

    Submitted 3 January, 2024; v1 submitted 29 July, 2022; originally announced July 2022.

  14. arXiv:1910.01465  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics

    Authors: Johannes Ackermann, Volker Gabler, Takayuki Osa, Masashi Sugiyama

    Abstract: Many real world tasks require multiple agents to work together. Multi-agent reinforcement learning (RL) methods have been proposed in recent years to solve these tasks, but current methods often fail to efficiently learn policies. We thus investigate the presence of a common weakness in single-agent RL, namely value function overestimation bias, in the multi-agent setting. Based on our findings, w…

    Submitted 2 December, 2019; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: Accepted for the Deep RL Workshop at NeurIPS 2019; Changes for v2: Changed Figures 3 and 4 due to an error in the implementation of MATD3. Please refer to this version for fair evaluation.