Skip to main content

Showing 1–7 of 7 results for author: Zecha, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.14794  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Assembly of Experts: Linear-time construction of the Chimera LLM variants with emergent and adaptable behaviors

    Authors: Henrik Klagges, Robert Dahlke, Fabian Klemm, Benjamin Merkel, Daniel Klingmann, David A. Reiss, Dan Zecha

    Abstract: Requiring $10^{13}$-$10^{15}$ FLOPs to calculate one 8 bit weight in an LLM during pretraining is extremely expensive and seems inefficient. To better leverage the huge investments made into pretrained models, we develop the new "Assembly-of-Experts" (AoE) construction method to create capable child variants of existing Mixture-of-Experts parent models in linear time. Model weight tensors get inte… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  2. arXiv:2502.11096  [pdf, other

    cs.AI cs.CL

    Mixture of Tunable Experts -- Behavior Modification of DeepSeek-R1 at Inference Time

    Authors: Robert Dahlke, Henrik Klagges, Dan Zecha, Benjamin Merkel, Sven Rohr, Fabian Klemm

    Abstract: We present the Mixture-of-Tunable-Experts (MoTE), a method that extends the Mixture-of-Experts architecture of Large Language Models (LLMs). Without additional training, MoTE enables meaningful and focused behavior changes in LLMs on-the-fly during inference time. By analyzing the digital LLM brain of DeepSeek-R1 using a technique we dub 'functional Token Resonance Imaging' (fTRI) -- inspired by f… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

  3. Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing

    Authors: Philipp Harzig, Dan Zecha, Rainer Lienhart, Carolin Kaiser, René Schallner

    Abstract: Automatically generating descriptive captions for images is a well-researched area in computer vision. However, existing evaluation approaches focus on measuring the similarity between two sentences disregarding fine-grained semantics of the captions. In our setting of images depicting persons interacting with branded products, the subject, predicate, object and the name of the branded product are… ▽ More

    Submitted 6 May, 2019; originally announced May 2019.

    Comments: 6 pages, accepted at MIPR 2019

  4. Mining Automatically Estimated Poses from Video Recordings of Top Athletes

    Authors: Rainer Lienhart, Moritz Einfalt, Dan Zecha

    Abstract: Human pose detection systems based on state-of-the-art DNNs are on the go to be extended, adapted and re-trained to fit the application domain of specific sports. Therefore, plenty of noisy pose data will soon be available from videos recorded at a regular and frequent basis. This work is among the first to develop mining algorithms that can mine the expected abundance of noisy and annotation-free… ▽ More

    Submitted 27 April, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Under review for the International Journal of Computer Science in Sport

  5. Activity-conditioned continuous human pose estimation for performance analysis of athletes using the example of swimming

    Authors: Moritz Einfalt, Dan Zecha, Rainer Lienhart

    Abstract: In this paper we consider the problem of human pose estimation in real-world videos of swimmers. Swimming channels allow filming swimmers simultaneously above and below the water surface with a single stationary camera. These recordings can be used to quantitatively assess the athletes' performance. The quantitative evaluation, so far, requires manual annotations of body parts in each video frame.… ▽ More

    Submitted 2 February, 2018; originally announced February 2018.

    Comments: 10 pages, 9 figures, accepted at WACV 2018

  6. Improving Small Object Proposals for Company Logo Detection

    Authors: Christian Eggert, Dan Zecha, Stephan Brehm, Rainer Lienhart

    Abstract: Many modern approaches for object detection are two-staged pipelines. The first stage identifies regions of interest which are then classified in the second stage. Faster R-CNN is such an approach for object detection which combines both stages into a single pipeline. In this paper we apply Faster R-CNN to the task of company logo detection. Motivated by its weak performance on small object instan… ▽ More

    Submitted 28 April, 2017; originally announced April 2017.

    Comments: 8 Pages, ICMR 2017

  7. arXiv:1504.05369  [pdf, other

    cs.CV

    Key-Pose Prediction in Cyclic Human Motion

    Authors: Dan Zecha, Rainer Lienhart

    Abstract: In this paper we study the problem of estimating innercyclic time intervals within repetitive motion sequences of top-class swimmers in a swimming channel. Interval limits are given by temporal occurrences of key-poses, i.e. distinctive postures of the body. A key-pose is defined by means of only one or two specific features of the complete posture. It is often difficult to detect such subtle feat… ▽ More

    Submitted 21 April, 2015; originally announced April 2015.

    Comments: Accepted at WACV 2015, 8 pages, 3 figures