Skip to main content

Showing 1–9 of 9 results for author: Dooms, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.08852  [pdf, ps, other

    cs.CY cs.AI cs.LG

    Safe and Certifiable AI Systems: Concepts, Challenges, and Lessons Learned

    Authors: Kajetan Schweighofer, Barbara Brune, Lukas Gruber, Simon Schmid, Alexander Aufreiter, Andreas Gruber, Thomas Doms, Sebastian Eder, Florian Mayer, Xaver-Paul Stadlbauer, Christoph Schwald, Werner Zellinger, Bernhard Nessler, Sepp Hochreiter

    Abstract: There is an increasing adoption of artificial intelligence in safety-critical applications, yet practical schemes for certifying that AI systems are safe, lawful and socially acceptable remain scarce. This white paper presents the TÜV AUSTRIA Trusted AI framework an end-to-end audit catalog and methodology for assessing and certifying machine learning systems. The audit catalog has been in continu… ▽ More

    Submitted 8 September, 2025; originally announced September 2025.

    Comments: 63 pages, 27 figures

  2. arXiv:2504.09184  [pdf, ps, other

    cs.CL cs.AI

    Parameterized Synthetic Text Generation with SimpleStories

    Authors: Lennart Finke, Chandan Sreedhara, Thomas Dooms, Mat Allen, Emerald Zhang, Juan Diego Rodriguez, Noa Nabeshima, Thomas Marshall, Dan Braun

    Abstract: We present SimpleStories, a large synthetic story dataset in simple language, consisting of 2 million samples each in English and Japanese. Through parameterizing prompts at multiple levels of abstraction, we achieve control over story characteristics at scale, inducing syntactic and semantic diversity. Ablations on a newly trained model suite show improved sample efficiency and model interpretabi… ▽ More

    Submitted 30 May, 2025; v1 submitted 12 April, 2025; originally announced April 2025.

  3. arXiv:2504.02667  [pdf, other

    cs.LG

    Compositionality Unlocks Deep Interpretable Models

    Authors: Thomas Dooms, Ward Gauderis, Geraint A. Wiggins, Jose Oramas

    Abstract: We propose $χ$-net, an intrinsically interpretable architecture combining the compositional multilinear structure of tensor networks with the expressivity and efficiency of deep neural networks. $χ$-nets retain equal accuracy compared to their baseline counterparts. Our novel, efficient diagonalisation algorithm, ODT, reveals linear low-rank structure in a multilayer SVHN model. We leverage this t… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

  4. arXiv:2502.17332  [pdf, other

    cs.LG

    Tokenized SAEs: Disentangling SAE Reconstructions

    Authors: Thomas Dooms, Daniel Wilhelm

    Abstract: Sparse auto-encoders (SAEs) have become a prevalent tool for interpreting language models' inner workings. However, it is unknown how tightly SAE features correspond to computationally important directions in the model. This work empirically shows that many RES-JB SAE features predominantly correspond to simple input statistics. We hypothesize this is caused by a large class imbalance in training… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  5. arXiv:2410.08417  [pdf, ps, other

    cs.LG stat.ML

    Bilinear MLPs enable weight-based mechanistic interpretability

    Authors: Michael T. Pearce, Thomas Dooms, Alice Rigg, Jose M. Oramas, Lee Sharkey

    Abstract: A mechanistic understanding of how MLPs do computation in deep neural networks remains elusive. Current interpretability work can extract features from hidden activations over an input dataset but generally cannot explain how MLP weights construct features. One challenge is that element-wise nonlinearities introduce higher-order interactions and make it difficult to trace computations through the… ▽ More

    Submitted 25 June, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted to ICLR'25

  6. arXiv:2406.03947  [pdf, other

    cs.LG cs.AI

    Weight-based Decomposition: A Case for Bilinear MLPs

    Authors: Michael T. Pearce, Thomas Dooms, Alice Rigg

    Abstract: Gated Linear Units (GLUs) have become a common building block in modern foundation models. Bilinear layers drop the non-linearity in the "gate" but still have comparable performance to other GLUs. An attractive quality of bilinear layers is that they can be fully expressed in terms of a third-order tensor and linear operations. Leveraging this, we develop a method to decompose the bilinear tensor… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2311.18130  [pdf, other

    cs.LG cs.CV

    The Trifecta: Three simple techniques for training deeper Forward-Forward networks

    Authors: Thomas Dooms, Ing Jyh Tsang, Jose Oramas

    Abstract: Modern machine learning models are able to outperform humans on a variety of non-trivial tasks. However, as the complexity of the models increases, they consume significant amounts of power and still struggle to generalize effectively to unseen data. Local learning, which focuses on updating subsets of a model's parameters at a time, has emerged as a promising technique to address these issues. Re… ▽ More

    Submitted 12 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    MSC Class: 68T07

  8. arXiv:2310.02727  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Functional trustworthiness of AI systems by statistically valid testing

    Authors: Bernhard Nessler, Thomas Doms, Sepp Hochreiter

    Abstract: The authors are concerned about the safety, health, and rights of the European citizens due to inadequate measures and procedures required by the current draft of the EU Artificial Intelligence (AI) Act for the conformity assessment of AI systems. We observe that not only the current draft of the EU AI Act, but also the accompanying standardization efforts in CEN/CENELEC, have resorted to the posi… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Position paper to the current regulation and standardization effort of AI in Europe

  9. arXiv:2103.16910  [pdf

    stat.ML cs.CY cs.LG cs.SE

    Trusted Artificial Intelligence: Towards Certification of Machine Learning Applications

    Authors: Philip Matthias Winter, Sebastian Eder, Johannes Weissenböck, Christoph Schwald, Thomas Doms, Tom Vogt, Sepp Hochreiter, Bernhard Nessler

    Abstract: Artificial Intelligence is one of the fastest growing technologies of the 21st century and accompanies us in our daily lives when interacting with technical applications. However, reliance on such technical systems is crucial for their widespread applicability and acceptance. The societal tools to express reliance are usually formalized by lawful regulations, i.e., standards, norms, accreditations… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: 48 pages, 11 figures, soft-review