-
Towards Achieving Concept Completeness for Textual Concept Bottleneck Models
Authors:
Milan Bhan,
Yann Choho,
Pierre Moreau,
Jean-Noel Vittaut,
Nicolas Chesneau,
Marie-Jeanne Lesot
Abstract:
Textual Concept Bottleneck Models (TCBMs) are interpretable-by-design models for text classification that predict a set of salient concepts before making the final prediction. This paper proposes Complete Textual Concept Bottleneck Model (CT-CBM), a novel TCBM generator building concept labels in a fully unsupervised manner using a small language model, eliminating both the need for predefined hum…
▽ More
Textual Concept Bottleneck Models (TCBMs) are interpretable-by-design models for text classification that predict a set of salient concepts before making the final prediction. This paper proposes Complete Textual Concept Bottleneck Model (CT-CBM), a novel TCBM generator building concept labels in a fully unsupervised manner using a small language model, eliminating both the need for predefined human labeled concepts and LLM annotations. CT-CBM iteratively targets and adds important and identifiable concepts in the bottleneck layer to create a complete concept basis. CT-CBM achieves striking results against competitors in terms of concept basis completeness and concept detection accuracy, offering a promising solution to reliably enhance interpretability of NLP classifiers.
△ Less
Submitted 28 May, 2025; v1 submitted 16 February, 2025;
originally announced February 2025.
-
Path Verification for Dynamic Indirect Illumination
Authors:
Pierre Moreau,
Michael Doggett,
Erik Sintorn
Abstract:
In this paper we present a technique that improves rendering performance for real-time scenes with ray traced lighting in the presence of dynamic lights and objects. In particular we verify photon paths from the previous frame against dynamic objects in the current frame, and show how most photon paths are still valid. When using area lights, we use a data structure to store light distribution tha…
▽ More
In this paper we present a technique that improves rendering performance for real-time scenes with ray traced lighting in the presence of dynamic lights and objects. In particular we verify photon paths from the previous frame against dynamic objects in the current frame, and show how most photon paths are still valid. When using area lights, we use a data structure to store light distribution that tracks light paths allowing photons to be reused when the light source is moving in the scene. We also show that by reusing paths when the error in the reflected energy is below a threshold value, even more paths can be reused. We apply this technique to Indirect Illumination using a screen space photon splatting rendering engine. By reusing photon paths and applying our error threshold, our method can reduce the number of rays traced by up to 5x, and improve performance by up to 2x.
△ Less
Submitted 12 November, 2021;
originally announced November 2021.
-
Generic Encodings of Constructor Rewriting Systems
Authors:
Horatiu Cirstea,
Pierre-Etienne Moreau
Abstract:
Rewriting is a formalism widely used in computer science and mathematical logic. The classical formalism has been extended, in the context of functional languages, with an order over the rules and, in the context of rewrite based languages, with the negation over patterns. We propose in this paper a concise and clear algorithm computing the difference over patterns which can be used to define gene…
▽ More
Rewriting is a formalism widely used in computer science and mathematical logic. The classical formalism has been extended, in the context of functional languages, with an order over the rules and, in the context of rewrite based languages, with the negation over patterns. We propose in this paper a concise and clear algorithm computing the difference over patterns which can be used to define generic encodings of constructor term rewriting systems with negation and order into classical term rewriting systems. As a direct consequence, established methods used for term rewriting systems can be applied to analyze properties of the extended systems. The approach can also be seen as a generic compiler which targets any language providing basic pattern matching primitives. The formalism provides also a new method for deciding if a set of patterns subsumes a given pattern and thus, for checking the presence of useless patterns or the completeness of a set of patterns.
△ Less
Submitted 11 June, 2019; v1 submitted 14 May, 2019;
originally announced May 2019.
-
Faithful (meta-)encodings of programmable strategies into term rewriting systems
Authors:
Horatiu Cirstea,
Serguei Lenglet,
Pierre-Etienne Moreau
Abstract:
Rewriting is a formalism widely used in computer science and mathematical logic. When using rewriting as a programming or modeling paradigm, the rewrite rules describe the transformations one wants to operate and rewriting strategies are used to con- trol their application. The operational semantics of these strategies are generally accepted and approaches for analyzing the termination of specific…
▽ More
Rewriting is a formalism widely used in computer science and mathematical logic. When using rewriting as a programming or modeling paradigm, the rewrite rules describe the transformations one wants to operate and rewriting strategies are used to con- trol their application. The operational semantics of these strategies are generally accepted and approaches for analyzing the termination of specific strategies have been studied. We propose in this paper a generic encoding of classic control and traversal strategies used in rewrite based languages such as Maude, Stratego and Tom into a plain term rewriting system. The encoding is proven sound and complete and, as a direct consequence, estab- lished termination methods used for term rewriting systems can be applied to analyze the termination of strategy controlled term rewriting systems. We show that the encoding of strategies into term rewriting systems can be easily adapted to handle many-sorted signa- tures and we use a meta-level representation of terms to reduce the size of the encodings. The corresponding implementation in Tom generates term rewriting systems compatible with the syntax of termination tools such as AProVE and TTT2, tools which turned out to be very effective in (dis)proving the termination of the generated term rewriting systems. The approach can also be seen as a generic strategy compiler which can be integrated into languages providing pattern matching primitives; experiments in Tom show that applying our encoding leads to performances comparable to the native Tom strategies.
△ Less
Submitted 26 November, 2017; v1 submitted 24 May, 2017;
originally announced May 2017.
-
A Type System for Tom
Authors:
Claude Kirchner,
Pierre-Etienne Moreau,
Cláudia Tavares
Abstract:
Extending a given language with new dedicated features is a general and quite used approach to make the programming language more adapted to problems. Being closer to the application, this leads to less programming flaws and easier maintenance. But of course one would still like to perform program analysis on these kinds of extended languages, in particular type checking and inference. In this ca…
▽ More
Extending a given language with new dedicated features is a general and quite used approach to make the programming language more adapted to problems. Being closer to the application, this leads to less programming flaws and easier maintenance. But of course one would still like to perform program analysis on these kinds of extended languages, in particular type checking and inference. In this case one has to make the typing of the extended features compatible with the ones in the starting language.
The Tom programming language is a typical example of such a situation as it consists of an extension of Java that adds pattern matching, more particularly associative pattern matching, and reduction strategies.
This paper presents a type system with subtyping for Tom, that is compatible with Java's type system, and that performs both type checking and type inference. We propose an algorithm that checks if all patterns of a Tom program are well-typed. In addition, we propose an algorithm based on equality and subtyping constraints that infers types of variables occurring in a pattern. Both algorithms are exemplified and the proposed type system is showed to be sound and complete.
△ Less
Submitted 25 March, 2010;
originally announced March 2010.