Search | arXiv e-print repository

Learning with Fitzpatrick Losses

Authors: Seta Rakotomandimby, Jean-Philippe Chancelier, Michel de Lara, Mathieu Blondel

Abstract: Fenchel-Young losses are a family of convex loss functions, encompassing the squared, logistic and sparsemax losses, among others. Each Fenchel-Young loss is implicitly associated with a link function, for mapping model outputs to predictions. For instance, the logistic loss is associated with the soft argmax link function. Can we build new loss functions associated with the same link function as… ▽ More Fenchel-Young losses are a family of convex loss functions, encompassing the squared, logistic and sparsemax losses, among others. Each Fenchel-Young loss is implicitly associated with a link function, for mapping model outputs to predictions. For instance, the logistic loss is associated with the soft argmax link function. Can we build new loss functions associated with the same link function as Fenchel-Young losses? In this paper, we introduce Fitzpatrick losses, a new family of convex loss functions based on the Fitzpatrick function. A well-known theoretical tool in maximal monotone operator theory, the Fitzpatrick function naturally leads to a refined Fenchel-Young inequality, making Fitzpatrick losses tighter than Fenchel-Young losses, while maintaining the same link function for prediction. As an example, we introduce the Fitzpatrick logistic loss and the Fitzpatrick sparsemax loss, counterparts of the logistic and the sparsemax losses. This yields two new tighter losses associated with the soft argmax and the sparse argmax, two of the most ubiquitous output layers used in machine learning. We study in details the properties of Fitzpatrick losses and in particular, we show that they can be seen as Fenchel-Young losses using a modified, target-dependent generating function. We demonstrate the effectiveness of Fitzpatrick losses for label proportion estimation. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2108.03099 [pdf, other]

Causal Inference Theory with Information Dependency Models

Authors: Benjamin Heymann, Michel de Lara, Jean-Philippe Chancelier

Abstract: Inferring the potential consequences of an unobserved event is a fundamental scientific question. To this end, Pearl's celebrated do-calculus provides a set of inference rules to derive an interventional probability from an observational one. In this framework, the primitive causal relations are encoded as functional dependencies in a Structural Causal Model (SCM), which are generally mapped into… ▽ More Inferring the potential consequences of an unobserved event is a fundamental scientific question. To this end, Pearl's celebrated do-calculus provides a set of inference rules to derive an interventional probability from an observational one. In this framework, the primitive causal relations are encoded as functional dependencies in a Structural Causal Model (SCM), which are generally mapped into a Directed Acyclic Graph (DAG) in the absence of cycles. In this paper, by contrast, we capture causality without reference to graphs or functional dependencies, but with information fields and Witsenhausen's intrinsic model. The three rules of do-calculus reduce to a unique sufficient condition for conditional independence, the topological separation, which presents interesting theoretical and practical advantages over the d-separation. With this unique rule, we can deal with systems that cannot be represented with DAGs, for instance systems with cycles and/or 'spurious' edges. We treat an example that cannot be handled-to the extent of our knowledge-with the tools of the current literature. We also explain why, in the presence of cycles, the theory of causal inference might require different tools, depending on whether the random variables are discrete or continuous. △ Less

Submitted 9 August, 2021; v1 submitted 6 August, 2021; originally announced August 2021.

arXiv:2108.03096 [pdf, ps, other]

Topological Conditional Separation

Authors: Michel de Lara, Jean-Philippe Chancelier, Benjamin Heymann

Abstract: Pearl's d-separation is a foundational notion to study conditional independence between random variables. We define the topological conditional separation and we show that it is equivalent to the d-separation, extended beyond acyclic graphs, be they finite or infinite. Pearl's d-separation is a foundational notion to study conditional independence between random variables. We define the topological conditional separation and we show that it is equivalent to the d-separation, extended beyond acyclic graphs, be they finite or infinite. △ Less

Submitted 6 August, 2021; originally announced August 2021.

arXiv:2108.03018 [pdf, other]

Conditional Separation as a Binary Relation. A Coq Assisted Proof

Authors: Jean-Philippe Chancelier, Michel de Lara, Benjamin Heymann

Abstract: The concept of d-separation holds a pivotal role in causality theory, serving as a fundamental tool for deriving conditional independence properties from causal graphs. Pearl defined the d-separation of two subsets conditionally on a third one. In this study, we present a novel perspective by showing i) how the d-separation can be extended beyond acyclic graphs, possibly infinite, and ii) how… ▽ More The concept of d-separation holds a pivotal role in causality theory, serving as a fundamental tool for deriving conditional independence properties from causal graphs. Pearl defined the d-separation of two subsets conditionally on a third one. In this study, we present a novel perspective by showing i) how the d-separation can be extended beyond acyclic graphs, possibly infinite, and ii) how it can be expressed and characterized as a binary relation between vertices. Compared to the typical perspectives in causality theory, our equivalence opens the door to more compact and computational proofing techniques, because the language of binary relations is well adapted to equational reasoning. Additionally, and of independent interest, the proofs of the results presented in this paper are checked with the Coq proof assistant. △ Less

Submitted 2 April, 2024; v1 submitted 6 August, 2021; originally announced August 2021.

arXiv:2104.05249 [pdf, ps, other]

Kuhn's Equivalence Theorem for Games in Product Form

Authors: Benjamin Heymann, Michel de Lara, Jean-Philippe Chancelier

Abstract: We propose an alternative to the tree representation of extensive form games. Games in product form represent information with $σ$-fields over a product set, and do not require an explicit description of the play temporality, as opposed to extensive form games on trees. This representation encompasses games with a continuum of actions, randomness and players, as well as games for which the play or… ▽ More We propose an alternative to the tree representation of extensive form games. Games in product form represent information with $σ$-fields over a product set, and do not require an explicit description of the play temporality, as opposed to extensive form games on trees. This representation encompasses games with a continuum of actions, randomness and players, as well as games for which the play order cannot be determined in advance. We adapt and prove Kuhn's theorem-regarding equivalence between mixed and behavioral strategies under perfect recall-for games in product form with continuous action sets. △ Less

Submitted 13 July, 2022; v1 submitted 12 April, 2021; originally announced April 2021.

arXiv:2006.14838 [pdf, ps, other]

Kuhn's Equivalence Theorem for Games in Intrinsic Form

Authors: Benjamin Heymann, Michel de Lara, Jean-Philippe Chancelier

Abstract: We state and prove Kuhn's equivalence theorem for a new representation of games, the intrinsic form. First, we introduce games in intrinsic form where information is represented by $σ$-fields over a product set. For this purpose, we adapt to games the intrinsic representation that Witsenhausen introduced in control theory. Those intrinsic games do not require an explicit description of the play te… ▽ More We state and prove Kuhn's equivalence theorem for a new representation of games, the intrinsic form. First, we introduce games in intrinsic form where information is represented by $σ$-fields over a product set. For this purpose, we adapt to games the intrinsic representation that Witsenhausen introduced in control theory. Those intrinsic games do not require an explicit description of the play temporality, as opposed to extensive form games on trees. Second, we prove, for this new and more general representation of games, that behavioral and mixed strategies are equivalent under perfect recall (Kuhn's theorem). As the intrinsic form replaces the tree structure with a product structure, the handling of information is easier. This makes the intrinsic form a new valuable tool for the analysis of games with information. △ Less

Submitted 26 June, 2020; originally announced June 2020.

arXiv:1510.02789 [pdf, ps, other]

A novel code generation methodology for block diagram modeler and simulators Scicos and VSS

Authors: Jean-Philippe Chancelier, Ramine Nikoukhah

Abstract: Block operations during simulation in Scicos and VSS environments can naturally be described as Nsp functions. But the direct use of Nsp functions for simulation leads to poor performance since the Nsp language is interpreted, not compiled. The methodology presented in this paper is used to develop a tool for generating efficient compilable code, such as C and ADA, for Scicos and VSS models from t… ▽ More Block operations during simulation in Scicos and VSS environments can naturally be described as Nsp functions. But the direct use of Nsp functions for simulation leads to poor performance since the Nsp language is interpreted, not compiled. The methodology presented in this paper is used to develop a tool for generating efficient compilable code, such as C and ADA, for Scicos and VSS models from these block Nsp functions. Operator overloading and partial evaluation are the key elements of this novel approach. This methodology may be used in other simulation environments such as Matlab/Simulink. △ Less

Submitted 8 October, 2015; originally announced October 2015.

arXiv:1001.3213 [pdf, ps, other]

Using Premia and Nsp for Constructing a Risk Management Benchmark for Testing Parallel Architecture

Authors: Jean-Philippe Chancelier, Jérôme Lelong, Bernard Lapeyre

Abstract: Financial institutions have massive computations to carry out overnight which are very demanding in terms of the consumed CPU. The challenge is to price many different products on a cluster-like architecture. We have used the Premia software to valuate the financial derivatives. In this work, we explain how Premia can be embedded into Nsp, a scientific software like Matlab, to provide a powerful t… ▽ More Financial institutions have massive computations to carry out overnight which are very demanding in terms of the consumed CPU. The challenge is to price many different products on a cluster-like architecture. We have used the Premia software to valuate the financial derivatives. In this work, we explain how Premia can be embedded into Nsp, a scientific software like Matlab, to provide a powerful tool to valuate a whole portfolio. Finally, we have integrated an MPI toolbox into Nsp to enable to use Premia to solve a bunch of pricing problems on a cluster. This unified framework can then be used to test different parallel architectures. △ Less

Submitted 21 May, 2012; v1 submitted 19 January, 2010; originally announced January 2010.

Showing 1–8 of 8 results for author: Chancelier, J