Showing 1–2 of 2 results for author: Loukas, O

Search v0.5.6 released 2020-02-24

arXiv:2204.03406 [pdf, other]

hep-th cond-mat.stat-mech stat.ML

Categorical Distributions of Maximum Entropy under Marginal Constraints

Authors: Orestis Loukas, Ho Ryun Chung

Abstract: The estimation of categorical distributions under marginal constraints summarizing some sample from a population in the most-generalizable way is key for many machine-learning and data-driven approaches. We provide a parameter-agnostic theoretical framework that enables this task ensuring (i) that a categorical distribution of Maximum Entropy under marginal constraints always exists and (ii) that… ▽ More The estimation of categorical distributions under marginal constraints summarizing some sample from a population in the most-generalizable way is key for many machine-learning and data-driven approaches. We provide a parameter-agnostic theoretical framework that enables this task ensuring (i) that a categorical distribution of Maximum Entropy under marginal constraints always exists and (ii) that it is unique. The procedure of iterative proportional fitting (IPF) naturally estimates that distribution from any consistent set of marginal constraints directly in the space of probabilities, thus deductively identifying a least-biased characterization of the population. The theoretical framework together with IPF leads to a holistic workflow that enables modeling any class of categorical distributions solely using the phenomenological information provided. △ Less

Submitted 15 November, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

Comments: 20 pages, 5 figures, 1 Excel file
arXiv:1912.05634 [pdf, other]

cond-mat.dis-nn cond-mat.stat-mech hep-th stat.ML

Self-regularizing restricted Boltzmann machines

Authors: Orestis Loukas

Abstract: Focusing on the grand-canonical extension of the ordinary restricted Boltzmann machine, we suggest an energy-based model for feature extraction that uses a layer of hidden units with varying size. By an appropriate choice of the chemical potential and given a sufficiently large number of hidden resources the generative model is able to efficiently deduce the optimal number of hidden units required… ▽ More Focusing on the grand-canonical extension of the ordinary restricted Boltzmann machine, we suggest an energy-based model for feature extraction that uses a layer of hidden units with varying size. By an appropriate choice of the chemical potential and given a sufficiently large number of hidden resources the generative model is able to efficiently deduce the optimal number of hidden units required to learn the target data with exceedingly small generalization error. The formal simplicity of the grand-canonical ensemble combined with a rapidly converging ansatz in mean-field theory enable us to recycle well-established numerical algothhtims during training, like contrastive divergence, with only minor changes. As a proof of principle and to demonstrate the novel features of grand-canonical Boltzmann machines, we train our generative models on data from the Ising theory and MNIST. △ Less

Submitted 9 December, 2019; originally announced December 2019.

Comments: 28 pages, 11 figures

Search v0.5.6 released 2020-02-24