-
H2O Open Ecosystem for State-of-the-art Large Language Models
Authors:
Arno Candel,
Jon McKinney,
Philipp Singer,
Pascal Pfeiffer,
Maximilian Jeblick,
Chun Ming Lee,
Marcos V. Conde
Abstract:
Large Language Models (LLMs) represent a revolution in AI. However, they also pose many significant risks, such as the presence of biased, private, copyrighted or harmful text. For this reason we need open, transparent and safe solutions. We introduce a complete open-source ecosystem for developing and testing LLMs. The goal of this project is to boost open alternatives to closed-source approaches…
▽ More
Large Language Models (LLMs) represent a revolution in AI. However, they also pose many significant risks, such as the presence of biased, private, copyrighted or harmful text. For this reason we need open, transparent and safe solutions. We introduce a complete open-source ecosystem for developing and testing LLMs. The goal of this project is to boost open alternatives to closed-source approaches. We release h2oGPT, a family of fine-tuned LLMs of diverse sizes. We also introduce H2O LLM Studio, a framework and no-code GUI designed for efficient fine-tuning, evaluation, and deployment of LLMs using the most recent state-of-the-art techniques. Our code and models are fully open-source. We believe this work helps to boost AI development and make it more accessible, efficient and trustworthy. The demo is available at: https://gpt.h2o.ai/
△ Less
Submitted 23 October, 2023; v1 submitted 17 October, 2023;
originally announced October 2023.
-
h2oGPT: Democratizing Large Language Models
Authors:
Arno Candel,
Jon McKinney,
Philipp Singer,
Pascal Pfeiffer,
Maximilian Jeblick,
Prithvi Prabhu,
Jeff Gambera,
Mark Landry,
Shivam Bansal,
Ryan Chesler,
Chun Ming Lee,
Marcos V. Conde,
Pasha Stetsenko,
Olivier Grellier,
SriSatish Ambati
Abstract:
Applications built on top of Large Language Models (LLMs) such as GPT-4 represent a revolution in AI due to their human-level capabilities in natural language processing. However, they also pose many significant risks such as the presence of biased, private, or harmful text, and the unauthorized inclusion of copyrighted material.
We introduce h2oGPT, a suite of open-source code repositories for…
▽ More
Applications built on top of Large Language Models (LLMs) such as GPT-4 represent a revolution in AI due to their human-level capabilities in natural language processing. However, they also pose many significant risks such as the presence of biased, private, or harmful text, and the unauthorized inclusion of copyrighted material.
We introduce h2oGPT, a suite of open-source code repositories for the creation and use of LLMs based on Generative Pretrained Transformers (GPTs). The goal of this project is to create the world's best truly open-source alternative to closed-source approaches. In collaboration with and as part of the incredible and unstoppable open-source community, we open-source several fine-tuned h2oGPT models from 7 to 40 Billion parameters, ready for commercial use under fully permissive Apache 2.0 licenses. Included in our release is 100\% private document search using natural language.
Open-source language models help boost AI development and make it more accessible and trustworthy. They lower entry hurdles, allowing people and groups to tailor these models to their needs. This openness increases innovation, transparency, and fairness. An open-source strategy is needed to share AI benefits fairly, and H2O.ai will continue to democratize AI and LLMs.
△ Less
Submitted 16 June, 2023; v1 submitted 13 June, 2023;
originally announced June 2023.
-
Some examples of first exit times
Authors:
Jesús Antonio Álvarez López,
Alberto Candel
Abstract:
The purpose of this article is to compute the expected first exit times of Brownian motion from a variety of domains in the Euclidean plane and in the hyperbolic plane.
The purpose of this article is to compute the expected first exit times of Brownian motion from a variety of domains in the Euclidean plane and in the hyperbolic plane.
△ Less
Submitted 22 July, 2016; v1 submitted 27 June, 2016;
originally announced June 2016.
-
Non-reduction of relations in the Gromov space to Polish actions
Authors:
Jesús A. Álvarez López,
Alberto Candel
Abstract:
It is shown that, in the Gromov space of isometry classes of pointed proper metric spaces, the equivalence relations defined by existence of coarse quasi-isometries or being at finite Gromov-Hausdorff distance, cannot be reduced to the equivalence relation defined by any Polish action.
It is shown that, in the Gromov space of isometry classes of pointed proper metric spaces, the equivalence relations defined by existence of coarse quasi-isometries or being at finite Gromov-Hausdorff distance, cannot be reduced to the equivalence relation defined by any Polish action.
△ Less
Submitted 28 October, 2017; v1 submitted 12 January, 2015;
originally announced January 2015.
-
A universal Riemannian foliated space
Authors:
Jesús A. Álvarez López,
Ramón Barral Lijó,
Alberto Candel
Abstract:
It is proved that the isometry classes of pointed connected complete Riemannian $n$-manifolds form a Polish space, $\mathcal{M}_*^\infty(n)$, with the topology described by the $C^\infty$ convergence of manifolds. This space has a canonical partition into sets defined by varying the distinguished point into each manifold. The locally non-periodic manifolds define an open dense subspace…
▽ More
It is proved that the isometry classes of pointed connected complete Riemannian $n$-manifolds form a Polish space, $\mathcal{M}_*^\infty(n)$, with the topology described by the $C^\infty$ convergence of manifolds. This space has a canonical partition into sets defined by varying the distinguished point into each manifold. The locally non-periodic manifolds define an open dense subspace $\mathcal{M}_{*,\text{lnp}}^\infty(n)\subset\mathcal{M}_*^\infty(n)$, which becomes a $C^\infty$ foliated space with the restriction of the canonical partition. Its leaves without holonomy form the subspace $\mathcal{M}_{*,\text{np}}^\infty(n)\subset\mathcal{M}_{*,\text{lnp}}^\infty(n)$ defined by the non-periodic manifolds. Moreover the leaves have a natural Riemannian structure so that $\mathcal{M}_{*,\text{lnp}}^\infty(n)$ becomes a Riemannian foliated space, which is universal among all sequential Riemannian foliated spaces satisfying certain property called covering-continuity. $\mathcal{M}_{*,\text{lnp}}^\infty(n)$ is used to characterize the realization of complete connected Riemannian manifolds as dense leaves of covering-continuous compact sequential Riemannian foliated spaces.
△ Less
Submitted 13 December, 2016; v1 submitted 20 August, 2014;
originally announced August 2014.
-
Generic coarse geometry of leaves
Authors:
Jesús A. Álvarez López,
Alberto Candel
Abstract:
A compact Polish foliated space is considered. Part of this work studies coarsely quasi-isometric invariants of leaves in some residual saturated subset when the foliated space is transitive. In fact, we also use "equi-" versions of this kind of invariants, which means that the definition is satisfied with the same constants by some given set of leaves. For instance, the following properties are p…
▽ More
A compact Polish foliated space is considered. Part of this work studies coarsely quasi-isometric invariants of leaves in some residual saturated subset when the foliated space is transitive. In fact, we also use "equi-" versions of this kind of invariants, which means that the definition is satisfied with the same constants by some given set of leaves. For instance, the following properties are proved. Either all dense leaves without holonomy are equi-coarsely quasi-isometric to each other, or else there exist residually many dense leaves without holonomy such that each of them is coarsely quasi-isometric to meagerly many leaves. Assuming that the foliated space is minimal, the first of the above alternatives holds if and if the leaves without holonomy satisfy a condition called coarse quasi-symmetry. A similar dichotomy holds for the growth type of the leaves, as well as an analogous characterization of the first alternative in the minimal case, involving a property called growth symmetry. Moreover some classes of growth are shared, either by residually many leaves, or by meagerly many leaves. If some leaf without holonomy is amenable, then all dense leaves without holonomy are equi-amenable, and, in the minimal case, they satisfy a property called amenable symmetry. Residually many leaves have the same asymptotic dimension. If the foliated space is minimal, then any pair of nonempty open sets in the Higson coronas of the leaves with holonomy contain homeomorphic nonempty open subsets. Another part studies limit sets of leaves at points in their Higson corona, defined like the usual limit sets at their ends.
△ Less
Submitted 8 December, 2017; v1 submitted 6 June, 2014;
originally announced June 2014.
-
Topological description of Riemannian foliations with dense leaves
Authors:
Jesús A. Álvarez López,
Alberto Candel
Abstract:
A transitive compact foliated space is shown to be a Riemannian foliation if and only if it is locally connected, finite dimensional, strongly equicontinuous and quasi-analytic, and the closure of its holonomy pseudogroup is quasi-analytic.
A transitive compact foliated space is shown to be a Riemannian foliation if and only if it is locally connected, finite dimensional, strongly equicontinuous and quasi-analytic, and the closure of its holonomy pseudogroup is quasi-analytic.
△ Less
Submitted 14 November, 2013;
originally announced November 2013.
-
Equicontinuous foliated spaces
Authors:
Jesús A. Álvarez López,
Alberto Candel
Abstract:
Some properties of Riemannian foliations on closed manifolds are generalized to compact equicontinuous foliated spaces. For instance, it is proved that all holonomy covers of the leaves are quasi-isometric to each other.
Some properties of Riemannian foliations on closed manifolds are generalized to compact equicontinuous foliated spaces. For instance, it is proved that all holonomy covers of the leaves are quasi-isometric to each other.
△ Less
Submitted 14 November, 2013;
originally announced November 2013.
-
Algebraic characterization of quasi-isometric spaces via the Higson compactification
Authors:
Jesús A. Álvarez López,
Alberto Candel
Abstract:
The purpose of this article is to characterize the quasi-isometry type of a proper metric space via the Banach algebra of Higson functions on it.
The purpose of this article is to characterize the quasi-isometry type of a proper metric space via the Banach algebra of Higson functions on it.
△ Less
Submitted 14 November, 2013;
originally announced November 2013.
-
On turbulent relations
Authors:
Jesús A. Álvarez López,
Alberto Candel
Abstract:
This paper extends the theory of turbulence of Hjorth to certain classes of equivalence relations that cannot be induced by Polish actions. It applies this theory to analyze the quasi-isometry relation and finite Gromov-Hausdorff distance relation in the space of isometry classes of pointed proper metric spaces, called the Gromov space.
This paper extends the theory of turbulence of Hjorth to certain classes of equivalence relations that cannot be induced by Polish actions. It applies this theory to analyze the quasi-isometry relation and finite Gromov-Hausdorff distance relation in the space of isometry classes of pointed proper metric spaces, called the Gromov space.
△ Less
Submitted 13 December, 2016; v1 submitted 3 September, 2012;
originally announced September 2012.
-
Connection preserving actions are topologically engaging
Authors:
A. Candel,
R. Quiroga-Barranco
Abstract:
Topologically and geometrically engaging actions have proved to be useful to obtain rigidity results for semisimple Lie group actions. We show that the action of a simple noncompact Lie group on a compact manifold preserving a unimodular rigid geometric structure of algebraic type (e.g. a connection together with a volume density) is topologically engaging on an open conull dense set.
Topologically and geometrically engaging actions have proved to be useful to obtain rigidity results for semisimple Lie group actions. We show that the action of a simple noncompact Lie group on a compact manifold preserving a unimodular rigid geometric structure of algebraic type (e.g. a connection together with a volume density) is topologically engaging on an open conull dense set.
△ Less
Submitted 10 January, 2012;
originally announced January 2012.