-
Constructive community race: full-density spiking neural network model drives neuromorphic computing
Authors:
Johanna Senk,
Anno C. Kurth,
Steve Furber,
Tobias Gemmeke,
Bruno Golosio,
Arne Heittmann,
James C. Knight,
Eric Müller,
Tobias Noll,
Thomas Nowotny,
Gorka Peraza Coppola,
Luca Peres,
Oliver Rhodes,
Andrew Rowley,
Johannes Schemmel,
Tim Stadtmann,
Tom Tetzlaff,
Gianmarco Tiddia,
Sacha J. van Albada,
José Villamar,
Markus Diesmann
Abstract:
The local circuitry of the mammalian brain is a focus of the search for generic computational principles because it is largely conserved across species and modalities. In 2014 a model was proposed representing all neurons and synapses of the stereotypical cortical microcircuit below $1\,\text{mm}^2$ of brain surface. The model reproduces fundamental features of brain activity but its impact remained limited because of its computational demands. For theory and simulation, however, the model was a breakthrough because it removes uncertainties of downscaling, and larger models are less densely connected. This sparked a race in the neuromorphic computing community and the model became a de facto standard benchmark. Within a few years real-time performance was reached and surpassed at significantly reduced energy consumption. We review how the computational challenge was tackled by different simulation technologies and derive guidelines for the next generation of benchmarks and other domains of science.
Submitted 2 June, 2025; v1 submitted 27 May, 2025;
originally announced May 2025.
-
Logical Discrete Graphical Models Must Supplement Large Language Models for Information Synthesis
Authors:
Gregory Coppola
Abstract:
Given the emergent reasoning abilities of large language models, information retrieval is becoming more complex. Rather than just retrieve a document, modern information retrieval systems advertise that they can synthesize an answer based on potentially many different documents, conflicting data sources, and using reasoning. We review recent literature and argue that the large language model has crucial flaws that prevent it from on its own ever constituting general intelligence, or answering general information synthesis requests. This review shows that the following are problems for large language models: hallucinations, complex reasoning, planning under uncertainty, and complex calculations. We outline how logical discrete graphical models can solve all of these problems, and outline a method of training a logical discrete model from unlabeled text.
Submitted 14 March, 2024;
originally announced March 2024.
-
A Categorization of Complexity Classes for Information Retrieval and Synthesis Using Natural Logic
Authors:
Gregory Coppola
Abstract:
Given the emergent reasoning abilities of large language models, information retrieval is becoming more complex. Rather than just retrieve a document, modern information retrieval systems advertise that they can synthesize an answer based on potentially many different documents, conflicting data sources, and using reasoning. But, different kinds of questions have different answers, and different answers have different complexities. In this paper, we introduce a novel framework for analyzing the complexity of a question answer based on the natural deduction calculus as presented in Prawitz (1965). Our framework is novel both in that no one to our knowledge has used this logic as a basis for complexity classes, and also in that no other existing complexity classes to these have been delineated using any analogous methods either. We identify three decidable fragments in particular called the forward, query and planning fragments, and we compare this to what would be needed to do proofs for the complete first-order calculus, for which theorem-proving is long known to be undecidable.
Submitted 28 February, 2024;
originally announced February 2024.
-
The Quantified Boolean Bayesian Network: Theory and Experiments with a Logical Graphical Model
Authors:
Gregory Coppola
Abstract:
This paper introduces the Quantified Boolean Bayesian Network (QBBN), which provides a unified view of logical and probabilistic reasoning. The QBBN is meant to address a central problem with the Large Language Model (LLM), which has become extremely popular in Information Retrieval, which is that the LLM hallucinates. A Bayesian Network, by construction, cannot hallucinate, because it can only return answers that it can explain. We show how a Bayesian Network over an unbounded number of boolean variables can be configured to represent the logical reasoning underlying human language. We do this by creating a key-value version of the First-Order Calculus, for which we can prove consistency and completeness. We show that the model is trivially trained over fully observed data, but that inference is non-trivial. Exact inference in a Bayesian Network is intractable (i.e. $\Omega(2^N)$ for $N$ variables). For inference, we investigate the use of Loopy Belief Propagation (LBP), which is not guaranteed to converge, but which has been shown to often converge in practice. Our experiments show that LBP indeed does converge very reliably, and our analysis shows that a round of LBP takes time $O(N2^n)$, where $N$ bounds the number of variables considered, and $n$ bounds the number of incoming connections to any factor, and further improvements may be possible. Our network is specifically designed to alternate between AND and OR gates in a Boolean Algebra, which connects more closely to logical reasoning, allowing a completeness proof for an expanded version of our network, and also allows inference to follow specific but adequate pathways, that turn out to be fast.
Submitted 9 February, 2024;
originally announced February 2024.
-
Fact-Checking at Scale with DimensionRank
Authors:
Gregory Coppola
Abstract:
The most important problem that has emerged after twenty years of popular internet usage is that of fact-checking at scale. This problem is experienced acutely in both of the major internet application platform types, web search and social media.
We offer a working definition of what a "platform" is. We critically deconstruct what we call the "PolitiFact" model of fact checking, and show it to be inherently inferior for fact-checking at scale to a platform-based solution.
Our central contribution is to show how to effectively platformize the problem of fact-checking at scale. We show how a two-dimensional rating system, with dimensions agreement and hotness, allows us to create information-seeking queries not possible with the one-dimensional rating system predominating on existing platforms. And, we show that, underlying our user-friendly user-interface, lies a system that allows the creation of formal proofs in the propositional calculus.
Our algorithm is implemented in our open-source DimensionRank software package available at "https://thinkdifferentagain.art".
Submitted 20 October, 2020;
originally announced October 2020.
-
A Hub-and-Spoke Model for Content-Moderation-at-Scale on an Information-Sharing Platform
Authors:
Gregory Coppola
Abstract:
One of the most expensive parts of maintaining a modern information-sharing platform (e.g., web search, social network) is the task of content-moderation-at-scale. Content moderation is the binary task of determining whether or not a given user-created message meets the editorial team's content guidelines for the site. The challenge is that the number of messages to check scales with the number of users, which is much larger than the number of moderator-employees working for the given platform.
We show how content moderation can be achieved significantly more cheaply than before, in the special case where all messages are public, by effectively platformizing the task of content moderation. Our approach is to use a hub-and-spoke model. The hub is the core editorial team delegated by the management of the given platform. The spokes are the individual users. The ratings of the editorial team create the labels for a statistical learning algorithm, while the ratings of the users are used as features.
We have implemented a primitive version of this algorithm into our open-source DimensionRank code base, found at "thinkdifferentagain.art".
Submitted 20 October, 2020;
originally announced October 2020.
-
DimensionRank: Personal Neural Representations for Personalized General Search
Authors:
Gregory Coppola
Abstract:
Web Search and Social Media have always been two of the most important applications on the internet. We begin by giving a unified framework, called general search, of which all search and social media products can be seen as instances.
DimensionRank is our main contribution. This is an algorithm for personalized general search, based on neural networks. DimensionRank's bold innovation is to model and represent each user using their own unique personal neural representation vector, a learned representation in a real-valued multidimensional vector space. This is the first internet service we are aware of to model each user with their own independent representation vector. This is also the first service we are aware of to attempt personalization for general web search. Also, neural representations allow us to present the first Reddit-style algorithm that is immune to the problem of "brigading". We believe personalized general search will yield a search product orders of magnitude better than Google's one-size-fits-all web search algorithm.
Finally, we announce Deep Revelations, a new search and social network internet application based on DimensionRank.
Submitted 26 May, 2020;
originally announced May 2020.
-
Partial Labeled Gastric Tumor Segmentation via patch-based Reiterative Learning
Authors:
Yang Nan,
Gianmarc Coppola,
Qiaokang Liang,
Kunglin Zou,
Wei Sun,
Dan Zhang,
Yaonan Wang,
Guanzhen Yu
Abstract:
Gastric cancer is the second leading cause of cancer-related deaths worldwide, and the major hurdle in biomedical image analysis is the determination of the cancer extent. This assignment has high clinical relevance and would generally require vast microscopic assessment by pathologists. Recent advances in deep learning have produced inspiring results on biomedical image segmentation, while its outcome is reliant on comprehensive annotation. This requires plenty of labor costs, for the ground truth must be annotated meticulously by pathologists. In this paper, a reiterative learning framework was presented to train our network on partial annotated biomedical images, and superior performance was achieved without any pre-trained or further manual annotation. We eliminate the boundary error of patch-based model through our overlapped region forecast algorithm. Through these advisable methods, a mean intersection over union coefficient (IOU) of 0.883 and mean accuracy of 91.09% on the partial labeled dataset was achieved, which made us win the 2017 China Big Data & Artificial Intelligence Innovation and Entrepreneurship Competitions.
Submitted 20 December, 2017;
originally announced December 2017.