$\text{C}^2\text{P}$: Featuring Large Language Models with Causal Reasoning
Authors:
Abdolmahdi Bagheri,
Matin Alinejad,
Kevin Bello,
Alireza Akhondi-Asl
Abstract:
Causal reasoning is one of the primary bottlenecks that Large Language Models (LLMs) must overcome to attain human-level intelligence. Recent studies indicate that LLMs display near-random performance on reasoning tasks. To address this, we introduce the Causal Chain of Prompting ($\text{C}^2\text{P}$), a reasoning framework that aims to equip current LLMs with causal reasoning capabilities. It is the first framework of its kind to operate autonomously, without relying on external tools or modules during either the causal learning or the reasoning phase. To evaluate the performance of $\text{C}^2\text{P}$, we first demonstrate that, on a synthetic benchmark dataset, reasoning accuracy improved by over $30.7\%$ and $25.9\%$ for GPT-4 Turbo and LLaMA 3.1, respectively, when using our framework, compared to the same models without $\text{C}^2\text{P}$. Then, using few-shot learning of the same LLMs with $\text{C}^2\text{P}$, the reasoning accuracy increased by more than $20.05\%$ and $20.89\%$, respectively, with as few as ten examples, compared to the corresponding LLMs without $\text{C}^2\text{P}$ on the same dataset. To evaluate $\text{C}^2\text{P}$ in realistic scenarios, we used another benchmark dataset containing natural stories across various fields, including healthcare, medicine, economics, education, social sciences, environmental science, and marketing. The results show improved reasoning when $\text{C}^2\text{P}$ is applied; without our framework, the models often give random and hallucinated responses. By showing the improved performance of few-shot learned GPT-4 Turbo and LLaMA 3.1 with $\text{C}^2\text{P}$, we demonstrate the generalizability of our framework.
Submitted 14 December, 2024; v1 submitted 25 July, 2024;
originally announced July 2024.
Counting short cycles of (c,d)-regular bipartite graphs
Authors:
Mohsen Alinejad,
Kazem Khashyarmanesh
Abstract:
Recently, the study of Tanner graphs, which represent low-density parity-check (LDPC) codes, has become an active research subject. The problem of finding the number of short cycles of Tanner graphs motivated Blake and Lin to investigate the multiplicity of cycles whose length equals the girth in bi-regular bipartite graphs, using the spectrum and degree distribution of the graph. Although many algorithms existed for finding the number of cycles, they preferred a computational approach. Dehghan and Banihashemi counted the number of cycles of length $g+2$ and $g+4$, where $G$ is a bi-regular bipartite graph and $g$ is the girth of $G$. However, they proposed only a descriptive technique for computing the multiplicity of cycles of length less than $2g$ in bi-regular bipartite graphs. In this paper, we find the number of cycles of length less than $2g$ by using the spectrum and degree distribution of a bi-regular bipartite graph, such that the formula depends only on the partitions of positive integers and the number of closed cycle-free walks from a vertex of the bi-regular bipartite graph.
Submitted 4 August, 2018; v1 submitted 1 August, 2018;
originally announced August 2018.
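The counting idea in the abstract above (closed walks obtained from the spectrum, minus closed cycle-free walks) can be illustrated in its smallest case, cycles of length 4. This is a minimal sketch and not the paper's formula: the helper names `complete_bipartite`, `matmul`, and `count_4_cycles` are hypothetical, and the example uses the trace of $A^4$ directly, which equals the sum of the fourth powers of the eigenvalues. Closed walks of length 4 that retrace their own edges are counted from the degrees alone and subtracted; each remaining 4-cycle is counted 8 times (4 starting vertices, 2 directions).

```python
def complete_bipartite(m, n):
    """Adjacency matrix of K_{m,n}, an (n, m)-bi-regular bipartite graph."""
    size = m + n
    adj = [[0] * size for _ in range(size)]
    for i in range(m):
        for j in range(m, size):
            adj[i][j] = adj[j][i] = 1
    return adj


def matmul(a, b):
    """Plain-Python product of two square integer matrices."""
    n = len(a)
    return [
        [sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


def count_4_cycles(adj):
    """Count 4-cycles of a simple graph from closed walks of length 4."""
    n = len(adj)
    a2 = matmul(adj, adj)
    a4 = matmul(a2, a2)
    # trace(A^4) = total number of closed walks of length 4
    closed_walks = sum(a4[v][v] for v in range(n))
    deg = [sum(row) for row in adj]
    # Cycle-free closed walks from v: v-a-v-b-v gives deg(v)^2 walks;
    # v-a-c-a-v with c != v gives deg(a) - 1 walks per neighbor a.
    backtracking = sum(
        deg[v] ** 2 + sum(deg[u] - 1 for u in range(n) if adj[v][u])
        for v in range(n)
    )
    return (closed_walks - backtracking) // 8


if __name__ == "__main__":
    print(count_4_cycles(complete_bipartite(3, 3)))
    print(count_4_cycles(complete_bipartite(2, 3)))
```

For complete bipartite graphs the result can be checked by hand, since every choice of two vertices on each side forms exactly one 4-cycle. Longer cycle lengths need the more involved bookkeeping of cycle-free walks that the paper addresses, which this sketch does not attempt.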