ViSymRe: Vision-guided Multimodal Symbolic Regression

Li, Da; Yin, Junping; Xu, Jin; Li, Xinxin; Zhang, Juan

Computer Science > Machine Learning

arXiv:2412.11139 (cs)

[Submitted on 15 Dec 2024 (v1), last revised 2 Sep 2025 (this version, v2)]

Title:ViSymRe: Vision-guided Multimodal Symbolic Regression

Authors:Da Li, Junping Yin, Jin Xu, Xinxin Li, Juan Zhang

View PDF HTML (experimental)

Abstract:Extracting simple mathematical expression from an observational dataset to describe complex natural phenomena is one of the core objectives of artificial intelligence (AI). This field is known as symbolic regression (SR). Traditional SR models are based on genetic programming (GP) or reinforcement learning (RL), facing well-known challenges, such as low efficiency and overfitting. Recent studies have integrated SR with large language models (LLMs), enabling fast zero-shot inference by learning mappings from millions of dataset-expression pairs. However, since the input and output are inherently different modalities, such models often struggle to converge effectively. In this paper, we introduce ViSymRe, a vision-guided multimodal SR model that incorporates the third resource, expression graph, to bridge the modality gap. Different from traditional multimodal models, ViSymRe is trained to extract vision, termed virtual vision, from datasets, without relying on the global availability of expression graphs, which addresses the essential challenge of visual SR, i.e., expression graphs are not available during inference. Evaluation results on multiple mainstream benchmarks show that ViSymRe achieves more competitive performance than the state-of-the-art dataset-only baselines. The expressions predicted by ViSymRe not only fit the dataset well but are also simple and structurally accurate, goals that SR models strive to achieve.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Symbolic Computation (cs.SC)
Cite as:	arXiv:2412.11139 [cs.LG]
	(or arXiv:2412.11139v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.11139

Submission history

From: Da Li [view email]
[v1] Sun, 15 Dec 2024 10:05:31 UTC (3,489 KB)
[v2] Tue, 2 Sep 2025 13:41:15 UTC (3,086 KB)

Computer Science > Machine Learning

Title:ViSymRe: Vision-guided Multimodal Symbolic Regression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ViSymRe: Vision-guided Multimodal Symbolic Regression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators