Entropy-Reinforced Planning with Large Language Models for Drug Discovery

Liu, Xuefeng; Tien, Chih-chan; Ding, Peng; Jiang, Songhao; Stevens, Rick L.

Computer Science > Machine Learning

arXiv:2406.07025 (cs)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 11 Jun 2024 (v1), last revised 29 Mar 2025 (this version, v2)]

Title:Entropy-Reinforced Planning with Large Language Models for Drug Discovery

Authors:Xuefeng Liu, Chih-chan Tien, Peng Ding, Songhao Jiang, Rick L. Stevens

View PDF

Abstract:The objective of drug discovery is to identify chemical compounds that possess specific pharmaceutical properties toward a binding target. Existing large language models (LLMS) can achieve high token matching scores in terms of likelihood for molecule generation. However, relying solely on LLM decoding often results in the generation of molecules that are either invalid due to a single misused token, or suboptimal due to unbalanced exploration and exploitation as a consequence of the LLMs prior experience. Here we propose ERP, Entropy-Reinforced Planning for Transformer Decoding, which employs an entropy-reinforced planning algorithm to enhance the Transformer decoding process and strike a balance between exploitation and exploration. ERP aims to achieve improvements in multiple properties compared to direct sampling from the Transformer. We evaluated ERP on the SARS-CoV-2 virus (3CLPro) and human cancer cell target protein (RTCB) benchmarks and demonstrated that, in both benchmarks, ERP consistently outperforms the current state-of-the-art algorithm by 1-5 percent, and baselines by 5-10 percent, respectively. Moreover, such improvement is robust across Transformer models trained with different objectives. Finally, to further illustrate the capabilities of ERP, we tested our algorithm on three code generation benchmarks and outperformed the current state-of-the-art approach as well. Our code is publicly available at: this https URL.

Comments:	Published in ICML2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
Cite as:	arXiv:2406.07025 [cs.LG]
	(or arXiv:2406.07025v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.07025

Submission history

From: Xuefeng Liu [view email]
[v1] Tue, 11 Jun 2024 07:29:13 UTC (1,075 KB)
[v2] Sat, 29 Mar 2025 07:27:37 UTC (1,075 KB)

Computer Science > Machine Learning

Title:Entropy-Reinforced Planning with Large Language Models for Drug Discovery

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Entropy-Reinforced Planning with Large Language Models for Drug Discovery

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators