Showing 1–1 of 1 results for author: Dey, V

Search v0.5.6 released 2020-02-24

arXiv:2502.13398 [pdf, other]

cs.LG cs.AI cs.CL physics.chem-ph q-bio.QM

GeLLMO: Generalizing Large Language Models for Multi-property Molecule Optimization

Authors: Vishal Dey, Xiao Hu, Xia Ning

Abstract: Despite recent advancements, most computational methods for molecule optimization are constrained to single- or double-property optimization tasks and suffer from poor scalability and generalizability to novel optimization tasks. Meanwhile, Large Language Models (LLMs) demonstrate remarkable out-of-domain generalizability to novel tasks. To demonstrate LLMs' potential for molecule optimization, we… ▽ More Despite recent advancements, most computational methods for molecule optimization are constrained to single- or double-property optimization tasks and suffer from poor scalability and generalizability to novel optimization tasks. Meanwhile, Large Language Models (LLMs) demonstrate remarkable out-of-domain generalizability to novel tasks. To demonstrate LLMs' potential for molecule optimization, we introduce MuMOInstruct, the first high-quality instruction-tuning dataset specifically focused on complex multi-property molecule optimization tasks. Leveraging MuMOInstruct, we develop GeLLMOs, a series of instruction-tuned LLMs for molecule optimization. Extensive evaluations across 5 in-domain and 5 out-of-domain tasks demonstrate that GeLLMOs consistently outperform state-of-the-art baselines. GeLLMOs also exhibit outstanding zero-shot generalization to unseen tasks, significantly outperforming powerful closed-source LLMs. Such strong generalizability demonstrates the tremendous potential of GeLLMOs as foundational models for molecule optimization, thereby tackling novel optimization tasks without resource-intensive retraining. MuMOInstruct, models, and code are accessible through https://github.com/ninglab/GeLLMO. △ Less

Submitted 27 May, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

Comments: Accepted to ACL Main 2025. Vishal Dey and Xiao Hu contributed equally to this paper

Search v0.5.6 released 2020-02-24