MoleCode unlocks structural intelligence in large language models

📰 ArXiv cs.AI

arXiv:2605.16480v1 Announce Type: cross Abstract: Molecules are graphs, but large language models~(LLMs) are usually asked to reason about them through linear strings. The most popular molecular representation, SMILES, compresses atoms, bonds, branches and rings into a compact sequence in which topology is implicit, forcing LLMs to reconstruct molecular structure before performing the requested chemical operation. Here we introduce MoleCode, an LLM-native, training-free, graph-explicit molecular

Published 19 May 2026

Read full paper → ← Back to Reads