KOMBO: Korean Character Representations Based on the Combination Rules of Subcharacters
📰 ArXiv cs.AI
arXiv:2604.23948v1 Announce Type: cross Abstract: The Korean writing system, \textit{Hangeul}, has a unique character representation rigidly following the invention principles recorded in \textit{Hunminjeongeum}.\footnote{\textit{Hunminjeongeum} is a book published in 1446 that describes the principles of invention and usage of \textit{Hangeul}, devised by King Sejong \cite{Hunminjeongeum_Guide}.} However, existing pre-trained language models (PLMs) for Korean have overlooked these principles. I
DeepCamp AI