Introducing Code-Mixed Chain-of-Thought — Teaching Gemma 4 31B to reason bilingually cut thinking tokens by 40% [Mnemic Glorious 31B]
📰 Reddit r/deeplearning
submitted by /u/superman_27 [link] [comments]
submitted by /u/superman_27 [link] [comments]