Chain-of-Draft (CoD), introduced by Xu et al. at MIT/Stanford, asks the model to produce terse 'drafts' of reasoning steps rather than full natural-language chains.
On GSM8K, MATH, and BIG-Bench Hard, CoD matched chain-of-thought accuracy while emitting 70% fewer tokens — a meaningful cost win for production deployments.