Cohere's first developer coding model is a 30B mixture-of-experts running on a single H100 with 256K context length.
UIUC and Chroma's Harness-1 is a 20B retrieval subagent trained with reinforcement learning inside a stateful search harness. The harness maintains the bookkeeping — candidate pool, importance-tagged ...