Cerebras WSE-3: wafer-scale SRAM for fast-token inference — strengths, tradeoffs, and the OpenAI capacity deal | Raisolo