Nvidia unveils new GPU designed for long-context inference
Nvidia (US:NVDA) TechCrunch · 2025-09-10 00:35

In Brief

At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens. Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated inference” infrastructure approach. For users, the result will be better performance on long-context tasks like video generation or software development. Nvidia’s relentless de ...