A 2-million token context window — large enough to ingest eight full-length novels simultaneously — shipped as a standard feature of Google's Gemini 3.1 Ultra when the model launched in April 2026, doubling the previous production benchmark and setting a new operational standard for enterprise AI deployments. No competing model at general availability matches it.
Context windows are the working memory of a language model during a session: every additional token allows the system to hold more information active at once. Gemini 2.0 Ultra, released in late 2025, carried a 1-million token context. OpenAI's GPT-5, which launched in March 2026, offers 256,000 tokens as its standard consumer tier. The doubling to 2 million is not primarily a consumer feature — most individuals will never write a 2-million token prompt — but for enterprise use cases including legal discovery, medical record synthesis, software code audits, and long-form financial analysis, the expanded capacity is practically significant and changes what tasks can be completed in a single session without truncation.
Continue reading to see the full article