Context window errors occur when your conversation plus file contents exceeds the model's token limit. This guide explains how to diagnose, prevent, and resolve these errors in OpenAI Codex CLI.
## Understanding Context Windows
A context window is the total amount of text a language model can process at once, measured in tokens. One token corresponds to roughly 4 characters, or about 0.75 words, of English text.
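As a rough sanity check, you can estimate a file's token count from its character count using that 4-characters-per-token rule of thumb. A minimal sketch with standard Unix tools (`src/auth.ts` is a placeholder path):

```bash
# Estimate tokens from character count (~4 characters per token in English).
# `src/auth.ts` is a placeholder; point this at any file you plan to reference.
wc -c < src/auth.ts | awk '{printf "~%d tokens\n", $1 / 4}'
```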
Common model context limits:
| Model | Context Window | Approximate Words |
|---|---|---|
| GPT-4o | 128K tokens | ~96,000 words |
| GPT-4 Turbo | 128K tokens | ~96,000 words |
| o1 | 200K tokens | ~150,000 words |
While these numbers seem large, they fill up quickly when:
- Reading multiple source files
- Maintaining conversation history
- Including system prompts and tool outputs
## How Codex CLI Counts Tokens
Codex CLI tracks token usage across several components:
- **System prompt** - The instructions that define Codex's behavior
- **Conversation history** - Your prompts and Codex's responses
- **File contents** - Any files Codex reads or you reference
- **Tool outputs** - Results from commands Codex executes
When combined, these can quickly approach the context limit, especially in longer sessions or when working with large files.
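Before starting a session, you can estimate how much of the budget your source files alone would consume. A sketch assuming a TypeScript project under `src/` (adjust the pattern for your language):

```bash
# Sum the size of all TypeScript sources and convert to an approximate
# token count using the ~4 characters/token heuristic.
find src -name '*.ts' -print0 | xargs -0 cat | wc -c \
  | awk '{printf "~%d tokens across matched files\n", $1 / 4}'
```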
## Recognizing Context Errors
Context window errors typically appear as:

```
Error: Context length exceeded. Maximum context length is 128000 tokens.
Your messages resulted in 142567 tokens.
```

Or you may see warnings like:

```
Warning: Approaching context limit (95% used). Consider starting a new session.
```
Sometimes the error manifests as Codex "forgetting" earlier parts of the conversation or providing incomplete responses.
## Creating Effective .codexignore Files
The `.codexignore` file prevents Codex from reading unnecessary files. Create it in your project root:

```
# .codexignore

# Dependencies (often massive)
node_modules/
vendor/
.venv/
__pycache__/

# Build outputs
dist/
build/
.next/
out/

# Large binary files
*.jpg
*.png
*.gif
*.pdf
*.zip
*.tar.gz

# Generated files
*.min.js
*.min.css
*.map
package-lock.json
yarn.lock
pnpm-lock.yaml

# Test coverage
coverage/
.nyc_output/

# Large data files
*.csv
*.json
data/
fixtures/
```
**Pro tip:** Start with aggressive exclusions and add exceptions as needed:

```
# Exclude everything in data/
data/

# But include config files
!data/config.json
```
## Using Context Management Commands
Codex CLI provides commands to manage context:
### `/compact` - Summarize History
The `/compact` command compresses your conversation history into a summary:

```
/compact
```
This preserves the key decisions and context while reducing token count significantly. Use it when:
- You have been working for a while and context is building up
- You are switching to a different task within the same session
- You see warnings about approaching context limits
### Starting Fresh Sessions
For new tasks, start a fresh session rather than continuing an old one:
```bash
# End current session
exit

# Start new session
codex
```
This clears all conversation history and file context.
### Referencing Specific Files
Instead of letting Codex scan your codebase, explicitly reference only needed files:
```bash
codex -f src/auth.ts "explain the authentication flow"
```
This is more efficient than:
codex "look at the auth system and explain it"
## Strategies for Large Codebases
When working with large projects, use these strategies:
### 1. Work in Focused Directories
Navigate to the specific area you need:
```bash
cd src/components/dashboard
codex "refactor the chart component"
```
### 2. Use Multiple Sessions
Break large tasks into smaller, focused sessions:
- Session 1: Understand the current architecture
- Session 2: Plan the changes
- Session 3: Implement component A
- Session 4: Implement component B
### 3. Leverage Summary Files
Create architecture docs that Codex can read instead of scanning all files:

```bash
codex "read ARCHITECTURE.md and help me add a new API endpoint"
```
### 4. Exclude Test Files During Implementation
Add test files to `.codexignore` while implementing, then remove them when writing tests:

```
# Temporarily exclude tests
**/*.test.ts
**/*.spec.ts
```
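If you toggle these entries often, a small shell helper can comment them out and back in. A sketch, assuming the two patterns appear verbatim in `.codexignore`:

```bash
# Comment out the test exclusions (re-enables reading tests).
# Keeps a .bak backup; works with both GNU and BSD sed.
sed -i.bak -e 's|^\*\*/\*\.test\.ts|# &|' -e 's|^\*\*/\*\.spec\.ts|# &|' .codexignore
```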
## Model Comparison for Context
If you frequently hit context limits, consider the right tool for the job:
| Tool | Context Window | Best For |
|---|---|---|
| Codex CLI (GPT-4o) | 128K tokens | Standard development tasks |
| Gemini CLI | 1M tokens | Large codebase exploration |
| Claude Code | 200K tokens | Multi-file refactoring |
For very large codebases, Gemini CLI's 1M-token context window offers roughly 8x the capacity of Codex. See our guide on installing Gemini CLI.
### When to Switch Tools
Consider switching tools when:
- **Single files exceed limits** - Use a tool with larger context or split the file
- **You need full codebase awareness** - Gemini CLI with 1M context handles this better
- **Complex multi-file refactoring** - Claude Code excels at maintaining coherence across files
## Troubleshooting
### Error Persists After `/compact`
If context errors continue after compacting:
- Exit and start a fresh session
- Check your `.codexignore` for missing exclusions (see the sketch below)
- Reference fewer files explicitly
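To spot directories worth excluding, list the largest ones in your project root. A sketch using standard Unix tools, with no Codex-specific flags involved:

```bash
# The largest top-level directories are the usual .codexignore candidates.
du -sh ./*/ | sort -rh | head -10
```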
### Codex Forgets Previous Instructions
This often indicates context is being silently truncated:
- Use `/compact` to create an explicit summary
- Re-state important context in your next prompt
- Consider using a tool with larger context
### Large File Cannot Be Read
If a single file exceeds limits:
- Ask Codex to read specific sections: "Read lines 1-500 of large-file.ts" (see the sketch after this list)
- Split the file into smaller modules
- Create a summary document describing the file's structure
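For the first option, you can also extract the slice yourself so only that portion ever enters the context. A sketch, where `large-file.ts` is a placeholder and the `codex` invocation mirrors the examples above:

```bash
# Copy lines 1-500 into a temporary file and reference only that slice.
sed -n '1,500p' large-file.ts > /tmp/large-file.part1.ts
codex "explain the code in /tmp/large-file.part1.ts"
```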
## Next Steps
- Learn about session management in Codex CLI
- Explore Gemini CLI for 1M token context
- Read about orchestrating multiple AI tools for optimal workflows