Claude Limits (Sessions, Usage, Context, Memory, and Others)

claude limits

If you’ve ever been in the middle of a solid Claude session and suddenly hit a wall (“you’ve reached your usage limit”), you know how disorienting it feels. No countdown. No warning. Just a hard stop.

The problem isn’t just the limits themselves. It’s that they’re genuinely confusing. Claude doesn’t give you a fixed number of messages per day. The limits shift based on what you’re doing, which model you’re using, and how long your messages are. And the official documentation is scattered across a dozen different help center articles.

This guide pulls it all together. What the limits actually are, how they work behind the scenes, and what you can do to stop hitting them at the worst possible time.

Two types of Claude limits, and they’re very different things

Before getting into the numbers, it helps to understand that Claude has two separate limits that operate independently.

Limit typeWhat it controlsWhat it feels like
Usage limitHow many messages you can send over time“You’ve reached your limit” wall
Length limit (context window)How much Claude can hold in working memory per conversationClaude “forgetting” earlier parts of the chat

You can hit either one independently, and they need different fixes. Running out of messages is a usage limit problem. Claude going vague or forgetting context is a context window problem.

Section 1: Message limits and how the 5-hour window actually works

The most common misconception about Claude’s limits: people assume they reset at midnight. They don’t.

Claude uses a rolling 5-hour window. If you send your first message at 9:00 AM and hit your limit, your capacity starts replenishing at 2:00 PM, not at midnight. You’re never more than 5 hours away from a full reset, regardless of when you started.

Message limits by plan

PlanMessages per 5-hour windowPeak-hour throttlingPrice
Free~15-40 (dynamic)YesFree
Pro~50-100No$20/mo
Max 5x~225No$100/mo
Max 20x~900+No$200/mo
TeamDoubled since May 2026No$30/user/mo
EnterpriseCustomNoCustom

These are estimates. Actual limits vary by message length, model, and features used.

Note: The free plan was explicitly excluded from Anthropic’s May 2026 rate limit increases. Paid plan ceilings went up substantially. Free limits stayed the same.

What actually burns through your limit faster

The message count shown in the UI is a simplification. Claude actually meters by tokens. Every prompt, file attachment, tool definition, and line of conversation history draws from the same quota.

ActionToken impact
Short questionLow
Pasting a long documentHigh
Extended thinking / high effort modeHigh
Uploading a fileMedium to high depending on size
Using web search or MCP connectorsAdditional overhead per use

Watch out: claude.ai, Claude Code, and Claude Desktop all draw from the same usage pool. Burn through a long Claude Code session in the morning and you’ll have less left for browser work in the afternoon.

claude settings for usage

On paid plans, check Settings > Usage to see how much of your 5-hour session and weekly limits you’ve consumed.

Section 2: Context window limits and Claude’s working memory

The context window is Claude’s working memory for a conversation. Everything lives in it: your messages, Claude’s responses, uploaded files, and any instructions you gave. When it fills up, earlier content starts falling out.

Context window by model

ModelContext windowMax output per response
Claude Opus 4.71,000,000 tokens128,000 tokens
Claude Sonnet 4.61,000,000 tokens64,000 tokens
Claude Haiku 4.5200,000 tokensVaries
Enterprise (some models)500,000 tokensVaries

To make that concrete: 1M tokens is roughly 750,000 words of English text, about 10 full-length novels or 1,500 average blog posts.

Important caveat: The 1M context window applies to the API and specific surfaces. According to Anthropic’s official help center, the context window on claude.ai is 200K tokens for all models and paid plans (except Enterprise). The model you’re using and where you’re accessing it both matter.

What happens when you approach the limit

For users with code execution enabled, Claude automatically manages long conversations. When your conversation approaches the limit, Claude summarizes earlier messages to continue seamlessly. Your full chat history is preserved even after summarization.

Tip: If you see Claude say it’s “organizing its thoughts” mid-conversation, that’s automatic context management working. It’s compressing earlier context, not losing your work. But these longer conversations do consume more of your usage limit, so starting a fresh chat is often the smarter move.

Section 3: Projects and memory and what actually persists

claude memory and files for project

“Memory” means different things in different parts of Claude. Here’s how to keep them straight.

Projects

FeatureFreePro / Max / Team / Enterprise
Number of projectsUp to 5Unlimited
Upload files to knowledge baseYesYes
Set project instructionsYesYes
Memory scoped to projectYesYes

Projects are self-contained workspaces. You upload documents, write instructions for how Claude should behave, and every conversation inside that project references those files automatically. Claude uses retrieval-augmented generation (RAG) to load only the relevant content into the context window at any given time, which is more efficient than pasting everything in manually.

Memory from chat history

Claude automatically synthesizes key insights from your past conversations and updates this every 24 hours. That synthesis carries into every new standalone conversation you start.

Note: Each project has its own separate memory space. What Claude learns inside one project stays in that project and does not bleed into other projects or your standalone chats.

Memory from chat history is available on all plans including free. Toggle it under Settings > Capabilities.

What does NOT persist

Common misconception: Memory does not mean Claude remembers every conversation verbatim. It synthesizes patterns and key information across your history. If you need Claude to reliably reference something specific (a client brief, a style guide, a set of instructions), put it in a Project knowledge base. That’s the only reliable way to make it always available.

claude memory and capabilities

Section 4: API limits for developers and builders

Skip this section if you’re using Claude through claude.ai. This part is for developers accessing Claude programmatically.

The API meters everything by tokens, not messages. Every prompt, file, tool call, and line of conversation history draws from the same quota.

API tiers and rate limits (post May 2026)

TierMinimum spendInput tokens/minOutput tokens/minRequests/min
FreeNone~Low~Low~5
Tier 1$5 deposit500,00080,00050
Tier 2Varies2,000,000Varies1,000
Tier 3Varies5,000,000Varies2,000
Tier 4$400+10,000,000Varies4,000

Tier 1 input tokens jumped from 30,000 to 500,000 per minute after the May 2026 expansion. Output tokens went from 8,000 to 80,000, a 10x increase.

Tip for API users: Cached tokens from prompt caching don’t count toward your ITPM limits. If you’re sending the same large system prompt repeatedly, prompt caching can give you 5-10x more effective throughput without upgrading your tier.

The API uses a token bucket algorithm, meaning capacity replenishes continuously rather than resetting at fixed intervals. Same principle as the 5-hour rolling window on claude.ai.

Section 5: Practical workarounds to stop hitting walls

Most limit problems are fixable with habit changes, not plan upgrades.

Start a new conversation instead of continuing an old one

Long conversations accumulate context with every turn. Starting fresh resets that pressure. If a conversation is going long and responses are getting worse, don’t push through. Start a new chat and paste in just the relevant context.

Tip: In Claude Code, run /clear to reset context without ending your session entirely.

Use Projects for recurring work

If you’re working on the same client, product, or codebase repeatedly, put your instructions and key documents in a Project. Claude will reference them every time without you needing to re-paste anything.

Keep project instructions short

Warning: Verbose project instructions eat into your context window before you’ve even started working. Keep instructions focused on the essentials. Reserve task-specific detail for the chat itself. If your project instructions run past 500 words, trim them.

Turn off tools you’re not using

Web search, extended thinking, and MCP connectors all consume tokens on top of your actual message. Disable anything you don’t need for the task at hand.

Check your usage before a big session

On paid plans, go to Settings > Usage before starting intensive work. If you’re already deep into your 5-hour window, waiting for a reset costs nothing and saves a lot of frustration mid-task.

Break large tasks into sessions

Don’t try to process a 50-page document or run a complex multi-step task in one conversation. Break it into logical chunks across separate sessions. You’ll get sharper responses and you won’t lose progress to a limit hitting at the wrong moment.

Quick reference: all limits at a glance

FreeProMax 5xMax 20xEnterprise
Messages per 5 hrs~15-40 (dynamic)~50-100~225~900+Custom
Context window (claude.ai)200K200K200K200K500K (some models)
ProjectsUp to 5UnlimitedUnlimitedUnlimitedUnlimited
Memory from chat historyYesYesYesYesYes (admin-controlled)
Peak-hour throttlingYesNoNoNoNo
PriceFree$20/mo$100/mo$200/moCustom

Verify current pricing at anthropic.com/pricing before publishing. Plans update frequently.

The honest bottom line

Claude’s limits aren’t arbitrary. Running a model at this scale is compute-intensive, and Anthropic is balancing access across millions of users. That said, most of the friction people experience comes from misunderstanding how the limits work, not from the limits themselves.

The rolling 5-hour window is more flexible than a daily cap. Projects solve the “Claude forgot everything” problem that frustrates most regular users. And for heavy users, the Max plans have meaningfully higher ceilings than they did six months ago.

If you’re hitting limits daily on free, Pro is worth it. If you’re burning through Pro regularly, check whether you’re carrying long conversation histories you don’t need. That’s usually the fix before spending more money.

Want help building a content strategy that actually drives pipeline? Book a call with us or read more on the LymLyt blog.

Related reading:

Categories: ,

Leave a Reply

Your email address will not be published. Required fields are marked *

Index