Content is user-generated and unverified.

Claude Code Usage Limits & Pricing

Cost Overview

MetricValue
Average Cost per Developer~$100-200/month with Sonnet 4
Daily Average$6 per developer
90th Percentile Daily CostBelow $12 per developer
Background UsageUnder $0.04 per session

Rate Limit Recommendations (Token Per Minute)

Team SizeTPM per UserExample Total TPM
1-5 users200k-300k200k-1.5M
5-20 users100k-150k500k-3M
20-50 users50k-75k1M-3.75M
50-100 users25k-35k1.25M-3.5M
100-500 users15k-20k1.5M-10M
500+ users10k-15k5M+

Note: Rate limits apply at the organization level, not per individual user. Individual users can temporarily consume more than their calculated share when others aren't actively using the service.

Pricing Model

AspectDetails
Billing MethodPay per API token consumption
Subscription PlansPro and Max plans include usage
EnterpriseContact sales for custom pricing

Cost Management Features

FeatureDescription
Session TrackingUse /cost command to see current session usage
Workspace LimitsSet maximum monthly spend per workspace (Anthropic API only)
Usage ReportsAvailable in Anthropic Console for Admins/Billing roles
Cloud ProvidersBedrock/Vertex require third-party tools like LiteLLM for metrics

Default Model Configuration

Model TypeDefault ModelToken Limits
Main ModelClaude Sonnet 4Standard API limits apply
Small/Fast ModelClaude HaikuUsed for background tasks
Max Output Tokens4096Typical usage without extended thinking
Max Thinking Tokens1024For extended reasoning

Factors Affecting Token Usage

FactorImpact
Codebase SizeLarger codebases = more tokens for analysis
Query ComplexityComplex tasks require more tokens
File OperationsNumber of files searched/modified affects usage
Conversation LengthLonger conversations consume more context
Auto-compactTriggers at 95% context capacity
Background Processes~1 cent/day for haiku generation

Cost Optimization Tips

  1. Use Compact Conversations
    • Auto-compact enabled by default
    • Manual compact with /compact command
    • Custom compaction instructions in CLAUDE.md
  2. Write Specific Queries
    • Avoid vague requests that trigger unnecessary scanning
    • Break down complex tasks into focused interactions
  3. Clear History
    • Use /clear between unrelated tasks
    • Reduces context window usage
  4. Team Deployment
    • Start with small pilot group
    • Establish usage patterns before wider rollout

Platform-Specific Considerations

PlatformAuthenticationCost Tracking
Anthropic APIAPI KeyBuilt-in Console metrics
AWS BedrockAWS Credentials/SSOUse LiteLLM or AWS CloudWatch
Google Vertex AIGoogle Cloud AuthUse LiteLLM or GCP monitoring

Important Notes

  • Claude Code is currently in beta
  • Usage can vary significantly based on how many instances users run and automation usage
  • For high-volume or enterprise use cases, contact Anthropic sales for custom arrangements
  • Rate limits are shared across all models in the organization
Content is user-generated and unverified.
    Claude Code Usage Limits & Pricing | Claude