For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DiscordGet API Key
  • Getting Started
    • Overview
    • Quickstart
    • Errors
    • Pricing
  • Chat
    • Overview
    • Chat with Image, Video, and Audio
    • Function Calling
    • Models
  • Vision
    • Overview
    • Rate Limits
    • Pricing
    • MCP Server
    • Video Management
    • Video Group Management
    • Video Search
    • Video QA
    • Clip Generation
    • Metadata Tagging
    • Image Management
    • Image Search
  • Research
    • Overview
    • Streaming
    • Reasoning Steps
    • Web Search
    • Structured Output
    • Parallel Thinking
    • Best Practices
    • Errors
    • Examples
  • Speech
    • Overview
    • Audio Transcription
    • Speech Translation
    • Speech-to-Speech Translation
  • Resources
    • FAQs
    • Changelog
    • System Status
LogoLogo
DiscordGet API Key
On this page
  • Request Limits
  • Rate Limit Headers
  • Example
  • Handling 429 Errors
  • Higher Limits
Vision

Rate Limits

Was this page helpful?
Previous

Vision API Pricing

Next
Built with

Rate limits ensure fair usage and service stability for all customers. When you exceed a limit, the API returns a 429 Too Many Requests error.

Request Limits

All limits are per API key, per 24-hour rolling window.

EndpointRequests per Day
POST /v1/images/upload100
POST /v1/images/search100
POST /v1/videos/upload50
POST /v1/videos/search50
POST /v1/clips10

Rate Limit Headers

Every response includes headers to help you monitor your usage:

HeaderDescription
x-ratelimit-requests-limitMaximum requests allowed in the time window
x-ratelimit-requests-remainingRequests remaining in the current window
x-ratelimit-requests-resetSeconds until the rate limit window resets
retry-afterSeconds to wait before retrying (0 if allowed, positive if rate limited)

Example

HTTP/1.1 429 Too Many Requests
x-ratelimit-requests-limit: 100
x-ratelimit-requests-remaining: 0
x-ratelimit-requests-reset: 86400
retry-after: 3600

Handling 429 Errors

When you hit a rate limit:

  1. Check X-RateLimit-Reset to know when you can retry
  2. Implement exponential backoff in your application
  3. Consider batching operations where possible
1{
2 "error": {
3 "code": "rate_limit_exceeded",
4 "message": "You have exceeded your daily request limit for this endpoint."
5 }
6}

Higher Limits

Enterprise plans offer significantly higher limits. Contact us if you need:

  • Higher request quotas
  • Increased content size limits

For pricing information, see the Vision Pricing page.