What happens if I reach my concurrency limit?


Async speech-to-text

When you reach your concurrency limit, new requests are automatically queued. As soon as a current transcription completes, the next job in the queue begins processing. All transcripts will be processed, though queued jobs may take longer than usual.

For example, if your concurrency limit is 200 and you submit 201 transcription requests, the first 200 begin processing immediately. The 201st waits until one of the previous 200 has finished before it starts processing. You will also receive a throttle alert email.
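The queueing behavior in the example above can be illustrated with a small simulation. This is only a sketch of the described behavior, not AssemblyAI's actual scheduler: the function name `simulate_queue`, the first-in-first-out ordering, and the 30-second job duration are assumptions made for illustration.

```python
import heapq

def simulate_queue(durations, concurrency_limit):
    """Return (start, end) times for jobs that are all submitted at time 0.

    Up to `concurrency_limit` jobs run at once; the rest wait, in submission
    order (an assumption), for a running job to finish.
    """
    running = []   # min-heap holding the end times of jobs being processed
    schedule = []
    for duration in durations:
        if len(running) < concurrency_limit:
            start = 0.0  # a slot is free, so the job starts immediately
        else:
            # Queued: start when the earliest-finishing job frees its slot.
            start = heapq.heappop(running)
        end = start + duration
        heapq.heappush(running, end)
        schedule.append((start, end))
    return schedule

# 201 jobs of 30 seconds each (illustrative duration), limit of 200.
schedule = simulate_queue([30.0] * 201, concurrency_limit=200)
print(schedule[199])  # (0.0, 30.0)  -> the 200th job starts immediately
print(schedule[200])  # (30.0, 60.0) -> the 201st waits for a free slot
```

In this sketch the 201st job still completes; it simply starts later, which matches the statement that queued jobs may take longer than usual.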

Streaming

When you reach your streaming concurrency limit, new requests will receive a 1008 error with the message “Unauthorized connection: Too many concurrent sessions”.
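One way a client might react to this error is to retry with exponential backoff. The sketch below is deliberately independent of any SDK: `SessionRejected`, `open_with_backoff`, and the `open_session` callable are hypothetical names, and you would need to adapt them to however your WebSocket client surfaces the close code and message.

```python
import time

TOO_MANY_SESSIONS_CODE = 1008  # close code quoted in the text above
TOO_MANY_SESSIONS_TEXT = "Too many concurrent sessions"

class SessionRejected(Exception):
    """Hypothetical error carrying a WebSocket close code and reason."""
    def __init__(self, code, reason=""):
        super().__init__(f"{code}: {reason}")
        self.code = code
        self.reason = reason

def open_with_backoff(open_session, max_attempts=5, base_delay=1.0,
                      sleep=time.sleep):
    """Call `open_session` until it succeeds or attempts run out.

    Only the concurrency-limit rejection is retried; any other error,
    or the final failed attempt, is re-raised to the caller.
    """
    for attempt in range(max_attempts):
        try:
            return open_session()
        except SessionRejected as err:
            is_limit_error = (err.code == TOO_MANY_SESSIONS_CODE
                              and TOO_MANY_SESSIONS_TEXT in err.reason)
            if not is_limit_error or attempt == max_attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)  # waits 1s, 2s, 4s, ...
```

Passing `sleep` as a parameter keeps the function easy to test; in normal use the default `time.sleep` applies.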

Note: Our Streaming STT feature includes automatically scaling concurrency. Whenever you are using 70% or more of your streaming concurrency limit, the limit automatically increases by 10% every 60 seconds.
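To get a rough feel for how quickly the limit can grow, the arithmetic can be sketched as follows. The note does not say whether each 10% increase is applied to the current limit or to the original one, nor how fractional values are rounded; this sketch assumes the increase compounds on the current limit and leaves the result unrounded. The function names and the starting limit of 100 are illustrative only.

```python
def scaling_applies(active_sessions, limit, threshold=0.70):
    """True when usage is at or above the 70% threshold from the note."""
    return active_sessions >= threshold * limit

def projected_limit(initial_limit, intervals, growth=0.10):
    """Limit after `intervals` consecutive 60-second periods of high usage.

    Assumes (not confirmed by the source) that each 10% increase
    compounds on the current limit.
    """
    limit = float(initial_limit)
    for _ in range(intervals):
        limit *= 1 + growth
    return limit

print(scaling_applies(70, 100))   # True: 70 of 100 sessions is exactly 70%
print(scaling_applies(69, 100))   # False: below the threshold
print(projected_limit(100, 3))    # roughly 133.1 under the compounding assumption
```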

You can find more information on concurrency limits for Pre-Recorded STT and Streaming STT in our documentation.

Need a higher concurrency limit?

We offer custom concurrency limits that scale to support any workload at no additional cost. If you need a higher concurrency limit, please either contact our Sales team or reach out to us at support@assemblyai.com.