For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
PlaygroundChangelogSign In
OverviewAPI ReferencePre-recorded STTStreaming STTVoice AgentsSpeech UnderstandingGuardrailsLLM GatewayFAQ
OverviewAPI ReferencePre-recorded STTStreaming STTVoice AgentsSpeech UnderstandingGuardrailsLLM GatewayFAQ
  • Getting started
    • Overview
    • Apply LLM Gateway to pre-recorded audio
    • Apply LLM Gateway to streaming audio
    • Specify fallback models
    • Prompt caching
    • Post-processing
    • Cloud endpoints & data residency
    • Troubleshooting
  • Use cases
    • Ask questions about your audio data
    • Build agentic workflows
    • Basic chat completions
    • Multi-turn conversations
    • Use tool calling with LLMs
    • Get structured JSON outputs
  • Guides
LogoLogo
PlaygroundChangelogSign In
On this page
  • Default Endpoint
  • EU Data Residency
  • Endpoints
  • EU model availability
  • Which endpoint should I use?
  • How to use it
Getting started

Cloud Endpoints and Data Residency

Was this page helpful?
Previous

Troubleshooting

Common issues and fixes when using the LLM Gateway.
Next
Built with

Choose the endpoint that best fits your application’s requirements—whether that’s using the default US region or ensuring your data stays within the European Union.

Default Endpoint

The default endpoint (llm-gateway.assemblyai.com) processes your LLM Gateway requests in the US region. If you don’t specify a base URL, this is the endpoint used by default.

EU Data Residency

The EU endpoint (llm-gateway.eu.assemblyai.com) guarantees your data never leaves the European Union. This is designed for organizations with strict data residency and governance requirements—your request and response data will remain entirely within the EU.

Endpoints

EndpointBase URLDescription
US (default)https://llm-gateway.assemblyai.com/v1/chat/completionsData stays in the US
EUhttps://llm-gateway.eu.assemblyai.com/v1/chat/completionsData stays in the EU

EU model availability

The EU endpoint currently supports Anthropic Claude and Google Gemini models. OpenAI models are only available through the US endpoint.

ProviderUSEU
Anthropic ClaudeYesYes
Google GeminiYesYes
OpenAI GPTYesNo
Alibaba Cloud QwenYesNo
Moonshot AI KimiYesNo

For a full list of available models, see the LLM Gateway Overview.

Which endpoint should I use?

  • No data residency requirements? Use the default endpoint. No configuration change is needed.
  • Need EU data residency? Use the EU endpoint to ensure your data stays within the European Union. Note that only Claude and Gemini models are available in the EU region.

How to use it

Update your request URL to your preferred endpoint. Select an endpoint tab below to see examples for each.

US (default)
EU data residency

The US endpoint is the default. No configuration change is required.

1import requests
2
3headers = {
4 "authorization": "<YOUR_API_KEY>"
5}
6
7# No URL change needed — US is the default
8response = requests.post(
9 "https://llm-gateway.assemblyai.com/v1/chat/completions",
10 headers=headers,
11 json={
12 "model": "claude-sonnet-4-6",
13 "messages": [
14 {"role": "user", "content": "What is the capital of France?"}
15 ],
16 "max_tokens": 1000
17 }
18)
19
20result = response.json()
21print(result["choices"][0]["message"]["content"])