customers
All customer stories
Top Voice AI companies are building with Assembly.
resources
Latest Release
Voice Agent API
Voice agents that get it right, respond instantly, and ship the same day with our new Voice Agent API
resources
Run AssemblyAI's Voice AI models on your own infrastructure to tighten latency, meet compliance requirements, and keep full control of your stack.
Self-host our speech-to-text models with the same accuracy and price-performance you get from AssemblyAI's cloud API.
Co-locate your Voice AI stack with the rest of your infrastructure so audio is processed close to where your traffic originates.
Keep every second of audio inside your environment, even while you're serving customers globally.
Tune scaling to match your exact traffic patterns. We provide the metrics and observability your autoscaling needs.
Run on any container orchestration platform — Kubernetes, AWS ECS, or whatever your team already uses.
Apply AssemblyAI usage to your cloud provider's committed spend so you get the discounts you've already negotiated.
Meet strict regulatory and data residency requirements by processing audio inside your controlled perimeter.
Run our Universal-3 Pro Streaming model with the same accuracy and speed you get from our cloud API.
Same usage-based pricing as the cloud — no self-hosting premium. Daily billing options and volume discounts included.
Full GPU support for maximum performance, with options for regions with hardware import limitations.