Skip to main content
Typecast is an AI-powered text-to-speech platform for creating natural voice narration. It provides a wide range of synthetic voices and converts text into speech across multiple languages.
New to Typecast? Use the Ask Assistant button at the top of the Typecast docs. Ask questions like “Which SDK should I use for a Next.js app?”, “How do I stream audio?”, or “Show me the fastest way to generate captions.”

What Typecast API provides

37 languages

Generate speech in Korean, English, Japanese, Chinese, Spanish, Vietnamese, and many more languages with the ssfm-v30 model.

500+ voices

Choose voices by model, gender, age group, and use case, or create a custom voice with instant cloning.

Natural expression

Control emotion, pacing, format, and language while using consistent voice metadata across API and SDK flows.

Core API features

Use Typecast API for full audio generation, real-time playback, subtitle timing, and custom voice creation.

Text-to-speech

Convert text into complete WAV or MP3 audio files for apps, videos, narration, learning content, and voice products.

Streaming TTS

Play audio as chunks arrive instead of waiting for the full synthesis result. Useful for voice agents, interactive apps, and low-latency playback.

Timestamp TTS

Generate audio with word- or character-level alignment data for subtitles, karaoke highlights, and lip-sync.

Instant Cloning

Create a custom voice from a short audio sample and use it in text-to-speech requests alongside Typecast-provided voices.

Choose how to integrate

After you understand what Typecast provides, choose the workflow that matches how you want to build.

AI agent implementation

Let Claude, Cursor, OpenClaw, or another agent read the Typecast docs and generate integration code for you. Prefer the official SDK for your language instead of asking the agent to hand-roll raw HTTP calls.

Direct implementation

Follow the quickstart, create an API key, pick a voice, and run your first TTS request. Use the API Reference when you need exact request and response fields.

Direct implementation

Quickstart

Create an API key and generate your first audio file.

SDKs

Use Python, JavaScript, Go, Rust, C#, Java, Kotlin, C, Swift, Zig, PHP, Dart, or Ruby.

API Reference

Check exact endpoints, request parameters, response schemas, and Try It examples.

AI-assisted development

If you are asking an AI agent to add Typecast to your project, give it one of these docs first:

Skills

Best for Claude Code and Claude Desktop. The agent gets task-specific Typecast instructions and examples.

MCP

Best when your agent can connect to remote MCP docs or a self-hosted Typecast MCP server.

OpenClaw

Best for local agent workflows that can run shell commands, use the cast CLI, or connect MCP tools.
Prompt your agent to use the official SDK page for your language first. SDKs include helpers for text-to-speech, streaming, timestamp TTS, subtitle export, voice lookup, and error handling.

No-code tools

If you want to automate voice generation without writing a full app, start with the integration that matches your workflow:

Zapier

Trigger Typecast voice generation from thousands of apps and automate handoffs between tools.

Make

Build visual scenarios for repeatable TTS pipelines, content workflows, and multilingual production.

n8n

Create self-hosted or cloud automation workflows that call Typecast as one step in a larger process.

Google Sheets

Generate audio from spreadsheet rows for batch jobs, team operations, and content lists.

API Reference

Check exact endpoints, request parameters, response schemas, and Try It examples.

Voice Library

Browse available voices and choose the voice ID for your request.

Pricing

Check plan limits, pricing, and credit usage before you ship or scale.

Models

Compare ssfm-v30 and ssfm-v21 language support, emotion controls, and model behavior.

Next step

Start with the Quickstart, or open Ask Assistant and describe your app, language, and target feature.