OpenClaw is an AI agent runtime that can execute tools, shell commands, and MCP servers. With Typecast, your agent can generate natural-sounding speech in a single command.
What You Can Do
With Typecast and OpenClaw, you can:- Generate speech from agent workflows — Convert any text output to natural-sounding audio
- Choose from 500+ voices — Select voices by gender, age, and style
- Control emotion — Apply Smart Emotion or preset emotions (happy, sad, angry, whisper, etc.)
- Support 37 languages — Generate speech in English, Korean, Japanese, Chinese, and more
- Automate audio pipelines — Combine with other tools for end-to-end content creation
Prerequisites
Before you start, make sure you have:- OpenClaw installed —
npm install -g openclaw@latest - Typecast API Key — Get yours here
- Typecast CLI (
cast) — The fastest integration path
Quick Start: cast CLI
The official Typecast CLI turns speech generation into a single shell command. If your agent can run shell commands, it can generate Typecast audio without writing a custom provider.Step 1: Install the CLI
- Homebrew (macOS/Linux)
- Go
Step 2: Authenticate
Step 3: Verify
Integration Methods
Method 1: cast via Local exec (Recommended)
OpenClaw distinguishes localexec from remote code_execution. Use local exec when the command must access installed binaries on the machine.
Simply ask your OpenClaw agent:
Method 2: MCP Server (Tool-Native)
For deeper integration, connect the Typecast API MCP server so OpenClaw can call TTS tools directly.- CLI Registration
- Bundle Config (JSON)
typecast__synthesize_speech, typecast__list_voices, etc. Ask your agent:
Voice & Emotion Control
Finding Voices
Use thecast CLI to list available voices:
list_voices tool to browse by gender, age, and use case.
Emotion Options
Smart Emotion
AI automatically detects the best emotion from text context. Great for natural conversations and storytelling.
Preset Emotion
Manually choose from 7 emotions: Normal, Happy, Sad, Angry, Whisper, Tone Up, Tone Down.
Example Workflows
Automated meeting summary narration
Automated meeting summary narration
- OpenClaw receives meeting transcript
- Agent summarizes key points with an LLM
- Agent runs
castto generate audio summary - Output file is uploaded to Slack or Google Drive
Multilingual content pipeline
Multilingual content pipeline
- Agent receives content in English
- Translates to Korean, Japanese, Chinese
- Generates Typecast audio for each language
- Saves all audio files to cloud storage
CI/CD voice notifications
CI/CD voice notifications
- Build pipeline triggers OpenClaw agent
- Agent generates status message: “Build succeeded” or “Build failed”
castproduces audio notification- Audio is posted to team Discord channel
Troubleshooting
cast: command not found
cast: command not found
Install the CLI in the same runtime where OpenClaw executes tools. If installed via Homebrew, verify your
PATH includes the Homebrew bin directory.Authentication error
Authentication error
Run
cast login or pass your API key directly with cast login <api_key>. Verify at the Typecast API Console.No audio playback in headless sessions
No audio playback in headless sessions
Use
--out to save to a file instead of playing audio. Return the file path to the user.Agent prints the command but doesn't run it
Agent prints the command but doesn't run it
Explicitly ask the agent to use the
exec tool or local shell. Add a project instruction to clarify this behavior.MCP server not connecting
MCP server not connecting
- Ensure
uvxis installed and onPATH:command -v uvx - Check that
TYPECAST_API_KEYis set in the environment - Run
openclaw mcp show typecastto verify registration
Resources
Typecast API Console
Get your API key
Voice Library
Browse all available voices
API Reference
Explore the Typecast API
MCP Server
Typecast MCP Server docs