New Endpoint: POST /v1/text-to-speech/stream
Added a low-latency streaming endpoint that delivers audio chunks as they are generated, enabling real-time playback without waiting for full synthesis./v1/text-to-speech:| Feature | Standard | Streaming |
|---|---|---|
| Response | Complete audio file | Chunked audio stream |
| Latency | Wait for full synthesis | First chunk in ~200ms |
volume / target_lufs | Supported | Not supported |
| Output settings | Output | OutputStream (pitch, tempo, format only) |
audio/wav or audio/mpeg).New Endpoint: GET /v1/users/me/subscription
Retrieve the authenticated user’s plan tier, credit usage, and concurrency limits.SDK Updates
All 9 official SDKs have been updated with streaming and subscription support:| SDK | Version | Streaming Method |
|---|---|---|
| Python | 0.2.0 | text_to_speech_stream() (sync + async) |
| JavaScript | 0.3.0 | textToSpeechStream() → ReadableStream |
| Go | v0.2.0 | TextToSpeechStream() → io.ReadCloser |
| Rust | 0.2.0 | text_to_speech_stream() → Stream<Bytes> |
| Swift | v0.2.0 | textToSpeechStream() → AsyncThrowingStream |
| C# | 0.2.0 | TextToSpeechStreamAsync() → Stream |
| Java | 1.1.0 | textToSpeechStream() → InputStream |
| Kotlin | 1.1.0 | textToSpeechStream() → InputStream |
| C | 1.1.0 | typecast_text_to_speech_stream() (callback) |
New Model: ssfm-v30
Added support for the newssfm-v30 model with improved speech quality and expanded capabilities.New Features:- Smart Emotion - Context-aware emotion inference using
SmartPrompt - 7 Emotion Presets - Added
whisper,toneup,tonedownpresets - Universal Emotion Support - All emotions available across all voices
- 37 Languages - Added 10 new languages
New Endpoint: GET /v2/voices
Added enhanced voice listing endpoint with model-grouped emotions and additional metadata.| Parameter | Type | Description |
|---|---|---|
model | string | Filter by model (ssfm-v21, ssfm-v30) |
gender | string | Filter by gender (male, female) |
age | string | Filter by age group (child, teenager, young_adult, middle_age, elder) |
use_cases | string | Filter by use case (Audiobook, Game, E-learning, etc.) |
Deprecated: Voice Management Endpoints
The following endpoints have been deprecated and removed:| Endpoint | Status |
|---|---|
POST /v1/voices | Removed |
GET /v1/voices/{voice_id} | Removed |
GET /v2/voices for listing voices with enhanced metadata.Initial Release: ssfm-v21
Launched the Typecast Text-to-Speech API with thessfm-v21 model.Endpoints:| Method | Endpoint | Description |
|---|---|---|
| POST | /v1/text-to-speech | Generate speech from text |
| GET | /v1/voices | List available voices |
- Low latency speech synthesis
- 4 Emotion presets:
normal,happy,sad,angry - Emotion availability varies by voice
- 27 languages supported