About our foundation model SSFM
Typecast currently uses an advanced AI voice model, the Typecast Speech Synthesis Foundation Model, or Typecast SSFM for short, which is our next generation text-to-speech technology that brings text to life with unparalleled naturalness and expressiveness.Models overview
| Model | Release Date | Description |
|---|---|---|
| ssfm-v30 | 2026.01 | - More natural-sounding speech with smoother prosody and pacing - Emotion controls with 7 emotion presets - 37 languages supported - Smart Emotion available |
| ssfm-v21 | 2025.04 | - Low latency - Emotion controls with 4 emotion presets - 27 languages supported |
ssfm-v30
- Smart Emotion: Automatically detects the appropriate emotion from the text context and applies it to the voice.
- Emotion Presets:
normal,happy,sad,angry,whisper,toneup,tonedown(available across all voices) - Languages Supported: English, Korean, Arabic, Bengali, Bulgarian, Cantonese, Chinese (Mandarin), Croatian, Czech, Danish, Dutch, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Italian, Japanese, Malay, Min Nan, Norwegian, Polish, Portuguese, Punjabi, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Vietnamese
ssfm-v21
- Emotion Presets:
normal,happy,sad,angry(availability varies by voice) - Languages Supported: English, Korean, Arabic, Bulgarian, Chinese, Croatian, Czech, Danish, Dutch, Finnish, French, German, Greek, Indonesian, Italian, Japanese, Malay, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Tamil, Ukrainian