Skip to main content

About our foundation model SSFM

Typecast currently uses an advanced AI voice model, the Typecast Speech Synthesis Foundation Model, or Typecast SSFM for short, which is our next generation text-to-speech technology that brings text to life with unparalleled naturalness and expressiveness.

Models overview

ModelRelease DateDescription
ssfm-v302026.01- More natural-sounding speech with smoother prosody and pacing
- Emotion controls with 7 emotion presets
- 37 languages supported
- Smart Emotion available
ssfm-v212025.04- Low latency
- Emotion controls with 4 emotion presets
- 27 languages supported

ssfm-v30

  • Smart Emotion: Automatically detects the appropriate emotion from the text context and applies it to the voice.
  • Emotion Presets: normal, happy, sad, angry, whisper, toneup, tonedown (available across all voices)
  • Languages Supported: English, Korean, Arabic, Bengali, Bulgarian, Cantonese, Chinese (Mandarin), Croatian, Czech, Danish, Dutch, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Italian, Japanese, Malay, Min Nan, Norwegian, Polish, Portuguese, Punjabi, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Vietnamese

ssfm-v21

  • Emotion Presets: normal, happy, sad, angry (availability varies by voice)
  • Languages Supported: English, Korean, Arabic, Bulgarian, Chinese, Croatian, Czech, Danish, Dutch, Finnish, French, German, Greek, Indonesian, Italian, Japanese, Malay, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Tamil, Ukrainian