Skip to main content
Audiobook workflows benefit from repeatable naming, stable voice settings, and chapter-level files. The CLI gives an agent a simple command surface for generating drafts, rerendering chapters, and keeping approved audio separate.

Prepare defaults

Set the voice and model once before generating chapters:
cast config set voice-id tc_xxx
cast config set model ssfm-v30
cast config set format mp3
If you need a project-specific voice, clone it first and store the returned uc_ voice ID:
cast voices clone narrator-sample.wav --name "Narrator Draft"
cast config set voice-id uc_xxx

Generate chapters

cast "$(cat chapter-01.txt)" \
  --emotion smart \
  --out audiobook/chapter-01.mp3 \
  --format mp3

cast "$(cat chapter-02.txt)" \
  --emotion smart \
  --out audiobook/chapter-02.mp3 \
  --format mp3

Generate captions or review timing

cast "$(cat chapter-01.txt)" \
  --emotion smart \
  --out audiobook/chapter-01.wav \
  --timestamps-out audiobook/chapter-01.timestamps.json
Use JSON for detailed review timing, or write .srt / .vtt when the audiobook content also needs a video preview.

Use emotion for scene changes

For short passages with a known tone, preset emotion can be more controllable than smart emotion:
cast "The room fell silent as the letter slipped from her hand." \
  --emotion preset \
  --emotion-preset sad \
  --emotion-intensity 1.2 \
  --out audiobook/scene-letter.mp3
For passages where surrounding context matters, pass neighboring text:
cast "She opened the door and froze." \
  --emotion smart \
  --prev-text "The hallway had been empty a moment ago." \
  --next-text "A familiar voice whispered her name." \
  --out audiobook/scene-door.mp3

Agent prompt pattern

Generate audiobook draft files from the chapter text files.
Use the same voice for every chapter.
Write output to ./audiobook.
Use one MP3 per chapter.
For chapters that also need video previews, generate an SRT file next to the audio.
If a chapter fails, report the filename and keep going.

File naming

AssetSuggested filename
Full chapterchapter-01.mp3
Scene revisionchapter-01-scene-03-v2.mp3
Approved finalchapter-01-final.mp3
Alternate deliverychapter-01-alt-happy.mp3

When to use advanced features

NeedRecommended feature
Consistent narrator identitycast config set voice-id ...
Temporary narrator matching a samplecast voices clone narrator-sample.wav --name "Narrator Draft"
Chapter review timing--timestamps-out chapter-01.timestamps.json
Video preview for a chapter--timestamps-out chapter-01.srt
Quick approval playbackcast "one review sentence" without --out
Avoid overwriting approved audio. Ask the agent to write revisions with -v2, -v3, or a delivery label.