Java

The official Java library for the Typecast API. Convert text to lifelike speech using AI-powered voices. Compatible with Java 8 and later versions. Works with Maven, Gradle, and manual installation.

Maven Central

Typecast Java SDK

Source Code

Typecast Java SDK Source Code

Installation

Maven
Gradle
Manual

Add the following dependency to your pom.xml:

<dependency>
    <groupId>com.neosapience</groupId>
    <artifactId>typecast-java</artifactId>
    <version>1.0.1</version>
</dependency>

Add to your build.gradle:

implementation 'com.neosapience:typecast-java:1.0.1'

Clone and install to local Maven repository:

git clone https://github.com/neosapience/typecast-sdk.git
cd typecast-sdk/typecast-java
mvn clean install -DskipTests

Make sure you have version 1.0.1 or higher installed. If you have an older version, update your dependency version in pom.xml or build.gradle.

Quick Start

import com.neosapience.TypecastClient;
import com.neosapience.models.*;

import java.io.FileOutputStream;

public class QuickStart {
    public static void main(String[] args) throws Exception {
        // Initialize client
        TypecastClient client = new TypecastClient("YOUR_API_KEY");

        // Convert text to speech
        TTSRequest request = TTSRequest.builder()
                .voiceId("tc_672c5f5ce59fac2a48faeaee")
                .text("Hello there! I'm your friendly text-to-speech agent.")
                .model(TTSModel.SSFM_V30)
                .build();

        TTSResponse response = client.textToSpeech(request);

        // Save audio file
        try (FileOutputStream fos = new FileOutputStream("output." + response.getFormat())) {
            fos.write(response.getAudioData());
        }

        System.out.println("Audio saved! Duration: " + response.getDuration() + "s, Format: " + response.getFormat());

        // Clean up
        client.close();
    }
}

Features

The Typecast Java SDK provides powerful features for text-to-speech conversion:

Multiple Voice Models: Support for ssfm-v30 (latest) and ssfm-v21 AI voice models
Multi-language Support: 37 languages including English, Korean, Spanish, Japanese, Chinese, and more
Emotion Control: Preset emotions (normal, happy, sad, angry, whisper, toneup, tonedown) or smart context-aware inference
Audio Customization: Control loudness (LUFS -70 to 0), pitch (-12 to +12 semitones), tempo (0.5x to 2.0x), and format (WAV/MP3)
Voice Discovery: V2 Voices API with filtering by model, gender, age, and use cases
Builder Pattern: Fluent API with builder pattern for easy request construction
Comprehensive Error Handling: Specific exception classes for each error type
Streaming: Real-time chunked audio delivery for low-latency playback

Configuration

Set your API key via environment variable, .env file, or constructor:

// Using environment variable
// export TYPECAST_API_KEY="your-api-key-here"
TypecastClient client = new TypecastClient();

// Or pass directly
TypecastClient client = new TypecastClient("your-api-key-here");

// Or with custom base URL
TypecastClient client = new TypecastClient("your-api-key-here", "https://custom-api.example.com");

Environment File

Create a .env file in your project root:

TYPECAST_API_KEY=your-api-key-here

Advanced Usage

Emotion Control (ssfm-v30)

ssfm-v30 offers two emotion control modes: Preset and Smart.

Smart Mode
Preset Mode

Let the AI infer emotion from context:

TTSRequest request = TTSRequest.builder()
        .voiceId("tc_672c5f5ce59fac2a48faeaee")
        .text("Everything is going to be okay.")
        .model(TTSModel.SSFM_V30)
        .prompt(SmartPrompt.builder()
                .previousText("I just got the best news!")  // Optional context
                .nextText("I can't wait to celebrate!")     // Optional context
                .build())
        .build();

TTSResponse response = client.textToSpeech(request);

Explicitly set emotion with preset values:

TTSRequest request = TTSRequest.builder()
        .voiceId("tc_672c5f5ce59fac2a48faeaee")
        .text("I am so excited to show you these features!")
        .model(TTSModel.SSFM_V30)
        .prompt(PresetPrompt.builder()
                .emotionPreset(EmotionPreset.HAPPY)  // normal, happy, sad, angry, whisper, toneup, tonedown
                .emotionIntensity(1.5)               // Range: 0.0 to 2.0
                .build())
        .build();

TTSResponse response = client.textToSpeech(request);

Audio Customization

Control loudness, pitch, tempo, and output format:

TTSRequest request = TTSRequest.builder()
        .voiceId("tc_672c5f5ce59fac2a48faeaee")
        .text("Customized audio output!")
        .model(TTSModel.SSFM_V30)
        .output(Output.builder()
                .targetLufs(-14.0)                 // Range: -70 to 0 (LUFS)
                .audioPitch(2)                  // Range: -12 to +12 semitones
                .audioTempo(1.2)                // Range: 0.5x to 2.0x
                .audioFormat(AudioFormat.MP3)   // Options: WAV, MP3
                .build())
        .seed(42)  // Non-negative seed for reproducible results
        .build();

TTSResponse response = client.textToSpeech(request);

try (FileOutputStream fos = new FileOutputStream("output." + response.getFormat())) {
    fos.write(response.getAudioData());
}
System.out.println("Duration: " + response.getDuration() + "s, Format: " + response.getFormat());

Voice Discovery (V2 API)

List and filter available voices with enhanced metadata:

// Get all voices
List<VoiceV2Response> voices = client.getVoicesV2();

// Filter by criteria
VoicesV2Filter filter = VoicesV2Filter.builder()
        .model(TTSModel.SSFM_V30)
        .gender(GenderEnum.FEMALE)
        .age(AgeEnum.YOUNG_ADULT)
        .build();

List<VoiceV2Response> filtered = client.getVoicesV2(filter);

// Display voice info
for (VoiceV2Response voice : voices) {
    System.out.println("ID: " + voice.getVoiceId() + ", Name: " + voice.getVoiceName());
    System.out.println("Gender: " + voice.getGender() + ", Age: " + voice.getAge());
    
    for (ModelInfo model : voice.getModels()) {
        System.out.println("Model: " + model.getVersion() + ", Emotions: " + model.getEmotions());
    }
    
    if (voice.getUseCases() != null) {
        System.out.println("Use cases: " + String.join(", ", voice.getUseCases()));
    }
}

Multilingual Content

The SDK supports 37 languages with automatic language detection:

// Auto-detect language (recommended)
TTSRequest request = TTSRequest.builder()
        .voiceId("tc_672c5f5ce59fac2a48faeaee")
        .text("こんにちは。お元気ですか。")
        .model(TTSModel.SSFM_V30)
        .build();

TTSResponse response = client.textToSpeech(request);

// Or specify language explicitly
TTSRequest koreanRequest = TTSRequest.builder()
        .voiceId("tc_672c5f5ce59fac2a48faeaee")
        .text("안녕하세요. 반갑습니다.")
        .model(TTSModel.SSFM_V30)
        .language(LanguageCode.KOR)  // ISO 639-3 language code
        .build();

TTSResponse koreanResponse = client.textToSpeech(koreanRequest);

try (FileOutputStream fos = new FileOutputStream("output." + response.getFormat())) {
    fos.write(response.getAudioData());
}

Streaming

Stream audio chunks in real-time for low-latency playback:

import javax.sound.sampled.*;

// Set up audio playback: 32000 Hz, 16-bit, mono, little-endian
AudioFormat format = new AudioFormat(32000, 16, 1, true, false);
SourceDataLine line = AudioSystem.getSourceDataLine(format);
line.open(format, 8192);
line.start();

try (InputStream stream = client.textToSpeechStream(request)) {
    byte[] buf = new byte[4096];
    boolean first = true;
    int bytesRead;

    while ((bytesRead = stream.read(buf)) != -1) {
        int offset = 0;
        if (first) {
            offset = 44;           // Skip 44-byte WAV header
            bytesRead -= 44;
            first = false;
        }
        line.write(buf, offset, bytesRead);
    }
}
line.drain();
line.close();
client.close();

WAV streaming format: 32000 Hz, 16-bit, mono PCM. The first chunk includes a 44-byte WAV header (size = 0xFFFFFFFF); subsequent chunks are raw PCM only. For MP3: 320 kbps, 44100 Hz, each chunk is independently decodable. Use com.neosapience.models.OutputStream to avoid collision with java.io.OutputStream. The streaming endpoint does not support volume or targetLufs.

Supported Languages

The SDK supports 37 languages with automatic language detection:

Code	Language	Code	Language	Code	Language
`ENG`	English	`JPN`	Japanese	`UKR`	Ukrainian
`KOR`	Korean	`ELL`	Greek	`IND`	Indonesian
`SPA`	Spanish	`TAM`	Tamil	`DAN`	Danish
`DEU`	German	`TGL`	Tagalog	`SWE`	Swedish
`FRA`	French	`FIN`	Finnish	`MSA`	Malay
`ITA`	Italian	`ZHO`	Chinese	`CES`	Czech
`POL`	Polish	`SLK`	Slovak	`POR`	Portuguese
`NLD`	Dutch	`ARA`	Arabic	`BUL`	Bulgarian
`RUS`	Russian	`HRV`	Croatian	`RON`	Romanian
`BEN`	Bengali	`HIN`	Hindi	`HUN`	Hungarian
`NAN`	Hokkien	`NOR`	Norwegian	`PAN`	Punjabi
`THA`	Thai	`TUR`	Turkish	`VIE`	Vietnamese
`YUE`	Cantonese

If not specified, the language will be automatically detected from the input text.

Error Handling

The SDK provides specific exception classes for handling API errors:

import com.neosapience.TypecastClient;
import com.neosapience.exceptions.*;

try {
    TTSResponse response = client.textToSpeech(request);
} catch (UnauthorizedException e) {
    // 401: Invalid API key
    System.err.println("Invalid API key: " + e.getMessage());
} catch (PaymentRequiredException e) {
    // 402: Insufficient credits
    System.err.println("Insufficient credits: " + e.getMessage());
} catch (ForbiddenException e) {
    // 403: Access denied
    System.err.println("Access denied: " + e.getMessage());
} catch (NotFoundException e) {
    // 404: Resource not found
    System.err.println("Voice not found: " + e.getMessage());
} catch (UnprocessableEntityException e) {
    // 422: Validation error
    System.err.println("Validation error: " + e.getMessage());
} catch (RateLimitException e) {
    // 429: Rate limit exceeded
    System.err.println("Rate limit exceeded - please retry later");
} catch (InternalServerException e) {
    // 500: Server error
    System.err.println("Server error: " + e.getMessage());
} catch (TypecastException e) {
    // Generic error
    System.err.println("API error (" + e.getStatusCode() + "): " + e.getMessage());
}

Exception Hierarchy

Exception	Status Code	Description
`BadRequestException`	400	Invalid request parameters
`UnauthorizedException`	401	Invalid or missing API key
`PaymentRequiredException`	402	Insufficient credits
`ForbiddenException`	403	Access denied
`NotFoundException`	404	Resource not found
`UnprocessableEntityException`	422	Validation error
`RateLimitException`	429	Rate limit exceeded
`InternalServerException`	500	Server error
`TypecastException`	*	Base exception class

Eclipse IDE Setup

Import Project

Open Eclipse
Go to File → Import...
Select Maven → Existing Maven Projects
Browse to the typecast-java directory
Click Finish

Add Dependency

Add to your project’s pom.xml:

<dependency>
    <groupId>com.neosapience</groupId>
    <artifactId>typecast-java</artifactId>
    <version>1.0.1</version>
</dependency>

Update Project

Right-click on your project → Maven → Update Project...

IntelliJ IDEA Setup

Import Project

Open IntelliJ IDEA
Go to File → Open...
Select the typecast-java directory
Select “Open as Project”

Add Dependency

IntelliJ will automatically detect the pom.xml and import dependencies.Or add to your project’s pom.xml:

<dependency>
    <groupId>com.neosapience</groupId>
    <artifactId>typecast-java</artifactId>
    <version>1.0.1</version>
</dependency>

Reload Maven

Click the Maven refresh button or right-click pom.xml → Maven → Reload Project

API Reference

TypecastClient Methods

Method	Description
`textToSpeech(TTSRequest)`	Convert text to speech audio
`getVoicesV2()`	Get all available voices
`getVoicesV2(VoicesV2Filter)`	Get filtered voices
`getVoiceV2(String voiceId)`	Get a specific voice by ID
`close()`	Release resources

TTSRequest Fields

Field	Type	Required	Description
`voiceId`	`String`	✓	Voice ID (format: `tc_` or `uc_`)
`text`	`String`	✓	Text to synthesize (max 5000 chars)
`model`	`TTSModel`	✓	TTS model (`SSFM_V21` or `SSFM_V30`)
`language`	`LanguageCode`		ISO 639-3 code (auto-detected if omitted)
`prompt`	`Prompt` / `PresetPrompt` / `SmartPrompt`		Emotion settings
`output`	`Output`		Audio output settings
`seed`	`Integer`		Non-negative integer seed for reproducibility (≥ 0)

TTSResponse Fields

Field	Type	Description
`audioData`	`byte[]`	Generated audio data
`duration`	`double`	Audio duration in seconds
`format`	`String`	Audio format (`wav` or `mp3`)

GET STARTED

SDKs

INTEGRATIONS

Maven Central

Source Code

Installation

Quick Start

Features

Configuration

Environment File

Advanced Usage

Emotion Control (ssfm-v30)

Audio Customization

Voice Discovery (V2 API)

Multilingual Content

Streaming

Supported Languages

Error Handling

Exception Hierarchy

Eclipse IDE Setup

IntelliJ IDEA Setup

API Reference

TypecastClient Methods

TTSRequest Fields

TTSResponse Fields

GET STARTED

SDKs

INTEGRATIONS

Documentation Index

Maven Central

Source Code

​Installation

​Quick Start

​Features

​Configuration

​Environment File

​Advanced Usage

​Emotion Control (ssfm-v30)

​Audio Customization

​Voice Discovery (V2 API)

​Multilingual Content

​Streaming

​Supported Languages

​Error Handling

​Exception Hierarchy

​Eclipse IDE Setup

​IntelliJ IDEA Setup

​API Reference

​TypecastClient Methods

​TTSRequest Fields

​TTSResponse Fields

Installation

Quick Start

Features

Configuration

Environment File

Advanced Usage

Emotion Control (ssfm-v30)

Audio Customization

Voice Discovery (V2 API)

Multilingual Content

Streaming

Supported Languages

Error Handling

Exception Hierarchy

Eclipse IDE Setup

IntelliJ IDEA Setup

API Reference

TypecastClient Methods

TTSRequest Fields

TTSResponse Fields