Audio catalog
Audio Models
Browse a compact catalog of audio models for music and speech workflows, with pricing and deeper model details one click away.
3 models
2 providers in this family
MiniMax
MiniMax Speech 2.8
MiniMax Speech 2.8 is an async text-to-speech API with HD and turbo quality modes, preset voices, custom MiniMax voice_id support, emotion control, pronunciation dictionaries, and multilingual output settings.
New | Text to Speech | Text to Audio
from $0.06/track
MiniMax
MiniMax Voice
MiniMax Voice creates reusable custom voice IDs from either a text description of the voice or a single public reference audio clip, then returns preview audio for validation.
New | Text to Audio
from $0.50/track

Suno
Suno V5
Suno V5 is the latest Suno text-to-music model. It returns two polished songs per call, with faster queues and richer vocals.
New | Text to Audio
from $0.12/track