Model Overview
Stem Separation
| Model | API Reference | Description |
|---|---|---|
| gsep_music_hq_v1 | View | Separates vocals, drums, bass, electric guitar, piano, and other instruments |
| gsep_music_shq_v1 | View | Higher-quality model that separates into vocals and accompaniment only |
| gsep_speech_hq_v1 | View | Isolates speech from background noise to deliver clearer voice |
DME Separation
| Model | API Reference | Description |
|---|---|---|
| gsep_dme_dtrack_v1 | View | Outputs only the dialogue track, removing both music and effects. Ideal for subtitling, dubbing, and localization |
| gsep_dme_etrack_v1 | View | Outputs only the effects track, removing dialogue and music. Helpful for Foley, sound design, and post-production editing |
| gsep_dme_metrack_v1 | View | Outputs music + effects together, with dialogue removed. Perfect for ADR (Automated Dialogue Replacement) |
| gsep_dme_mtrack_v1 | View | Outputs only the music track, without dialogue or effects. Great for karaoke, remixing, or music replacement |
AI Text Sync
| Model | API Reference | Description |
|---|---|---|
| gts_lyrics_line_v1 | View | Line-level lyric alignment; supports English, Korean, Japanese, Chinese (Simplified) |
| gts_lyrics_line_v3 | – | Not yet available |