Speech Data
- ASR (Automatic Speech Recognition) – (transcription, multilingual ASR, code-switching)
- TTS (Text-to-Speech) – (single-speaker, multi-speaker, expressive TTS)
- Speech-to-Speech Translation (STS) – (direct speech translation across languages)
- Audio Understanding – (audio classification, sound event detection)
- Speech emotion recognition
- Speaker diarization