AUDiO, ViD —> TEXT
Transcribe audio to text with speaker diarization
Transcribe audio files and YouTube videos to text
Transcribe audio files into timestamped text