Generate voice-converted audio or TTS from text
Generate audio from text or modify voice pitch
Generate audio from text using voice synthesis