home/categories/productivity-tools/yonatangross-skillforge-claude-plugin-skills-audio-language-models-skill-md
productivity-toolstools

audio-language-models

Gemini Live API, Grok Voice Agent, GPT-4o-Transcribe, AssemblyAI patterns for real-time voice, speech-to-text, and TTS. Use when implementing voice agents, audio transcription, or conversational AI.

yonatangross
maintainer
yonatangross
Atualizado 1/19/2026
Estrelas
26
Forks
4
quick start

Installation and usage

Gemini Live API, Grok Voice Agent, GPT-4o-Transcribe, AssemblyAI patterns for real-time voice, speech-to-text, and TTS. Use when implementing voice agents, audio transcription, or conversational AI.

Instalação
$ install --globalskills.sh
Uso

Depois de instalar, você pode usar esta skill executando o seguinte comando no terminal:

skills use audio-language-models