home/categories/productivity-tools/yonatangross-skillforge-claude-plugin-skills-audio-language-models-skill-md
productivity-toolstools

audio-language-models

Gemini Live API, Grok Voice Agent, GPT-4o-Transcribe, AssemblyAI patterns for real-time voice, speech-to-text, and TTS. Use when implementing voice agents, audio transcription, or conversational AI.

yonatangross
maintainer
yonatangross
Updated 1/19/2026
Stars
26
Forks
4
quick start

Installation and usage

Gemini Live API, Grok Voice Agent, GPT-4o-Transcribe, AssemblyAI patterns for real-time voice, speech-to-text, and TTS. Use when implementing voice agents, audio transcription, or conversational AI.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use audio-language-models