home/categories/productivity-tools/yonatangross-skillforge-claude-plugin-skills-audio-language-models-skill-md
productivity-toolstools

audio-language-models

Gemini Live API, Grok Voice Agent, GPT-4o-Transcribe, AssemblyAI patterns for real-time voice, speech-to-text, and TTS. Use when implementing voice agents, audio transcription, or conversational AI.

yonatangross
maintainer
yonatangross
更新于 1/19/2026
星标
26
分支
4
quick start

Installation and usage

Gemini Live API, Grok Voice Agent, GPT-4o-Transcribe, AssemblyAI patterns for real-time voice, speech-to-text, and TTS. Use when implementing voice agents, audio transcription, or conversational AI.

安装
$ install --globalskills.sh
使用

安装后,您可以通过在终端运行以下命令来使用此技能:

skills use audio-language-models