gemini-imagegen
Generate and edit images using Gemini API (Nano Banana Pro). Supports text-to-image, image editing, multi-turn refinement, Google Search grounding for factual accuracy, and composition from multiple reference images.
Generate and edit images using Gemini API (Nano Banana Pro). Supports text-to-image, image editing, multi-turn refinement, Google Search grounding for factual accuracy, and composition from multiple reference images.
Guide for implementing Google Gemini API audio capabilities - analyze audio with transcription, summarization, and understanding (up to 9.5 hours), plus generate speech with controllable TTS. Use when processing audio files, creating transcripts, analyzing speech/music/sounds, or generating natural speech from text.
Phi-4 LLM interaction skill for generating text completions via Ollama API. Use for all LLM inference tasks including section detection, summarization, recommendation generation, and quality evaluation.
Audit and optimize website for AI search engines like ChatGPT, Perplexity, Google AI Overviews, and Claude. Use when discussing GEO (Generative Engine Optimization), SEO for AI, llms.txt, AI crawlers, structured data for LLMs, or visibility in AI search results.
画像生成AIに最適化されたプロンプトを作成。Nano Banana Pro (Gemini 3 Pro)、Imagen 3、DALL-E向けのプロンプトを生成する。
Generate detailed AI image prompts for YouTube video B-roll using Imagen. Creates premium, sophisticated visuals for talking head videos targeting 30+ business owners.
Transforms content into a voiceover-ready script optimized for Chatterbox TTS. Use when the user provides ANY content for voiceover - URLs, raw text, video scripts, notes, or asks to "create a script" for audio.
This skill should be used when the user asks to "generate video", "create video", "make a video", "animate image", "image to video", "video with audio", "talking video", or needs AI-powered video generation with synchronized audio, lip-syncing, and cinematic camera controls.
Video production workflow orchestrator for Veo 3. Guides users through creating professional video prompts via two paths - direct text-to-video OR image-to-video pipeline (Imagen 3/4 → Veo 3). Validates prompt completeness, checks conflicts, ensures all mandatory components present. Integrates camera-movements, great-prompt-anatomy, short-prompt-guide, long-prompt-guide, and imagine skills.
Generate and edit images using the Gemini API (Nano Banana). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.
Generate AI images with templates and style references. Use when creating images/illustrations/diagrams, managing templates or style references, choosing image providers (Gemini, OpenRouter, VertexAI), or when the user mentions /create-image (including /create-image help).
Knowledge base for analyzing and replicating writing voice. Use when learning voice patterns or generating voice-matched content.
Generate and edit images using the Gemini API (Nano Banana). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.
Generate illustrations using Google's Nano Banana (Gemini Image) API. Use when creating images, illustrations, visual content from text prompts, or illustrating documents/stories. Defaults to Pro model with landscape (16:9) aspect ratio. Requires GOOGLE_API_KEY in environment.
Low-level Gemini API skill for image generation. Use this skill when you need direct API control, custom resolution/aspect ratios, multi-turn refinement, or non-brand image tasks (logos, stickers, product mockups). For tacosdedatos post banners and brand-consistent illustrations, use tacosdedatos-illustrator instead—it handles creative direction and calls this skill internally.
This skill should be used when the user asks to "generate an image", "create project images", "make illustrations", "generate icons", "create visual assets", "use Gemini for images", "generate with nono banana", or needs AI-generated images for their project using Google's Gemini API.
NanoBanana(Google Gemini画像生成)向けの高品質プロンプトを生成。画像生成、編集、フェイススワップ、背景変更、キャラクター一貫性のプロンプト作成を支援。
Write tweets in Keanan's voice. Use when drafting tweets, creating Twitter threads, or crafting social media content for X/Twitter. Triggers include requests to "write a tweet", "draft a post for X", "help me tweet about", or any Twitter/X content creation. Generates punchy, first-person content mixing playful wonder with pragmatic realism.
Apply Matt Palmer's voice, tone, and content pillars to any writing. Use for blog posts, social media, documentation, emails, or any content needing Matt's authentic brand voice.
A skill for advanced text manipulation, including sentiment analysis and summarization.
Ruthless concision for AI prompts and context - "Not One Word Wasted"