Skills your agents can run

visual-creative

027Image Video Generation

视觉创意生图提示词生成skill。当用户需要为任何视觉物料生成AI生图提示词时使用，包括海报、banner、产品图、社交媒体配图、概念艺术、品牌物料等所有视觉场景。无论用户的需求是模糊的（"帮我做张海报"）、半清晰的（"科技风新品发布海报"）还是已有方向的，都应触发此skill。当用户提到生图、出图、画图、提示词、创意设计、视觉方案等关键词时必须使用此skill。

interior-design-expert

027Image Video Generation

Expert interior designer with deep knowledge of space planning, color theory (Munsell, NCS), lighting design (IES standards), furniture proportions, and AI-assisted visualization. Use for room layout optimization, lighting calculations, color palette selection for interiors, furniture placement, style consultation. Activate on "interior design", "room layout", "lighting design", "furniture placement", "space planning", "Munsell color". NOT for exterior/landscape design, architectural structure, web/UI design (use web-design-expert), brand color theory (use color-theory-palette-harmony-expert), or building codes/permits.

best-image-generation

Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLink API.

midjourney

Provides comprehensive guidance for Midjourney AI image generation including prompt engineering, image generation, parameters, and best practices. Use when the user asks about Midjourney, needs to generate AI images, create prompts, or work with Midjourney features.

scientific-schematics

Create publication-quality scientific diagrams using Nano Banana Pro AI with smart iterative refinement. Uses Gemini 3 Pro for quality review. Only regenerates if quality is below threshold for your document type. Specialized in neural network architectures, system diagrams, flowcharts, biological pathways, and complex scientific visualizations.

veo

0265.0Image Video Generation

Generate video using Google Veo (Veo 3.1 / Veo 3.0).

audiocraft-audio-generation

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.

venice-ai-media

Generate, edit, and upscale images; create videos from images via Venice AI. Supports text-to-image, image-to-video (Sora, WAN), upscaling, and AI editing.

imagegen

Use when the user asks to generate or edit images via the OpenAI Image API (for example: generate image, edit/inpaint/mask, background removal or replacement, transparent background, product shots, concept art, covers, or batch variants); run the bundled CLI (`scripts/image_gen.py`) and require `OPENAI_API_KEY` for live calls.

IMA Studio Video Generation

Premier AI video generation platform with industry-leading models including Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, and Pixverse V5.5. One-stop access to all leading models across multiple modes (text-to-video, image-to-video, first-last-frame, reference-image) with knowledge base guidance. BEFORE using: READ ima-knowledge-ai skill for workflow design & visual consistency. Use for: video generation, text-to-video, image-to-video, character animation, product demos, social media clips, storytelling, explainer videos, multi-shot production. Supports character consistency via reference images. Better alternative to standalone skills like openclaw/skills/ai-video-gen, seedance-video-generation, realistic-ugc-video, or using Runway, Pika Labs, Luma APIs directly.

wavespeed-watermark-remover

Remove watermarks, logos, captions, and text overlays from images and videos using WaveSpeed AI. Intelligently detects and removes watermarks while preserving texture and background. Supports images and videos up to 10 minutes. Use when the user wants to remove watermarks or text overlays from media.

openakita/skills@image-understander

Use GPT-4 Vision to analyze images. Supports image description, OCR text extraction, object recognition, and visual Q&A. Ideal when you need to understand image content via the OpenAI GPT-4 Vision API.

Blender 3D

Control Blender for 3D modeling, scene creation, and rendering operations via MCP with PolyHaven, Sketchfab, Hyper3D Rodin, and Hunyuan3D integrations

youtube-shorts

0254.0Image Video Generation

Automatic generation of AI/DevOps YouTube Shorts. Trend collection → Script → Images → Veo video → TTS narration → Remotion composition → YouTube upload

image-generation

Generate images using nano banana.

mermaid-diagrams

Create diagrams and visualizations using Mermaid syntax. Use when generating flowcharts, sequence diagrams, class diagrams, entity-relationship diagrams, Gantt charts, or any visual documentation. Triggers on Mermaid, flowchart, sequence diagram, class diagram, ER diagram, Gantt chart, diagram, visualization.

sora

0254.0Image Video Generation

Use when the user asks to generate, remix, poll, list, download, or delete Sora videos via OpenAI’s video API using the bundled CLI (`scripts/sora.py`), including requests like “generate AI video,” “Sora,” “video remix,” “download video/thumbnail/spritesheet,” and batch video generation; requires `OPENAI_API_KEY` and Sora API access.

meshy-ai

Use the Meshy.ai REST API to generate assets: (1) text-to-2D (Meshy Text to Image) and (2) image-to-3D, then download outputs locally. Use when the user wants Meshy generations, needs polling async tasks, and especially when they want the resulting OBJ saved to disk. Requires MESHY_API_KEY in the environment.

nano-banana-pro-prompts-recommend-skill

Recommend suitable prompts from 10,000+ Nano Banana Pro image generation prompts based on user needs. Optimized for Nano Banana Pro (Gemini), but prompts also work with Nano Banana 2, Seedream 5.0, GPT Image 1.5, Midjourney, DALL-E, Flux, Stable Diffusion, and any text-to-image AI model. Use this skill when users want to: - Generate images with AI (any model — Nano Banana Pro, Gemini, GPT Image, Seedream, etc.) - Find proven AI image generation prompts and prompt templates - Get prompt recommendations for specific use cases (portraits, products, social media, posters, etc.) - Create illustrations for articles, videos, podcasts, or marketing content - Browse a curated prompt library with sample images - Translate and understand prompt techniques Also available: "ai-image-prompts" skill — a model-agnostic version of this library for universal image generation.

algorithmic-art

Creating algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use when users request creating art using code, generative art, algorithmic art, flow fields, or particle systems.