Skills your agents can run
Plug-and-play capabilities — vetted, versioned, and runnable from any Mighty agent.
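Visual creative prompt-generation skill for AI image generation. Use when the user needs AI image-generation prompts for any visual asset, including posters, banners, product images, social media graphics, concept art, brand materials, and all other visual scenarios. Trigger this skill whether the user's request is vague ("make me a poster"), partly defined ("a tech-style new product launch poster"), or already has a clear direction. This skill must be used when the user mentions keywords such as image generation, rendering, drawing, prompts, creative design, or visual proposals.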
Expert interior designer with deep knowledge of space planning, color theory (Munsell, NCS), lighting design (IES standards), furniture proportions, and AI-assisted visualization. Use for room layout optimization, lighting calculations, color palette selection for interiors, furniture placement, style consultation. Activate on "interior design", "room layout", "lighting design", "furniture placement", "space planning", "Munsell color". NOT for exterior/landscape design, architectural structure, web/UI design (use web-design-expert), brand color theory (use color-theory-palette-harmony-expert), or building codes/permits.
High-quality AI image generation (about $0.12-0.20 per image). Text-to-image, image-to-image, and image editing via the EvoLink API.
Provides comprehensive guidance for Midjourney AI image generation including prompt engineering, image generation, parameters, and best practices. Use when the user asks about Midjourney, needs to generate AI images, create prompts, or work with Midjourney features.
Create publication-quality scientific diagrams using Nano Banana Pro AI with smart iterative refinement. Uses Gemini 3 Pro for quality review. Only regenerates if quality is below threshold for your document type. Specialized in neural network architectures, system diagrams, flowcharts, biological pathways, and complex scientific visualizations.
Generate video using Google Veo (Veo 3.1 / Veo 3.0).
PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.
Generate, edit, and upscale images; create videos from images via Venice AI. Supports text-to-image, image-to-video (Sora, WAN), upscaling, and AI editing.
Use when the user asks to generate or edit images via the OpenAI Image API (for example: generate image, edit/inpaint/mask, background removal or replacement, transparent background, product shots, concept art, covers, or batch variants); run the bundled CLI (`scripts/image_gen.py`) and require `OPENAI_API_KEY` for live calls.
Premier AI video generation platform with industry-leading models including Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, and Pixverse V5.5. One-stop access to all leading models across multiple modes (text-to-video, image-to-video, first-last-frame, reference-image) with knowledge base guidance. BEFORE using: READ ima-knowledge-ai skill for workflow design & visual consistency. Use for: video generation, text-to-video, image-to-video, character animation, product demos, social media clips, storytelling, explainer videos, multi-shot production. Supports character consistency via reference images. Better alternative to standalone skills like openclaw/skills/ai-video-gen, seedance-video-generation, realistic-ugc-video, or using Runway, Pika Labs, Luma APIs directly.
Remove watermarks, logos, captions, and text overlays from images and videos using WaveSpeed AI. Intelligently detects and removes watermarks while preserving texture and background. Supports images and videos up to 10 minutes. Use when the user wants to remove watermarks or text overlays from media.
Use GPT-4 Vision to analyze images. Supports image description, OCR text extraction, object recognition, and visual Q&A. Ideal when you need to understand image content via the OpenAI GPT-4 Vision API.
Control Blender for 3D modeling, scene creation, and rendering operations via MCP, with PolyHaven, Sketchfab, Hyper3D Rodin, and Hunyuan3D integrations.
Automatically generate AI/DevOps YouTube Shorts. Pipeline: trend collection → script → images → Veo video → TTS narration → Remotion composition → YouTube upload.
Generate images using Nano Banana.
Create diagrams and visualizations using Mermaid syntax. Use when generating flowcharts, sequence diagrams, class diagrams, entity-relationship diagrams, Gantt charts, or any visual documentation. Triggers on Mermaid, flowchart, sequence diagram, class diagram, ER diagram, Gantt chart, diagram, visualization.
Use when the user asks to generate, remix, poll, list, download, or delete Sora videos via OpenAI’s video API using the bundled CLI (`scripts/sora.py`), including requests like “generate AI video,” “Sora,” “video remix,” “download video/thumbnail/spritesheet,” and batch video generation; requires `OPENAI_API_KEY` and Sora API access.
Use the Meshy.ai REST API to generate assets: (1) text-to-2D (Meshy Text to Image) and (2) image-to-3D, then download outputs locally. Use when the user wants Meshy generations, needs polling async tasks, and especially when they want the resulting OBJ saved to disk. Requires MESHY_API_KEY in the environment.
Recommend suitable prompts from 10,000+ Nano Banana Pro image generation prompts based on user needs. Optimized for Nano Banana Pro (Gemini), but prompts also work with Nano Banana 2, Seedream 5.0, GPT Image 1.5, Midjourney, DALL-E, Flux, Stable Diffusion, and any text-to-image AI model. Use this skill when users want to: generate images with AI (any model, such as Nano Banana Pro, Gemini, GPT Image, or Seedream); find proven AI image generation prompts and prompt templates; get prompt recommendations for specific use cases (portraits, products, social media, posters, etc.); create illustrations for articles, videos, podcasts, or marketing content; browse a curated prompt library with sample images; translate and understand prompt techniques. Also available: the "ai-image-prompts" skill, a model-agnostic version of this library for universal image generation.
Create algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use when users request art created with code, generative art, algorithmic art, flow fields, or particle systems.