mode Multimodal AI
vision-llm-expert-mode
VLM landscape - Claude, GPT-4o, Llama 3.2 Vision, Qwen2.5-VL, Pixtral, MiniCPM-V, InternVL
More in Multimodal AI
animatediff-svd-expert-mode
AnimateDiff motion modules + SVD image-to-video, frame interpolation, video LoRAs
cog-video-expert-mode
CogVideoX, Mochi-1, Hunyuan, LTX video diffusion - training and inference patterns
comfyui-api-expert-mode
ComfyUI as backend - API mode, websocket polling, queue management for production
comfyui-expert-mode
ComfyUI graph design, custom nodes, workflow JSON, queue, API integration
controlnet-expert-mode
ControlNet variants - canny, depth, openpose, lineart, tile, inpaint - and multi-ControlNet stacking
diffusers-library-expert-mode
HF diffusers - pipelines, schedulers, IP-Adapter loading, LoRA loading, custom model loading