User Guide
AI Models
Complete reference of all AI models available in OpenStory
OpenStory integrates with a wide range of AI models across four categories: script analysis, image generation, motion/video generation, and music/audio generation. All media models are accessed via Fal.ai, while script analysis uses OpenRouter.
Script Analysis Models
These LLM models analyze your script, extract scenes, characters, and locations, and generate prompts. You can select multiple models to generate parallel sequences for comparison.
| Model | Provider | Context Window | License |
|---|---|---|---|
| Grok 4.1 Fast | xAI | 2M tokens | Proprietary |
| Claude Sonnet 4.6 | Anthropic | 1M tokens | Proprietary |
| Grok 4.2 | xAI | 2M tokens | Proprietary |
| Claude Opus 4.6 | Anthropic | 1M tokens | Proprietary |
| Mistral Small 4 | Mistral | 262K tokens | Open Source (Apache 2.0) |
| DeepSeek V3.2 | DeepSeek | 164K tokens | Open Source (MIT) |
| GLM-5 | Z.ai | 203K tokens | Open Source (MIT) |
| Gemini 3.1 Pro | 1M tokens | Proprietary | |
| GPT-5.4 | OpenAI | 1M tokens | Proprietary |
| Gemini 3 Flash | 1M tokens | Proprietary | |
| GPT-5.4 Mini | OpenAI | 400K tokens | Proprietary |
| Seed 2.0 Mini | ByteDance | 262K tokens | Proprietary |
| GPT-5.4 Nano | OpenAI | 400K tokens | Proprietary |
Image Generation Models
These models create the visual images for each scene. You can select multiple models to generate variant images for comparison.
| Model | Provider | License | Notes |
|---|---|---|---|
| Nano Banana 2 | Proprietary | Fast generation and editing (default) | |
| Nano Banana Pro | Proprietary | Enhanced realism and typography | |
| Grok Imagine Image | Grok | Proprietary | Aesthetic with low censoring |
| FLUX.2 Max | Black Forest Labs | Proprietary | Exceptional realism |
| Phota | Phota | Proprietary | Character consistency via profiles |
| Hunyuan Image v3 | Tencent | Open Source | Strong composition |
| FLUX.2 Dev | Black Forest Labs | Open Source | 32B open weights with native editing |
| Qwen Image 2 Pro | Alibaba | Open Source (Apache 2.0) | Native 2K, text rendering |
| HiDream I1 | HiDream | Open Source (MIT) | 17B parameters |
| Seedream 5 | ByteDance | Proprietary | Unified generation and editing |
Edit Endpoints
Most image models support reference image editing via dedicated edit endpoints. This allows the AI to use character and location reference images when generating scenes, improving visual consistency.
Motion/Video Models
These models animate still images into video clips.
| Model | Provider | Est. Time | License | Notes |
|---|---|---|---|---|
| LTX 2.3 Pro | Lightricks | ~15s | Open Source | Best quality ranking |
| Veo 3.1 | ~25s | Proprietary | 20K max prompt length | |
| Kling v3 Pro | Kling | ~20s | Proprietary | Default model |
| Grok Imagine Video | Grok | ~20s | Proprietary | |
| MiniMax Hailuo 02 | MiniMax | ~15s | Proprietary | |
| Seedance 1.5 Pro | ByteDance | ~12s | Proprietary | 4K max prompt |
| Seedance 2 | ByteDance | ~20s | Proprietary | Animation styles only |
Aspect Ratio Compatibility
Not all motion models support all aspect ratios. OpenStory automatically filters to show only compatible models and will switch to a compatible default if your current model doesn't support the selected ratio.
Audio Support
Some motion models can generate audio alongside video. OpenStory checks each model's capabilities to determine audio support.
Music & Audio Models
| Model | Provider | Max Duration | Type | License |
|---|---|---|---|---|
| ElevenLabs Music | ElevenLabs | 600s (10 min) | Music | Proprietary |
| MiniMax Music v2 | MiniMax | 300s (5 min) | Music | Proprietary |
| ACE-Step 1.5 | ACE Studio | 240s (4 min) | Music | Open Source |
| Lyria 2 | 30s | Music | Proprietary | |
| MMAudio V2 | MMAudio | 8s | SFX | Open Source |
| ElevenLabs SFX | ElevenLabs | 22s | SFX | Proprietary |
Music vs. Sound Effects
Music models generate background music tracks from text prompts and optional tags. SFX models generate short sound effects — MMAudio V2 is unique in that it can generate audio from video input (video-to-audio).
Capabilities
| Feature | ElevenLabs Music | MiniMax v2 | ACE-Step | Lyria 2 |
|---|---|---|---|---|
| Prompt-based | Yes | Yes | Yes | Yes |
| Lyrics support | No | Yes | Yes | No |
| Instrumental | Yes | Yes | Yes | Yes |
| Long-form | Yes (10 min) | Yes (5 min) | Yes (4 min) | No (30s) |