OpenStory

OpenStory integrates with a wide range of AI models across four categories: script analysis, image generation, motion/video generation, and music/audio generation. All media models are accessed via Fal.ai, while script analysis uses OpenRouter.

Script Analysis Models

These LLM models analyze your script, extract scenes, characters, and locations, and generate prompts. You can select multiple models to generate parallel sequences for comparison.

Model	Provider	Context Window	License
Grok 4.1 Fast	xAI	2M tokens	Proprietary
Claude Sonnet 4.6	Anthropic	1M tokens	Proprietary
Grok 4.2	xAI	2M tokens	Proprietary
Claude Opus 4.6	Anthropic	1M tokens	Proprietary
Mistral Small 4	Mistral	262K tokens	Open Source (Apache 2.0)
DeepSeek V3.2	DeepSeek	164K tokens	Open Source (MIT)
GLM-5	Z.ai	203K tokens	Open Source (MIT)
Gemini 3.1 Pro	Google	1M tokens	Proprietary
GPT-5.4	OpenAI	1M tokens	Proprietary
Gemini 3 Flash	Google	1M tokens	Proprietary
GPT-5.4 Mini	OpenAI	400K tokens	Proprietary
Seed 2.0 Mini	ByteDance	262K tokens	Proprietary
GPT-5.4 Nano	OpenAI	400K tokens	Proprietary

Image Generation Models

These models create the visual images for each scene. You can select multiple models to generate variant images for comparison.

Model	Provider	License	Notes
Nano Banana 2	Google	Proprietary	Fast generation and editing (default)
Nano Banana Pro	Google	Proprietary	Enhanced realism and typography
Grok Imagine Image	Grok	Proprietary	Aesthetic with low censoring
FLUX.2 Max	Black Forest Labs	Proprietary	Exceptional realism
Phota	Phota	Proprietary	Character consistency via profiles
Hunyuan Image v3	Tencent	Open Source	Strong composition
FLUX.2 Dev	Black Forest Labs	Open Source	32B open weights with native editing
Qwen Image 2 Pro	Alibaba	Open Source (Apache 2.0)	Native 2K, text rendering
HiDream I1	HiDream	Open Source (MIT)	17B parameters
Seedream 5	ByteDance	Proprietary	Unified generation and editing

Edit Endpoints

Most image models support reference image editing via dedicated edit endpoints. This allows the AI to use character and location reference images when generating scenes, improving visual consistency.

Motion/Video Models

These models animate still images into video clips.

Model	Provider	Est. Time	License	Notes
LTX 2.3 Pro	Lightricks	~15s	Open Source	Best quality ranking
Veo 3.1	Google	~25s	Proprietary	20K max prompt length
Kling v3 Pro	Kling	~20s	Proprietary	Default model
Grok Imagine Video	Grok	~20s	Proprietary
MiniMax Hailuo 02	MiniMax	~15s	Proprietary
Seedance 1.5 Pro	ByteDance	~12s	Proprietary	4K max prompt
Seedance 2	ByteDance	~20s	Proprietary	Animation styles only

Aspect Ratio Compatibility

Not all motion models support all aspect ratios. OpenStory automatically filters to show only compatible models and will switch to a compatible default if your current model doesn't support the selected ratio.

Audio Support

Some motion models can generate audio alongside video. OpenStory checks each model's capabilities to determine audio support.

Music & Audio Models

Model	Provider	Max Duration	Type	License
ElevenLabs Music	ElevenLabs	600s (10 min)	Music	Proprietary
MiniMax Music v2	MiniMax	300s (5 min)	Music	Proprietary
ACE-Step 1.5	ACE Studio	240s (4 min)	Music	Open Source
Lyria 2	Google	30s	Music	Proprietary
MMAudio V2	MMAudio	8s	SFX	Open Source
ElevenLabs SFX	ElevenLabs	22s	SFX	Proprietary

Music vs. Sound Effects

Music models generate background music tracks from text prompts and optional tags. SFX models generate short sound effects — MMAudio V2 is unique in that it can generate audio from video input (video-to-audio).

Capabilities

Feature	ElevenLabs Music	MiniMax v2	ACE-Step	Lyria 2
Prompt-based	Yes	Yes	Yes	Yes
Lyrics support	No	Yes	Yes	No
Instrumental	Yes	Yes	Yes	Yes
Long-form	Yes (10 min)	Yes (5 min)	Yes (4 min)	No (30s)