The 5-Element Frame
The prompt structure that works across major models is to write in the order subject → style → composition → lighting → details. Example: "Tokyo at sunset (subject) / ukiyo-e style (style) / diagonal overhead wide shot (composition) / warm cinematic light (lighting) / Hokusai wave patterns, cloud texture, 8 people (details)."
Subject and Object
- Narrow the lead to one (multiple subjects easily break)
- Choose concrete nouns ("30s woman, business suit" over "person")
- Add verbs/states ("brewing coffee," "concentrating")
Style Reference
- Broad categories like "photo," "watercolor," "3D render"
- Specific artist names (Midjourney references past-work URL with --sref)
- SD reuses composition with IP-Adapter / ControlNet
- Flux Kontext interactively edits like "change only the clothes to red"
Composition Keywords
- Camera: close-up, medium shot, wide shot, bird's-eye, low angle


