Sora提示词:Vertical smartphone video, 9:16, ul

Vertical smartphone video, 9:16, ultra-realistic, no subtitles or text.
Outdoor parking lot with natural lighting. An American man holds the exact same 3-layer transparent retainer clip box from the reference image. The box must match perfectly: same layout, same compartments, same beige and black clip types, same quantity feel.

Close-up handheld phone shot of the man lifting the box toward the camera. Autofocus shifts on the clips inside the clear compartments, showing all details without distortion.

Cut to a top-down phone angle: he opens one drawer, revealing the black and beige clips neatly sorted exactly like the reference. No changes to shapes, colors, or sizes.

Close-up: he selects a clip, then walks to the car fender.
Handheld shot as he uses the clip to replace a missing fastener on the bumper area.

Macro shot: the clip presses in with a clear “click,” secure and tight.

Final shot: the man holds up the full 3-layer clip box beside the repaired bumper, giving a small nod of approval. Natural shaky smartphone realism, no cinematic effects.


? 社区反馈(2025年11月28日14点37分更新)
“建议在俯拍镜头中明确标注每层抽屉开启顺序,避免观众误判第二层的 beige 夹片数量偏移;另外宏观‘咔哒’声应增加频谱波形提示真实感” —— @AICraftMaster_09
✅ 已优化:2025年11月28日14点37分更新
← 上一篇 下一篇 →