Search for a command to run...
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models