It accepts three kinds of input: text, images, and video. The image supplies the static subject to be animated, while the reference video supplies the motion, expression, background, or audio to be transferred. The text prompt lets you specify further details, such as actions, character movements, and camera effects.
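
To make the three roles concrete, here is a minimal Python sketch of how the inputs might be bundled before being handed to such a tool. Everything in it is an assumption made for illustration: the `AnimationRequest` class, its field names, and the accepted file extensions are hypothetical and do not come from the tool's actual interface.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import List

# Hypothetical extension lists; the real tool may accept other formats.
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}
VIDEO_EXTS = {".mp4", ".mov", ".webm"}


@dataclass
class AnimationRequest:
    """Hypothetical container for the three kinds of input."""
    image: str            # still image containing the subject to animate
    reference_video: str  # clip supplying motion, expression, background, or audio
    prompt: str = ""      # optional text: actions, character movements, camera effects

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the inputs look consistent."""
        problems = []
        if Path(self.image).suffix.lower() not in IMAGE_EXTS:
            problems.append("image must be a still image file")
        if Path(self.reference_video).suffix.lower() not in VIDEO_EXTS:
            problems.append("reference_video must be a video file")
        return problems


request = AnimationRequest(
    image="portrait.png",
    reference_video="dance.mp4",
    prompt="slow zoom in while the character waves",
)
print(request.validate())
```

The sketch only checks that each file is of the expected kind; it does not upload anything or call any real service.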