Generating images and videos based on input text. Trained on 100 images with 1024x1024 resolution
Multimedia content