YOW! Perth 2023

Building Owly, an AI Comic Video Generator for My Son

Utilising an OpenAI GPT-3.5 large language model and a fine-tuned Stable Diffusion 2.1 model on Amazon SageMaker JumpStart, I developed an AI tool called Owly that crafts personalised comic videos with music, starring my son’s toys as the lead characters. I will take you through the process, including (but not limited to) how I utilised GPT to generate the story script via prompt engineering and how I fine-tuned the Stable Diffusion model to generate images with new characters.