YOW! Perth 2023

Tuesday Sep 12
11:00 – 11:50

Building Owly an AI Comic Video Generator For My Son

Using a large language model (LLM) on Amazon Bedrock and a fine-tuned Stable Diffusion 2.1 model on Amazon SageMaker JumpStart, I developed Owly, an AI tool that crafts personalised comic videos with music, starring my son’s toys as the lead characters. I will take you through the process, including (but not limited to) how I used prompt engineering to have the LLM generate the story script and how I fine-tuned the image model to generate images of new characters.

artificial intelligence (AI)
large language models (LLM)
game
prompt engineering