Tuesday Sep 12
11:00 –
11:50
Building Owly an AI Comic Video Generator For My Son
Slides:
This video is also available in the GOTO Play video app! Download it to enjoy offline access to our conference videos while on the move.
Utilising an Amazon Bedrock Large Language Model (LLM) and a fine-tuned Stable Diffusion 2.1 on Amazon SageMaker JumpStart, I developed an AI tech called Owly that crafts personalised comic videos with music, starring my son’s toys as the lead characters. I will take you through the process (but not limited to) how I utilised LLM to generate the story script via prompt engineering and how I fine-tuned the model to learn to generate an image with a new characters.