Session

Using GPT Visual Capabilities to Solve a Wordle Puzzle

The visual capabilities of GPT-4 open up new scenarios for multimodal models. In this session, we will explore what the model can do. Rather than showing only a polished final demo, I will walk you through my entire journey of using the model to solve Wordle puzzles, starting with "Hello World". Along the way, you will gain a good understanding of the model's capabilities and learn the prompt engineering techniques that drove progress (as well as what didn't work!). We'll close with a live demo that attempts to solve today's Wordle! The problem is a fun one, but the underlying prompt engineering techniques for image understanding apply to a wide variety of business problems.

Jennifer Marsman

Microsoft, Principal Engineer for the CTO
