Sebastiano Galazzo

Information & Communications Technology

Artificial Intelligence, Machine Learning, Machine Learning and AI

Milan, Lombardy, Italy

Deep dive on creating a photorealistic talking avatar

Creating a photorealistic avatar that speaks any sentence, starting from written input text.

Focusing on autoencoders, we will take a journey from the beginning of the speaker's experience, through the mistakes made and the tips learned along the way.
The session will cover:

- Intro: the timeline from the beginning to the present day
- This is NOT a deepfake
- Audio processing techniques: STFT (Short-Time Fourier Transform), mel spectrograms and custom solutions
- Deep learning models and architecture
- The technique, inspired by inpainting, used to animate the mouth
- Masks and convolution
- Landmarks extraction
- Morphing animation technique based on autoencoder features
- Microsoft Azure Speech services used to support audio and animation processing
- Putting it all together
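
As a rough illustration of the audio processing item above, the following is a minimal sketch of an STFT magnitude spectrogram followed by a mel filterbank, written with NumPy only. It is not the speaker's pipeline: the function names, the frame length, hop size, number of mel bands, and the synthetic 440 Hz test tone are all assumptions chosen for the example, and the "custom solutions" mentioned in the abstract are not covered.

```python
import numpy as np

def stft_magnitude(signal, frame_len=512, hop=128):
    """Magnitude spectrogram from a Hann-windowed short-time Fourier transform."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop: i * hop + frame_len] * window
                       for i in range(n_frames)])
    # One row per frame, one column per frequency bin.
    return np.abs(np.fft.rfft(frames, axis=1))

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_mels, frame_len, sample_rate):
    """Triangular filters whose centers are evenly spaced on the mel scale."""
    n_bins = frame_len // 2 + 1
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    bin_freqs = np.linspace(0.0, sample_rate / 2, n_bins)
    bank = np.zeros((n_mels, n_bins))
    for m in range(n_mels):
        left, center, right = hz_points[m], hz_points[m + 1], hz_points[m + 2]
        rising = (bin_freqs - left) / (center - left)
        falling = (right - bin_freqs) / (right - center)
        bank[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

# Hypothetical input: one second of a 440 Hz tone sampled at 16 kHz.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)

spec = stft_magnitude(tone)                             # (frames, frequency bins)
mel_spec = spec @ mel_filterbank(40, 512, sample_rate).T  # (frames, mel bands)
log_mel = np.log(mel_spec + 1e-6)                       # log compression, as commonly fed to a model
```

Each row of `log_mel` describes one short slice of audio, which is the kind of per-frame feature a model could consume to drive the mouth animation; how the talk actually maps audio to frames is not stated in the abstract.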

https://1drv.ms/v/s!AvcsRrhl_mWp-r8WWjc4edmWzZ5tqQ?e=DRHaFK


Sebastiano Galazzo

Artificial intelligence researcher and proud dad

Winner of two AI awards, I have been working in AI and machine learning for 25 years, designing and developing AI and computer graphics algorithms.

I am passionate about AI, with a focus on audio, image and natural language processing, as well as predictive analysis.
I have received several national and international awards that recognize my work and contributions in these areas.

As a Microsoft MVP in the Artificial Intelligence category, I have had the pleasure of being a guest speaker at national and international events, including:

- Microsoft Ignite Tour
- Microsoft Build Insider Dev Tour
- Microsoft MVP Summit, Seattle - US
- QT Conference Berlin, Germany
- EMEA AI Saturday University of Pordenone, Italy
- EMEA C++ Conference Modena, Italy
- EMEA conferences on AI
- and more...
