Reinforcement Learning Brains

Generative AI is often "confidently wrong" because it lacks a feedback loop. Using the "Reinforcement Learning (RL) Brain" methodology, I will show how to build local reward models that train agents on your team's specific PR review standards. This talk moves from basic inference to behavioral fine-tuning, so that AI-generated code doesn't just "work" but also supports your team's long-term architectural health.

Hrushikesh Pokala

Sr Software Engineer Lead at Equifax

St. Louis, Missouri, United States

