Automated Red-Teaming: Engineering Evaluation Pipelines for Clinical GenAI.

How to measure "hallucinations" in medical advice, for example by using 'LLM-as-a-Judge' to grade responses against golden datasets before a model goes to production.
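As a rough illustration of the grading loop the abstract describes, the sketch below scores a model against a small golden dataset and applies a release gate. It is not the speaker's pipeline. The names `GoldenCase`, `toy_judge`, `toy_model`, `hallucination_rate`, and `passes_gate`, along with the threshold value, are all hypothetical, and `toy_judge` is a simple substring check standing in for a real judge-model call.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class GoldenCase:
    """One golden-dataset entry: a question and its vetted reference answer."""
    question: str
    reference: str


# A judge returns True when the candidate answer is supported by the reference.
Judge = Callable[[str, str, str], bool]
Model = Callable[[str], str]


def toy_judge(question: str, reference: str, candidate: str) -> bool:
    # ASSUMPTION: placeholder for an LLM-as-a-Judge call. A real judge would
    # prompt a second model; here we only check that the reference text
    # appears in the candidate, case-insensitively.
    return reference.lower() in candidate.lower()


def hallucination_rate(cases: List[GoldenCase], model: Model, judge: Judge) -> float:
    """Fraction of golden cases where the judge rejects the model's answer."""
    if not cases:
        raise ValueError("golden dataset is empty")
    failures = sum(
        1 for c in cases if not judge(c.question, c.reference, model(c.question))
    )
    return failures / len(cases)


def passes_gate(rate: float, max_rate: float) -> bool:
    """Release gate: allow promotion only if the rate is within the limit."""
    return rate <= max_rate


# Hypothetical placeholder data; no real clinical content.
cases = [
    GoldenCase("question A", "reference answer A"),
    GoldenCase("question B", "reference answer B"),
]


def toy_model(question: str) -> str:
    # Stand-in for the model under test: supported on A, unsupported on B.
    answers = {
        "question A": "The reference answer A applies.",
        "question B": "Something unsupported.",
    }
    return answers[question]


rate = hallucination_rate(cases, toy_model, toy_judge)
ready = passes_gate(rate, max_rate=0.1)  # the 0.1 limit is an arbitrary example
```

Keeping the judge as a plain callable means the substring stub can be swapped for a real judge-model call without changing the scoring or gating code.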

Priyadarshni Natarajan

Technical Fellow - Walmart

