Session

The unexpected journey to data cleansing

Medical data is as messy as a teenager's room. It is often unstructured and varies immensely. Even a simple analysis can become a burden: Where is the data I need? How do I make sure it’s accurate? Why are my queries taking so long?!
Designing the right solutions to answer these questions efficiently can be painfully challenging, with very few success stories along the way.

In this talk, we’ll explore the data normalization issues we faced at Aidoc and our approach to defining and breaking down this complex problem. We’ll discuss some of the solutions we tested, from materialized views to Airflow pipelines. Finally, we’ll cover the key points to consider when choosing the right solution for these questions.

Miki Segall

Software Engineering Team Lead @ NeuraLight

Tel Aviv, Israel
