Speaker

Shay Lin

Shay Lin

Software Engineer, Confluent

San Francisco, California, United States

Actions

Shay Lin is experienced in event-driven microservices and stream processing frameworks. Shay works at Confluent’s data team, focusing on real-time analytics and data infrastructure. Before Confluent, she built systems at scale for payment risk and fraud detection.

Area of Expertise

  • Finance & Banking
  • Information & Communications Technology

Topics

  • stream processing
  • Realtime Analytics
  • Decision Engine
  • Fintech
  • Risk Management

Streaming Aggregation of Cloud Scale Telemetry

How does one serve telemetry metrics at cloud scale? At Confluent, raw telemetry is flowing in at 5 million metrics per second. Not only is the storage expensive- often with extended retention to meet compliance requirements, but the computational cost of aggregation can also skyrocket in a pull model. In the pull model, metrics consumers like data science and billing used to query metrics from the OLAP data stores on demand, which created inconsistencies over time. This session will showcase how we switched to a push model for telemetry analytics, and tackled these challenges with Kafka Streams and Apache Druid.

You will walk away with an understanding of:
* Architecture choices for real-time aggregation
* Time semantics, and handling out-of-order events
* Partition and autoscaling story of the streaming platform

Shay Lin

Software Engineer, Confluent

San Francisco, California, United States

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top