Speaker

An Phan

An Phan

Senior Data Infrastructure Engineer @ Hippo Harvest

San Jose, California, United States

Actions

An Phan is a Senior Data Infrastructure Engineer working at the intersection of data, infrastructure, robotics, decision-making systems, and machine learning.

His work focuses on building the reliability layer beneath AI systems, where telemetry, reproducibility, and historical truth matter more than novelty. He designs architectures for sensor-rich and autonomous systems operating across cloud and edge environments, where signals are incomplete, environments change, and failures often cannot be replayed once they are lost.

Over the years, he has worked across industrial AI, manufacturing, maritime systems, and robotics, where many production failures emerged not from the model itself, but from fragmented ownership, missing telemetry, and systems that could no longer explain reality over time.

Badges

Area of Expertise

  • Information & Communications Technology

Topics

  • Software Engineering
  • Data Engineering
  • MLOps & AI Infrastructure
  • AI Engineering
  • AI & Machine Learning
  • AI & ML Solutions
  • AI Ethics

Why Most AI Platforms Break in Production

Many organizations successfully build AI proofs of concept, only to see those systems struggle once deployed in production. Models continue to run. Pipelines execute as expected. Dashboards appear healthy. Yet performance drifts, trust erodes, and teams can no longer explain what changed or why.

The root cause is rarely the model itself. More often, the problem lies in the platform and organizational architecture around data. Backfills rewrite historical datasets. Features are recomputed without clear provenance. Inference runs without traceability. Teams move quickly in isolation, and no one owns end-to-end reproducibility.

This talk reframes AI reliability as a systems and organizational design problem rather than a modeling problem. Drawing from real-world AI systems and machine telemetry pipelines, it shows how data lineage, telemetry capture, observability, and reproducibility must be treated as core platform capabilities rather than afterthoughts.

Attendees will learn how organizational structure, team boundaries, and architectural decisions interact to either preserve or undermine trust in AI systems over time. The session concludes with practical guidance on structuring teams and platforms so AI systems can scale without accumulating silent data and technical debt.

Designing a Cloud-Edge Data Backbone for Physical AI Systems

Physical AI systems in robotics, industrial automation, and other real-world environments rely on continuous telemetry from sensors, machines, and human-in-the-loop actions. Unlike cloud-native software systems, these signals represent irreversible real-world events that cannot be reconstructed if they are not captured when they occur. However, many production data pipelines still assume that data can be recomputed or backfilled, which leads to irreproducible training datasets and blind spots in debugging and drift analysis.

This session presents a production cloud-edge data architecture that treats telemetry as immutable historical truth and unifies raw sensor data, inference metadata, and operational events into a time-aligned, append-only record. The architecture separates capture correctness from downstream compute, preserves late-arriving data, and enables reproducible reconstruction of training datasets. Practical workflows for model drift debugging and historical dataset reproduction are discussed, along with trade-offs in storage cost and operational complexity.

IEEE Cloud Summit 2026 Sessionize Event Upcoming

June 2026 Washington, District of Columbia, United States

AI DevSummit + DeveloperWeek Management 2026 Sessionize Event Upcoming

May 2026 South San Francisco, California, United States

An Phan

Senior Data Infrastructure Engineer @ Hippo Harvest

San Jose, California, United States

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top