Speaker

Tarun Annapareddy

Tarun Annapareddy

MTS-3 at Nutanix

Bengaluru, India

Actions

Tarun is a developer at Nutanix and worked on building and maintaining the Flow Security Central product. In addition, Tarun developed the backend for the Security Planning feature for Nutanix Cloud users, allowing users to categorize their Virtual Machines and apply security policies. This architecture includes distributed systems like Apache Pulsar, Flink, and Druid. Tarun also writes articles on various open-source software developers use daily, such as Postgres, Flink, and Temporal.

Topics

  • distributed systems
  • stream processing

Keeping on top of hybrid cloud usage with Pulsar

This presentation will cover how we force controls on an application over a hybrid cloud infrastructure built from a combination of different clouds that could include private and public clouds. For instance, you could deploy your microservice in AWS but use BigTable as your data store.
Every cloud or on-premise infrastructure provider provides monitoring, alerting, metering, audit trail etc. In a hybrid cloud use case, the IT team needs a single view of the usage across the cloud providers. Such a platform needs to combine the data sourcing of these utilities from different infrastructure providers, parse them into a common format and build an integrated data sink. Adding to it the challenge of each data source evolving its data formats, volume, velocity, throughput, latency etc. You have a challenging task to understand data from varied sources and present it in one view.

We will present an architecture that has been battle-tested in production for over five years. The components include Pulsar, Flink, PostgreSQL, Redis, Neo4J DB, rule/ML engine etc., to name a few technologies.

After this presentation, you will learn more about
1. Combining infrastructure from multiple clouds and on-premise providers to build your application.
2. Appreciate the need for lambda architecture.
3. How to stream ever-evolving multi-schema data using pulsar
4. How to write custom rules over a stream analytics framework to make your application.

Real-time Stream processing at scale

Data Ingestion and processing are the core of any tech stack. So, in this session, we will discuss various patterns for designing and implementing stream and batch processing pipelines. We will also discuss the real-world use cases at Nutanix, where we developed multiple solutions to provide Multi-Cloud governance to our customers. So, we will cover real-time stream processing of events from clouds like AWS/AZURE, etc.,

We will also present an architecture that has been battle-tested in production for over five years. The components include Apache Pulsar, Apache Flink, PostgreSQL, Redis, Neo4J DB, rule/ML engine, etc., to name a few technologies.

After this presentation, you will learn more about
1. Designing systems to handle and process both Ordered and Unordered data streams
2. Features of distributed Messaging systems that we can take advantage of while designing systems
3. Understand Stream-processing frameworks and design solutions that can process events at the scale of Billions
4. Metrics-driven development of distributed systems to keep production issues in check.
5. Good practices like Implementing heart-beat monitoring and auto-restart systems in your production, so you can take a break without worrying about outages.
6. Achieving fault tolerance while developing event-driven systems by understanding message delivery and processing guarantees.

Pulsar Summit Asia 2022 Sessionize Event

November 2022

Tarun Annapareddy

MTS-3 at Nutanix

Bengaluru, India

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top