Shivji Kumar Jha

Staff Engineer at Nutanix, Distributed Systems & Open Source

Bengaluru, India

Actions

Shivji Kumar Jha is a Staff Engineer at Nutanix, where he leads the charter for analytics, time-series, and streaming platforms across cloud and on-prem environments. He focuses on building scalable ingestion systems and designing high-performance OLAP architectures, with deep hands-on experience extending ClickHouse for production-grade analytics. A contributor to Apache Pulsar, MySQL, and ClickHouse, Shiv approaches distributed systems as a builder—prototyping core components, shaping system design, and driving platform evolution from the ground up.

Beyond engineering, Shiv is an active community leader and speaker. He has delivered 30+ talks at conferences including ApacheCon, KubeCon + CloudNativeCon, and Postgres community events. He also curates deep-tech meetups in Bangalore, bringing together practitioners passionate about databases, streaming systems, and open-source infrastructure

Area of Expertise

Environment & Cleantech
Health & Medical
Information & Communications Technology
Travel & Tourism

Topics

AWS Databases
Azure Data Platform
All things data
AWS Data
Streaming
Data Streaming
stream processing
Event Streaming

Structured Event Streaming with Avro and Pulsar

Type safety is extremely important in an event streaming pipeline. Type definition and evolution can either be built in the application or relied upon the data layer to support it out of the box allowing the application to only concentrate on business logic, not how of data store and evolution.
Apache pulsar offers server as well as client side support for the structured streaming. We have been using pulsar for asynchronous communication among microservices in our nutanix beam app for over an year in production.
This talk presents the technical details on what is available in the apache pulsar server and client side, how we have used pulsar's schema support to build our use cases and our learnings from them.

Streaming app changes to Event Store

In this talk we will go through different use cases and design patterns (CDC, events, outbox pattern etc) for moving data from your web. services (and Databases) to an event store for analytics and real time computation. We will also discuss how this can be done in a type safe ways and the good, bad and ugly of each design choice.

Structured Data Stream with Pulsar

Type safety is extremely important in any application built around a message bus like Pulsar. Type definition and evolution can either be built in the application or relied upon the data layer to support it out of the box allowing the application to only concentrate on business logic, not how of data store and evolution. Apache pulsar offers server as well as client side support for the structured streaming.
We have been using pulsar for asynchronous communication among microservices in our nutanix beam app for over an year in production.
This talk presents the technical details on what is available in the apache pulsar server and client side, how we have used pulsar's schema support to build our use cases and our learnings from them.

Running a secure Apache pulsar cluster

A Pulsar instance consists of one or more Pulsar clusters. Clusters, in turn, consist of a broker cluster, a bookkeeper cluster and a zookeeper cluster. While this provides a modular and flexible design, there is a lot of bytes flowing over the network. In an application that is paranoid about security that is a lot of connections, both intra cluster and inter-cluster, to secure. At nutanix we have deploy pulsar in a secure environment and run it through a lot of security audits for infrastructure certifications.

In this talks, we will go over how we set up authentication and authorization on all the network communications to make our pulsar deployment secure.

How Pulsar stores your data

In order to leverage the best performance characters of your stream backend, it is important to understand the nitty gritty details of how pulsar stores your data. Understanding this empowers you to design your use case solutioning so as to make the best use of resources at hand as well as get the optimum amount of consistency, availability, latency and throughput for a given amount of resources at hand.

With this underlying philosophy, in this talk, we will get to the bottom of storage tier of pulsar (apache bookkeeper), the barebones of the bookkeeper storage semantics, how it is used in different use cases ( even other than pulsar), understand the object models of storage in pulsar, different kinds of data structures and algorithms pulsar uses therein and how that maps to the semantics of the storage class shipped with pulsar by default. Oh yes, you can change the storage backend too with some additional code!

This session will empower you with the right background to map your data right with pulsar.

EventSourcing Live 2021 Sessionize Event

October 2021

Pulsar Virtual Summit North America 2021 Sessionize Event

June 2021

Pulsar Summit North America 2021

How Pulsar stores your data
Abstract: https://www.na2021.pulsar-summit.org/all-talks/how-pulsar-stores-your-data
Slides: https://www.slideshare.net/streamnative/how-pulsar-stores-your-data-pulsar-summit-na-2021
Video: https://www.youtube.com/watch?v=xIibbB5Y0MM

June 2021

Pulsar Summit Asia 2020 Sessionize Event

November 2020

MySQL Developers Day 2014

MySQL Replication and Scalability
Slides : https://www.slideshare.net/shiv4289/my-sql-replicationscalability

April 2014 Bengaluru, India

MySQL User Camp 2014

Global Transaction Identifiers in MySQL Replication
Slides: https://www.slideshare.net/shiv4289/mysql-user-camp-gtids

March 2014 Bengaluru, India

MySQL User Camp 2013

Multi-threaded Slaves in MySQL Replication
Slides: https://www.slideshare.net/shiv4289/mysql-user-campmultithreadedslaves

November 2013

Open Source India 2013

MySQL: Multi-Source Replication
Slide : https://www.slideshare.net/shiv4289/my-sql-labs-multi-source-replication
Blog : https://mysqlhighavailability.com/mysql-5-7-6-introducing-multi-source-replication/

November 2013 Bengaluru, India

Shivji Kumar Jha

Staff Engineer at Nutanix, Distributed Systems & Open Source

Bengaluru, India

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Speaker

Shivji Kumar Jha

Actions

Links

Area of Expertise

Topics

Sessions

Structured Event Streaming with Avro and Pulsar

Streaming app changes to Event Store

Structured Data Stream with Pulsar

Running a secure Apache pulsar cluster

How Pulsar stores your data

Events

EventSourcing Live 2021 Sessionize Event

Pulsar Virtual Summit North America 2021 Sessionize Event

Pulsar Summit North America 2021

Pulsar Summit Asia 2020 Sessionize Event

StreamNative Webinar Series

FOSSASIA 2015

Open Source India 2014

MySQL Developers Day 2014

MySQL User Camp 2014

MySQL User Camp 2013

Open Source India 2013

Shivji Kumar Jha

Links

Actions