Speaker

Akhilesh Gupta

Principal Engineer at LinkedIn

Mountain View, California, United States

Akhilesh is the technical lead for the system infrastructure behind the LinkedIn Feed. Before this, he led engineering for Ride Experience at Uber. He holds a Master's degree in CS from Stanford University.

Area of Expertise

  • Information & Communications Technology

Topics

  • Distributed Systems
  • Technical Leadership
  • Artificial Intelligence
  • Large Language Models (LLMs)
  • Generative AI
  • Machine Learning

Feed Forward: The Infra behind LinkedIn’s LLM powered content recommendation system

This year at LinkedIn, we re-engineered our Feed recommendation system to be powered by a Large Language Model. Beyond fine-tuning the model itself for this specialized task, we solved many infrastructure challenges to create a lightning-fast, responsive experience where the Feed quickly adapts to deliver value to you as a professional looking to expand your horizons and get better at what you do.
- Learn how we generate user prompts that capture user profiles and their recent activity within seconds
- Learn how we generate item prompts for videos, articles, images, and text posted by users, along with their interaction history, to help the model understand the content pool and its dynamic popularity among user cohorts
- Learn how we use a fine-tuned LLM as a retriever to generate user and item embeddings in a dual encoder configuration to serve tens of thousands of kNN queries/s on a corpus of hundreds of millions of items with sub-50ms latency
- Learn how we use the same LLM as a ranker in a cross-encoder configuration to rank the retrieved items, with optimizations such as running member context prefill in parallel with item retrieval to reduce online serving latency with a minimal GPU footprint
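
The retrieve-then-rank flow in the last two points can be sketched in a few lines. This is only a toy illustration under loud assumptions, not LinkedIn's implementation: `toy_encode` is a bag-of-words stand-in for the fine-tuned LLM encoder, `retrieve` is a brute-force scan standing in for a real kNN index, `cross_score` is a token-overlap stand-in for the cross-encoder, and `prefill_member_context` merely mimics the idea of doing the user-side work once while retrieval runs. All function names and the sample items are hypothetical.

```python
import heapq
import math
from concurrent.futures import ThreadPoolExecutor


def toy_encode(prompt):
    """Stand-in for the LLM encoder: unit-length sparse bag-of-words vector."""
    counts = {}
    for token in prompt.lower().split():
        counts[token] = counts.get(token, 0.0) + 1.0
    norm = math.sqrt(sum(w * w for w in counts.values())) or 1.0
    return {token: w / norm for token, w in counts.items()}


def dot(a, b):
    """Dot product of two sparse vectors (cosine similarity for unit vectors)."""
    return sum(w * b.get(token, 0.0) for token, w in a.items())


def retrieve(user_vec, item_index, k):
    """Dual-encoder retrieval: brute-force top-k over precomputed item vectors."""
    return heapq.nlargest(k, item_index, key=lambda i: dot(user_vec, item_index[i]))


def prefill_member_context(user_prompt):
    """Stand-in for member context prefill: process the user prompt once."""
    return frozenset(user_prompt.lower().split())


def cross_score(member_ctx, item_prompt):
    """Stand-in for the cross-encoder: scores user and item jointly (Jaccard)."""
    item_tokens = set(item_prompt.lower().split())
    union = member_ctx | item_tokens
    return len(member_ctx & item_tokens) / len(union) if union else 0.0


def recommend(user_prompt, items, item_index, k=2):
    """Run prefill and retrieval concurrently, then rank the candidates."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        ctx_future = pool.submit(prefill_member_context, user_prompt)
        ids_future = pool.submit(retrieve, toy_encode(user_prompt), item_index, k)
        member_ctx = ctx_future.result()
        candidate_ids = ids_future.result()
    return sorted(candidate_ids,
                  key=lambda i: cross_score(member_ctx, items[i]),
                  reverse=True)


# Hypothetical item prompts; item vectors are computed offline, ahead of queries.
ITEMS = {
    "a": "distributed systems at scale",
    "b": "cooking pasta recipes",
    "c": "scaling distributed databases",
    "d": "gardening tips",
}
ITEM_INDEX = {item_id: toy_encode(text) for item_id, text in ITEMS.items()}

print(recommend("engineer interested in distributed systems", ITEMS, ITEM_INDEX))
```

The point of the split is that item vectors can be computed ahead of time and searched cheaply, while the more expensive joint scoring only touches the k retrieved candidates. Starting the user-side work before the candidates are known is what lets it overlap with retrieval.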

In this talk, I will do a technical deep-dive into how we solved the challenges above with simple and elegant system infrastructure that can harness the power of modern LLMs for large-scale recommendation systems.

QCon SF 2022

The Secret to Finding Impactful Projects to Land a Staff-Plus Engineer Role

October 2022 San Francisco, California, United States

QCon London 2020

Streaming a Million Likes/Second: Real-Time Interactions on Live Video

March 2020 London, United Kingdom

Codemotion Milan 2019

October 2019 Milan, Italy

Mobile Era Oslo 2017

Realtime Content Delivery: Powering dynamic instant experiences on your mobile apps

October 2017 Oslo, Norway
