Call for Papers

in 4 months

PyTorch Conference North America 2026 - POSTER SESSIONS

event starts

20 Oct 2026

event ends

21 Oct 2026

location

San Jose Convention Center San Jose, California, United States


Join us in San Jose, CA, October 20-21 for PyTorch Conference North America 2026. This two-day event hosted by the PyTorch Foundation gathers top-tier AI pioneers, researchers, and developers to explore the future of open source AI and the impact of PyTorch Foundation projects like PyTorch, vLLM, DeepSpeed, and Ray.

PyTorch Conference features in-depth technical talks, hands-on workshops, and candid conversations spanning the full AI stack, from bare metal infrastructure to applications and agent-based systems. The program features keynote sessions from leading voices in AI and practical deep dives on training, inference, GenAI, and focused tracks on responsible AI & compliance, security & privacy, frameworks & compilers, and more.
open, 34 days left
Call for Papers
Call opens at 12:00 AM

31 Mar 2026

Call closes at 11:59 PM

26 Jul 2026

Call closes in Pacific Daylight Time (UTC-07:00) timezone.
Closing time in your timezone () is .

CFP GUIDE

Please review our CFP Guide to answer many common questions before submitting.

PLEASE NOTE - Breakout Sessions and Poster Sessions are separate CFP’s for PyTorch North America 2026. To submit a talk for a breakout session please apply separately here

DATES TO REMEMBER:

  • Poster Session CFP Close: Sunday, July 26  at 11:59 pm PDT (UTC +-7)
  • Poster Session Notifications: Monday, August 17, 2026 
  • Poster Sessions Announced: Tuesday, August 18, 2026
  • Event Dates: Tuesday, October 20 - Wednesday, October 21, 2026

Reminder: This is a community event — so no product and/or vendor sales pitches.

CODE OF CONDUCT

By submitting, you agree to The Linux Foundation's Code of Conduct.

COMMITMENT TO INCLUSIVITY

Please review The Linux Foundation's Inclusive Speaker Orientation and Inclusive Language Initiative.

Please be mindful that The Linux Foundation does not allow any talk with 3 or more speakers to only have men participating in an effort to increase equity & inclusion.

PRIVACY POLICY

At The Linux Foundation, we are committed to safeguarding your privacy. Sensitive speaker information will only be accessible to event organizers and program committee members who adhere to the highest confidentiality standards. Rest assured that your information will never be sold or shared beyond these parties.

Speaker personal information including name, company, job title, biography and photo will appear on the public schedule. For information on our privacy practices and commitment to protecting your privacy, please review our Privacy Policy.

You can update or delete your account information at any time through your Sessionize profile account settings. Please contact support@sessionize.com directly for any questions or issues. 

QUESTIONS?

Question about submitting a proposal? Contact us at cfp@linuxfoundation.org.  


all submitted sessions

publicly listed on this page

Login with your preferred account


If you haven't logged in before, you'll be able to register.

Using social networks to login is faster and simpler, but if you prefer username/password account - use Classic Login.

38 submissions
Submitted sessions
Chirabrata Senapati
  • Adaptive Consensus for Low-Latency Distributed Systems
  • Continuous Causal Inference for Real-Time Advertising Measurement
Vishal Shah
  • Quantize-Then-Refine: Two-Stage Scoring for Memory-Efficient GPU Retrieval
Jeffrey Mahou
  • From Scheduled to Real Overlap: All-Gather/Reduce-Scatter in PyTorch Distributed
Andrew Bond
  • Moral Tensors and DecisionProofs: Compiling Language into an Auditable, Grounded Safety Layer
  • Atlas + Erebus: A Personal AI Research Platform on Two Prosumer GPUs
Abhilekh Verma
  • Building Global Allies: How Male Mentors Can Accelerate Women in AI & Startups
Muniker Aragon
  • VLA Models on Intel Arc Pro: Enabling Real-Time Robot Control with PyTorch XPU and OpenVINO
Rahul Unnikrishnan Nair, MinSung Kim
  • Stage-Specialized E-PD Disaggregation for SLO-Aware Multimodal Serving on Intel Arc Pro B50/B70
Isha Narula
  • AI in Analytics
Ivan Potapov
  • From PyTorch to a Phone: What Quantizes Cleanly On-Device, and What Doesn't
Radu Salavat, Nikhil Gupta
  • Importance Aware Attention for Faster PyTorch Inference on Arm CPUs
show all submissions
Haichen Zhang, Huamin Chen
  • Training Embedding Models Resiliently for Multimodal Model Inference Routing
Kartikaya Purohit
  • Real-Time Ranking Systems for E-Commerce Search and Discovery
Ziming Zhou
  • Before the Loss Spike: Training Alignment for Precise PyTorch Debugging at Scale
ceci lv
  • vLLM-plugin-FL -- Multi-Chip vLLM framework based on FlagOS backend
Zhipeng Wang
  • Scaling Large-scale Distributed Training with DeepSpeed using Muon Optimizer
Sanskar Prasad, Aheli Poddar
  • KernelOpt: Multi-Agent GPU Kernel Optimizaton
Kaoutar El Maghraoui, Priyanka Naik
  • A Multi-Layer Profiling Toolkit for Out-of-Tree PyTorch Accelerators
Etai Lev Ran
  • REBAR: Routing Extensions for Burst-Aware RL
Abhishek Jain, Ashwin Sekhar
  • Efficient MoE LLM Inference on Arm with vLLM and OpenVINO
Eyal Chocron
  • Maestro: Benchmarking Concurrent GPU Operations as They Run in Real Workloads
Sohail Mohammad
  • A Practical Taxonomy of LLM Inference Bottlenecks: Prefill, Decode, Memory, and Scheduling
N Maajid Khan, Ashish Chopra
  • Improving PyTorch Efficiency on ARM with SVE-Accelerated Memory Primitives
Pranav Saji
  • weights_only Was Supposed to Save You: PyTorch Model RCE in 2026 and Safe Loading
Yidi Wu
  • Escape Hatches for torch.compile
Shuai Yang
  • CUDA Graph on Large Scale Recommender Systems
Roy Allela, Aravind Neelakantan
  • Train Across Your Entire GPU Fleet: Heterogeneous Mixed-Generation Training in Pure PyTorch
Andrew Madson
  • "Powering AI/ML with Python and Apache iceberg"
Paulo Aragao
  • Breaking torch.distributed on Purpose: Chaos Engineering for Large-Scale PyTorch Training
  • AllReduce, AllGather, or AllToAll? How Parallelism Shapes PyTorch Network Traffic
Aleksey Vlasenko
  • TPU Model Performance Auto-optimization
Rutuja Pathade
  • Diagnosing LLM Streaming Bottlenecks: Profiling TTFT, Decode Throughput, and Inference Telemetry
Shashank Agarwal
  • Building Self-Healing Infrastructure for AI Agents
Adit Modi
  • The 11-Minute Cold Start: A Visual Anatomy of Why GPU Pods Take Forever to Become Useful
Amit R
  • Gradient Ghosts: Detecting Silent Numerical Failure in Mixed-Precision Training
Manvi Gupta
  • State Persistence in Containers
Linux Foundation Events
  • The health benefits of a cheese based diet