NJ
Nehil Jain
Member of Technical Staff @ Anyscale
SF Bay Area

Skills

AI and Machine Learning
Agentic Applications
RAG
LLM Serving
Finetuning
Forecasting
MLOps
Data Engineering and Infrastructure
Python
SQL
Ray
Spark
Dagster
Airflow
MLFlow
Snowflake
Databricks
dbt
Cloud and Distributed Systems
AWS
Azure
GCP
Kubernetes
Docker
A100/H100 GPUs

Experience

Member of Technical Staff

Anyscale

Sep 2025 - Present

Technical consulting for Fortune 500 AI infrastructure on Ray. Drive customer demos, POCs, and expansions for accounts like PayPal, Notion, Instacart, and Rivian. Shipped Turbopuffer DataSink connector to Ray OSS (PR #58910). Drove 6x contract renewal at Notion ($40K to $250K).

Founder & CEO

DemoDrive AI

May 2024 - Aug 2025

Built AI-powered video editor for DevTools teams from zero-to-one. Created 120+ automated videos for 5 pilot customers, reducing content creation time by 70%. Won 4 hackathons including MongoDB GenAI ($2K) and Luma AI (solo win).

Principal AI Engineer

QuantumBlack, McKinsey

Nov 2022 - Apr 2024

Led AI engineering for Fortune 500 clients. Built LLM RAG system saving ~$5M/year for insurance client, delivered $29M EBITDA impact in CPG supply chain, and managed 9-person team reducing mining carbon footprint by 4%.

Senior AI Engineer II

QuantumBlack, McKinsey

Jan 2021 - Oct 2022

Built churn prediction models increasing customer win-back by 11% QoQ. Reduced pipeline runtime from 2 hours to 4 minutes (96% improvement) through incremental processing.

Tech Lead - Data

Super.com

Jan 2019 - Nov 2020

Led 12-person team building unified data platform processing 5M+ daily events. Pivoted company to profitability in 3 months during COVID. Built smart bidding system improving ROAS by 20%.

Founding Engineer - Data

Super.com

Jul 2016 - Dec 2018

Built scalable event pipeline (5M events/day) and location recognition model (F1: 0.96). Contributed to 22% YoY revenue uplift.

Co-Founder

Athletigen

Apr 2013 - Jul 2016

Co-founded biotech startup combining genomics and AI for elite athletes. Scaled to 16,000 reports, built 8-person engineering team, and designed genetic analytics pipeline on Spark and AWS.

Education

BITS Pilani University

MS in Mathematics & B.E. in Electronics

Pilani, Rajasthan, India

2008 - 2013

Projects

Turbopuffer DataSink Connector for Ray Data

AI Infrastructure

Built and shipped production-grade vector database connector to Ray OSS (PR #58910), enabling streaming writes from Ray Data pipelines to Turbopuffer. Solved complex memory optimization using sort+slice over dictionary accumulation for zero additional allocation.

Petabyte-Scale Robotics Data Pipeline

Robotics

Designed Ray Data pipeline for autonomous systems processing 3+ petabytes of sensor data. Direct MCAP-to-tensor pipeline with on-the-fly H265 decoding, eliminating multi-day ETL bottlenecks.

DemoDrive - AI Video Editor for DevRel

DevTools

Built AI-powered video editor from scratch with AI agents as first-class citizens. Created 120+ automated videos for 5 pilot customers.

View All Projects →