Shubh Saxena

Builds ML and AI engineering systems for production. Personal portfolio of a machine learning engineer building ML, AI engineering, MLOps/DevOps, and cloud platforms that turn research-grade models into monitored, secure, deployable systems.

View Projects Resume

Scroll to explore

I work across the full stack of applied ML and AI engineering: data and streaming systems, model serving, evaluation harnesses, security gates, observability, and cloud deployment. My projects are built like production systems, not demos: containerized services, reproducible ML workflows, and CI pipelines that test both behavior and safety.

Experience

Recent Work

Remote MLE Intern · Decompute

Apr 2026 - Jun 2026 · Cupertino, CA

Reduced routing errors by 50% with a FastAPI smart router using LLM intent classification and state-preserved task frames; improved system resilience by 60% with reverse proxy security, rate limiting, circuit breakers, queues, and fail-closed Stripe entitlement checks.

DevOps Intern · DoMS, IIT Roorkee

Feb 2026 - Mar 2026

Reduced manual compliance lookups by 80% and automated 70% of manual verification steps by building a production platform for a Government IPR initiative with natural-language search and containerized CI/CD.

DevOps Intern · HCLTech

May 2025 - Jul 2025

Saved $55K+ annually with a 96% infrastructure cost reduction by consolidating NAT Gateways via Terraform; cut release cycles by 40% with Jenkins to ArgoCD GitOps and Trivy/SonarQube quality gates.

Selected Projects

Systems I Have Built

Star Project · Swarm AI GitHub

Aegis Swarm

Production-grade autonomous drone fleet AI platform combining real-time telemetry, GPU vision inference, stream analytics, LangGraph mission coordination, MCP policy gates, and DevSecOps automation.

50-100 telemetry messages/sec sustained across a 5-drone swarm
~45ms P50 detection latency with Ray Serve, YOLO, SAHI, and tracking
P99 ~350ms telemetry-to-command loop with Flink and Redpanda
Hardened LLM-to-actuator path with deterministic MCP policy gates

Agent Evaluation GitHub

Autonomous Agent Evaluation & Orchestration Framework

Cut manual agent evaluation overhead by 4x and increased throughput by 3x with a FastAPI + WebSocket orchestration server that models Terminal-Bench tasks as an OpenEnv-compliant RL environment.

Document AI GitHub

Intelligent Document AI Pipeline

Secured 93%+ processing accuracy at a sub-$0.01/document cost target by building a containerized OCR ensemble with selective SLM/VLM adjudication and isolated Docker microservices.

RAG GitHub

Cascade Intelligence

Reduced unnecessary LLM usage with a 3-tier confidence cascade and improved response trust using PII scrubbing, HyDE, BM25 + FAISS, cross-encoder reranking, abstention gates, and citation enforcement.

Publication

Published Research

Zenodo Record · 2026

MACE-RL: Meta-Adaptive Curiosity-Driven Exploration with Episodic Memory in RL Environments

This paper proposes an RL exploration method that adapts intrinsic curiosity rewards over time and uses episodic memory to reduce repeated exploration of already-visited states. The approach targets sparse-reward environments where agents need a stronger signal for novelty, memory, and long-horizon discovery.

Zenodo PDF

Toolbox

Technical Skills

Cloud & DevOps

AWS, Kubernetes, Docker, Terraform, ArgoCD, Jenkins, GitHub Actions

MLOps & Data

MLflow, DVC, Ray Serve, Redpanda, Apache Flink, LangGraph, LangSmith, MCP

Security & Languages

Trivy, OWASP ZAP, SonarQube, Falco, Python, Bash, YAML, PyTorch

Recognition

Honors & Awards

Semi-Finalist · Pan-IIT AI/ML Hackathon powered by IDFC First Bank, 2026

National Rank 4 · Launch Pad 2025, Pan-India Case Competition at SRCC

Contact

Let’s build something that survives production.

Email LinkedIn GitHub Resume