Shubh Saxena

Open to ML, AI engineering, MLOps/DevOps, and cloud roles

ML · AI Engineering · MLOps/DevOps · Cloud

Shubh Saxena

Builds ML and AI engineering systems for production. Personal portfolio of a machine learning engineer building ML, AI engineering, MLOps/DevOps, and cloud platforms that turn research-grade models into monitored, secure, deployable systems.

Scroll to explore

I work across the full stack of applied ML and AI engineering: data and streaming systems, model serving, evaluation harnesses, security gates, observability, and cloud deployment. My projects are built like production systems, not demos: containerized services, reproducible ML workflows, and CI pipelines that test both behavior and safety.

Experience

Recent Work

Remote MLE Intern · Decompute

Apr 2026 - Jun 2026 · Cupertino, CA

Reduced routing errors by 50% with a FastAPI smart router using LLM intent classification and state-preserved task frames; improved system resilience by 60% with reverse proxy security, rate limiting, circuit breakers, queues, and fail-closed Stripe entitlement checks.

DevOps Intern · DoMS, IIT Roorkee

Feb 2026 - Mar 2026

Reduced manual compliance lookups by 80% and automated 70% of manual verification steps by building a production platform for a Government IPR initiative with natural-language search and containerized CI/CD.

DevOps Intern · HCLTech

May 2025 - Jul 2025

Saved $55K+ annually with a 96% infrastructure cost reduction by consolidating NAT Gateways via Terraform; cut release cycles by 40% with Jenkins to ArgoCD GitOps and Trivy/SonarQube quality gates.

Selected Projects

Systems I Have Built

Agent Evaluation GitHub

Autonomous Agent Evaluation & Orchestration Framework

Cut manual agent evaluation overhead by 4x and increased throughput by 3x with a FastAPI + WebSocket orchestration server that models Terminal-Bench tasks as an OpenEnv-compliant RL environment.

  • FastAPI
  • WebSockets
  • GRPO
  • Kubernetes
Document AI GitHub

Intelligent Document AI Pipeline

Secured 93%+ processing accuracy at a sub-$0.01/document cost target by building a containerized OCR ensemble with selective SLM/VLM adjudication and isolated Docker microservices.

  • OCR
  • Docker
  • Qwen
  • YOLOv8
RAG GitHub

Cascade Intelligence

Reduced unnecessary LLM usage with a 3-tier confidence cascade and improved response trust using PII scrubbing, HyDE, BM25 + FAISS, cross-encoder reranking, abstention gates, and citation enforcement.

  • RAG
  • FAISS
  • SetFit
  • Groq

Publication

Published Research

Zenodo Record · 2026

MACE-RL: Meta-Adaptive Curiosity-Driven Exploration with Episodic Memory in RL Environments

This paper proposes an RL exploration method that adapts intrinsic curiosity rewards over time and uses episodic memory to reduce repeated exploration of already-visited states. The approach targets sparse-reward environments where agents need a stronger signal for novelty, memory, and long-horizon discovery.

Toolbox

Technical Skills

Cloud & DevOps

AWS, Kubernetes, Docker, Terraform, ArgoCD, Jenkins, GitHub Actions

MLOps & Data

MLflow, DVC, Ray Serve, Redpanda, Apache Flink, LangGraph, LangSmith, MCP

Security & Languages

Trivy, OWASP ZAP, SonarQube, Falco, Python, Bash, YAML, PyTorch

Recognition

Honors & Awards

Semi-Finalist · Pan-IIT AI/ML Hackathon powered by IDFC First Bank, 2026

National Rank 4 · Launch Pad 2025, Pan-India Case Competition at SRCC

Contact

Let’s build something that survives production.