Satyajit Roy

Engineering Executive | Platform, SRE & AI Infrastructure

I design, scale, and lead hyperscale platforms delivering reliability, efficiency, and clarity at internet scale.

Satyajit Roy
20+ Years Experience
30B Daily Requests Scale
99.95%+ Reliability
55+ Engineers Leadership

Featured Case Studies

View All Work →

Hyperscale ML/Search Platform at Adobe

Architect & Technical Leader.

Adobe's Core Search and Sensei platform serves as the intelligence layer behind flagship products, processing 30B+ daily requests.

AI/ML workloads were outgrowing the existing infrastructure, creating scaling, latency, and cost challenges.

Impact & Metrics
  • Multi Billion requests, GPU utilization +38%
  • Supported 30B+ daily API requests with >99.98% availability.
  • Increased GPU utilization by 38% through smarter scheduling.

Global SRE Operating Model at F5

Sr. Director of Product Engineering & Head of SRE.

F5’s Distributed Cloud platform powers global multi‑cloud networking and security for enterprise customers.

Silos, inconsistent incident response, and burnout were slowing down a platform facing explosive traffic growth.

Impact & Metrics
  • 55+ engineers, MTTR −73%
  • Reduced MTTR by 73% and improved incident consistency.
  • Lowered attrition by 10% by eliminating hero culture.

Platform Modernization & FinOps at Arkose Labs

Director of Engineering & SRE.

Arkose Labs fights fraud at internet scale, requiring real‑time decisioning under unpredictable attack traffic.

Cloud spend was rising faster than revenue, and technical debt was slowing delivery.

Impact & Metrics
  • 22% cloud spend reduction
  • Reduced cloud spend by 22% while supporting 7x transaction growth.
  • Maintained 99.9% SLA even during attack spikes.

Enterprise CI/CD & Platform Modernization at Macys.com

Architect & Technical Leader.

Macy’s needed a modern deployment platform to support rapid retail innovation and peak‑season reliability.

Deployments were slow, manual, and risky — causing downtime during revenue‑critical periods.

Impact & Metrics
  • Near-zero downtime releases
  • Achieved near‑zero downtime releases across the e‑commerce stack.
  • Cut deployment time from days to under an hour.

How I Work

Leadership & Philosophy

Systems Thinking

I approach engineering organizations as distributed systems—optimizing for flow, feedback loops, and resilience at scale.

Empowered Teams

I build high-trust cultures where engineers own their outcomes, with clear paths for growth and autonomy.

Operational Excellence

Reliability is a feature. I champion SRE principles to shift from reactive firefighting to proactive stability.

Open Source

View All →

Setup DevBox

Onboarding slows down when every engineer’s laptop behaves differently.

View Repo

Git Selective Ignore

Local configs and secrets often sneak into commits.

View Repo

Blogs Publisher

Cross‑posting content manually wastes time and breaks consistency.

View Repo

Writing

View All →

Overlay Networking Deep Dive

A layered exploration of modern networking — from packets to policy — and how it shapes cloud‑native systems.

GitOps Journey

A practical look at adopting GitOps at scale — what worked, what didn’t, and what surprised us.

Rust Basics & Ownership

A friendly walkthrough of Rust’s ownership model and why it changes how you think about systems programming.

Interested in platform leadership, architecture reviews, or collaboration?