Available for Senior / Staff Roles

Thaneesh Aadithya

Platform Engineer & SRE  ·  DevOps  ·  Cloud Infrastructure  ·  7 Years

Building and operating the platform that 30+ engineering teams rely on — sustaining 99.9%+ uptime, tripling deployment frequency, cutting incident detection by 60% at Fortune-500 scale.

profile.yaml
name: Thaneesh Aadithya
role: Platform Engineer & SRE
experience: 7 years
cluster_size: ~200 nodes
uptime: 99.9%+
teams_served: 30+
compute_saved: $120K / year
p1_incidents: 0
mttd_reduction: 60%
open_to: Hyderabad + Global Remote
visa: USA · UK · CA · AU · EU
# ready to deploy 🚀
99.9%+ Platform Uptime
Faster Deployments
↓60% MTTD Reduction
6h→30m Env Provisioning
$120K Saved / Year
0 P1 Deploy Incidents
30+ Teams Served
18mo Zero Downtime Streak

Who I Am

Senior Platform Engineer & SRE with 7 years designing, automating, and operating large-scale cloud-native platforms on AWS — across Fortune-500 product companies and enterprise IT consulting engagements.

I own the platform that 30+ engineering teams rely on to ship safely — sustaining 99.9%+ uptime, tripling deployment frequency, and cutting incident detection by 60% through principled IaC, GitOps, SLO-based reliability engineering, and full-stack observability.

Equally fluent in agile product-engineering culture and structured service-delivery models. I combine deep infrastructure expertise with SRE practices and DevSecOps to build platforms teams love and trust.

profile · thaneesh.aadithya
roleSenior Platform Engineer & SRE
companyTarget Corporation
experience7 Years
locationHyderabad, India
remoteUTC+4–UTC+8 overlap
visaUSA · UK · Canada · AU · EU
emailthaneeshaadithya5@gmail.com
☁️
Primary Cloud
AWS · Azure · GCP
☸️
Orchestration
Kubernetes / EKS at 200-node scale
🏗️
IaC
Terraform · Ansible · GitOps
📊
Observability
OpenTelemetry · Grafana LGTM · Prometheus
🔐
Security
DevSecOps · OPA · Falco · CIS Benchmarks
📈
SRE Practices
SLO/SLI · Error budgets · Chaos engineering

What I Work With

☁️ Cloud / AWS
EC2EKSVPCIAMALB/NLBAuto ScalingS3RDS/AuroraECRRoute 53Secrets ManagerCloudWatchX-RayCost ExplorerAzure AKSGCP GKE
☸️ Kubernetes
EKSHelmKarpenterHPAVPAKEDARBACNetworkPoliciesPod Security StandardsAdmission WebhooksOTel Operator
🏗️ IaC & Config
TerraformOPA policy-as-codeDrift detectionCheckovAnsibleCloudFormationRemote State
🔄 CI/CD & GitOps
GitHub ActionsGitLab CIJenkinsArgo CDArgo RolloutsCanary/Blue-GreenPR-gated promotionsAuto-rollback
📊 Observability
OpenTelemetryGrafana LGTMLokiMimirTempoPrometheusAWS X-RaySLO Dashboards
🔐 Security & SRE
OPA GatekeeperFalcoTrivySAST/DASTSLO/SLIError BudgetsChaos EngineeringCIS Benchmarks
💻 Scripting
PythonBash/ShellYAMLGoLinuxDockerJIRAConfluence
🧠 AI/ML Infra
GPU NodePoolsvLLMKServeMLflowKubeflowKEDASageMaker

Professional Experience

Senior Platform Engineer & SRE
Target Corporation
Platform Engineering · SRE · AWS EKS · Terraform · GitOps · Observability · Team Leadership
Mar 2023 – Present
Platform reliability & autoscaling — sustained 99.9%+ uptime over 24 months on a ~200-node EKS cluster running 30+ microservices across 4 environments (~5M req/day). HPA + Karpenter absorbs 2–3× surges within 90 seconds. Zero downtime across Black Friday & Cyber Monday.
Multi-AZ architecture & DR — designed 3-tier multi-AZ VPC (public/private/intra) across 3 AZs with HA NAT gateways. Route 53 health-check failover + RDS Multi-AZ — RTO <15min / RPO <5min validated in quarterly DR drills.
SLO engineering & error budget management — defined SLIs/SLOs for 30+ services; burn-rate alerting (14.4× fast-burn → page, 6× slow-burn → warn) — reduced alert noise by ~45%.
Full-stack observability & OpenTelemetry — OTel Collector DaemonSet routing ~50M spans/day to Grafana LGTM; 100% trace coverage with zero code changes; single-click alert → trace → log RCA. MTTD ↓60%, RCA time ↓35%.
GitOps CI/CD & Terraform IaC — GitHub Actions → Helm → Argo CD with Argo Rollouts canary (5→25→50→100%). Tripled deploy frequency (~60 releases/week), zero P1 deploy incidents. Terraform module library cut provisioning from 6h to 30min (−92%).
FinOps & security — Karpenter Spot-first saved ~$120K/year. OPA Gatekeeper, Falco, Trivy — zero critical CVEs for 12 months, CIS EKS Benchmark compliant.
Leadership — led & mentored 3-engineer sub-team; bi-weekly SRE sessions adopted by 30+ teams. MTTR <30min for all P1s over 18 months.
Cloud Infrastructure Consultant
Virtusa
Multi-Client AWS Infrastructure · Jenkins CI/CD · Terraform at Scale · Consulting Delivery
Jun 2022 – Feb 2023
Multi-environment IaC & CI/CD — architected dev/QA/pre-prod/prod AWS environments for 2 enterprise clients with isolated Terraform state, OPA gates, and Jenkins shared-library pipelines — cut provisioning time ~65%.
Module library & observability — reusable Terraform module library adopted by 8 engineers; zero state corruption. OTel SDK tracing — surfaced 5 latency hotspots reducing p99 API latency by ~25%.
DevOps Engineer
Quest Global
Containerisation · Jenkins CI/CD · Kubernetes Operations
Oct 2021 – May 2022
Containerisation & CI/CD — containerised 10+ microservices with multi-stage Docker builds — cut build time ~35%. Jenkins pipelines compressed releases from bi-weekly to daily. K8s probe tuning reduced false-positive pages ~45%.
Associate Systems Engineer
Tanama Software
Linux Systems · AWS Fundamentals · Monitoring & Documentation
Oct 2019 – Sep 2021
AWS infra & documentation — managed Linux-based AWS infra (EC2/VPC/ALB/RDS/DynamoDB/S3/Route 53) for multiple clients maintaining 99%+ SLA. Authored runbooks and SOPs — cut escalations ~30% and reduced onboarding time ~30%.

Featured Work

🔄 View ↗
gitops-cicd-pipeline
GitHub Actions → Argo Rollouts canary (5%→25%→50%→100%) with Prometheus analysis gates, sync windows, auto-rollback on SLO breach.
✅ 3× deploy frequency · Zero P1 deploy incidents
GitHub ActionsArgo RolloutsPrometheus
📊 View ↗
k8s-observability-stack
Grafana LGTM + OpenTelemetry Collector + X-Ray. SLO burn-rate alerting, pre-built dashboards, runbook templates per alert. ~50M spans/day.
✅ MTTD ↓60% · RCA time ↓35%
OpenTelemetryGrafanaPrometheusLoki
🔐 View ↗
aws-secrets-hardening
Secrets Manager + ESO + OPA zero-creds enforcement + Falco runtime detection + Checkov IaC scanning. IAM least-privilege patterns.
✅ Zero critical CVEs for 12 consecutive months
Secrets ManagerOPAFalcoTrivy
📋 View ↗
Incident-runbook
Battle-tested SRE runbooks — EKS P1/P2, Terraform state, Argo CD sync, Karpenter, DR/Multi-AZ. Blameless postmortem framework included.
✅ MTTR <30min · Repeat incidents ↓40%
SREEKSTerraform
🤖 View ↗
mlops-eks-platform
AI/ML Platform on EKS — GPU NodePools (Karpenter), vLLM LLM serving, KServe inference with canary rollouts, MLflow, Kubeflow Pipelines, KEDA autoscaling.
🤖 GPU inference · LLM serving · MLOps pipelines
vLLMKServeMLflowGPU

How I Think

"The pipeline isn't the product. Confidence is."
// SRE asks
"How do I eliminate the toil that keeps pulling engineers away from building?"
Tools: SLOs, error budgets, incident response, blameless postmortems, chaos engineering.
// Platform Engineering asks
"How do I build the product that 30 teams use to ship without needing infrastructure expertise?"
Tools: golden-path pipelines, self-service provisioning, GitOps, developer experience.

Academic Background

🎓
Bachelor of Science — Computer Science
Acharya Nagarjuna University, Guntur, India
2013 – 2016

Open To

📍
India
Hyderabad / Pan-India Remote
Senior / Staff Platform Engineer · SRE · DevOps
🌐
Global Remote
UTC+4–UTC+8 overlap
Senior / Staff Platform Engineer · SRE
🇺🇸
USA
Senior / Staff Platform · SRE
H-1B / L-1 Visa
🇬🇧
United Kingdom
Senior / Staff Platform · SRE
Skilled Worker Visa
🇨🇦
Canada
Senior / Staff Platform · SRE
Express Entry
🇪🇺
Europe
Germany · Netherlands · Ireland
EU Blue Card

Let's Connect

Hiring for a Senior / Staff Platform Engineer, SRE, DevOps, or Cloud Infrastructure role? Building an AI infrastructure team? I'd love to connect.

Available for Opportunities
  • Senior / Staff Platform Engineer
  • Site Reliability Engineer (SRE)
  • DevOps Engineer
  • Cloud Infrastructure Engineer
  • India · Global Remote · Relocation
  • References available on request