
Amirreza Rezaie
Site Reliability Engineer at Snapp
I work on large-scale distributed systems at Iran's leading ride-hailing platform, focusing on infrastructure reliability, observability, and building systems that work under pressure.
Beyond my day job, I'm the creator of Goalixa — a productivity platform I've been developing to deepen my understanding of modern software architecture, from microservices to GitOps deployments.
Work Experience
Site Reliability Engineer
- Incident management and on-call response for critical production systems
- OpenShift cluster management for containerized workloads
- Designing and implementing alerting rules for system reliability
- Developing Python-based SRE microservices for automation
- Working with message queue technologies (RabbitMQ, Kafka, NATS)
- Ensuring reliability and performance of high-traffic services
Personal Projects
Goalixa — Productivity Platform
My learning playground where I experiment with modern software architecture
Skills & Technologies
Container Orchestration
Kubernetes, OpenShift, Docker
Storage
Longhorn
Languages
Python, Go, Bash
Observability
Prometheus, Grafana, Thanos, ELK
Message Queues
RabbitMQ, Kafka, NATS
CI/CD & GitOps
ArgoCD, GitHub Actions, GitLab CI
AI
Automation With AI, Development With AI, Orchestration with AI
Certifications
Featured Work
From k3s to kubeadm: My Kubernetes Migration Journey
Every infrastructure decision is a trade-off. This documents why I migrated and the practical strategy for the transition.
Read more →Latency Taught Me Better Software Engineering
The mindset shift when performance moved from a dashboard metric to something I could feel directly.
Read more →PWA Path Latency Incident Report
A production incident, full timeline, root cause analysis, and lessons learned.
Read more →