A collection of my published articles on Site Reliability Engineering, DevOps, and cloud infrastructure.
VMware’s approach to balancing feature delivery speed with system reliability.
Transform your ops team into an SRE powerhouse with this comprehensive guide.
Apply SRE practices to improve monolithic application reliability.
Best practices for defining meaningful SLIs that accurately reflect user experience.
Reduce MTTR and speed up incident resolution with proven SRE techniques.
Learn how to minimize the impact of production incidents with SRE best practices.
An introduction to VMware CRE and their approach to customer reliability.
How VMware CRE helps organizations reduce operational burden and improve reliability.
Tools and techniques for assessing reliability risks in Kubernetes deployments.
Overview of Google Cloud’s new Architecture Framework for designing cloud-native applications.