codemapoest. 2024

BlogBLOG RoadmapROADMAP UtilityUTILITY ProjectsPROJECTS AboutABOUT

© codemapo · est. 2024

All rights reserved.

#SRE

5 posts tagged

DateCoordTitleTags

2026.03.20I·02

Incident Management: Writing Postmortems and Managing Incidents

Incidents will happen. What matters is how fast you recover and what you learn. From severity levels and incident roles to blameless postmortems and action items that actually get done.

Incident ManagementPostmortemSRE

→2025.09.25I·17

Postmortem: Post-Incident Analysis

Postmortem purpose and writing method

postmortemincidentsre

→2025.09.18I·16

What Is SRE: Google's Philosophy for Turning Operations into Engineering

Running a service means failures will happen. Reading Google's SRE book made me realize that operations is a high-level engineering problem, not just toil. I walk through how the concepts of SLI, SLO, and Error Budget shift your mindset from firefighter to architect.

SREDevOpsReliability

→2025.08.05I·10

Would You Drive with Your Eyes Closed? (Why You Need Monitoring)

Users complained the service was slow, but I was blindly grepping log files. I share how I moved from 'driving blind' to full observability using Prometheus and Grafana, and explain Google's 4 Golden Signals of monitoring.

DevOpsMonitoringPrometheus

→2025.05.22I·03

Chaos Engineering: Building Immunity by Breaking Things

Why would Netflix intentionally shut down its own production servers? Explore the philosophy of Chaos Engineering, the Simian Army, and detailed strategies like GameDays and Automating Chaos to build resilient distributed systems.

DevOpsSREInfrastructure

▸ Other tags

#CS164 #Performance60 #React55 #Security52 #DevOps46 #Architecture42 #Frontend36 #Web35 #OS33 #Network32 #Backend31 #Debugging29 #hardware29 #Next.js29 #Database28 #Flutter28 #TypeScript24 #Algorithm23 #AI22 #Mobile22 #Optimization22 #JavaScript21 #Infrastructure19 #System Design19

Browse all tags →