<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Pete Shima</title><description>Engineering leader focused on reliability and high-scale backend systems — plus weekend side projects. Previously Fortnite, Epic Games, HashiCorp, AWS, and Rockstar.</description><link>https://peteshima.com/</link><item><title>On Software Quality</title><link>https://peteshima.com/2026/04/01/on-software-quality/</link><guid isPermaLink="true">https://peteshima.com/2026/04/01/on-software-quality/</guid><description>Everyone’s busy, tl;dr Trust is a tricky thing “Trust comes on foot but leaves on horseback” is a Dutch proverb often attributed to the 19th-century Dutch statesman Johan Rudolph Thorbecke. It means t</description><pubDate>Wed, 01 Apr 2026 00:00:00 GMT</pubDate><category>Essay</category></item><item><title>Job Post: Matchmaking Backend Engineer (Go)</title><link>https://peteshima.com/2026/01/27/embark-matchmaking-backend-engineer-job/</link><guid isPermaLink="true">https://peteshima.com/2026/01/27/embark-matchmaking-backend-engineer-job/</guid><description>In Job Posting Review we take a quick, or sometimes detailed, look at a public job posting and comment on it. Since job postings come and go, a copy is listed here where we break it down. If you are t</description><pubDate>Tue, 27 Jan 2026 00:00:00 GMT</pubDate><category>Job Review</category></item><item><title>The most interesting thing about the Cloudflare incident report isn’t in the report.</title><link>https://peteshima.com/2025/11/21/the-most-interesting-thing-about-the-cloudflare-incident-report-isnt-in-the-report/</link><guid isPermaLink="true">https://peteshima.com/2025/11/21/the-most-interesting-thing-about-the-cloudflare-incident-report-isnt-in-the-report/</guid><description>The most recent Cloudflare outage was a pretty big one. They followed up with an incident report really quickly, same day is impressive. At other places I have worked that operated quickly, even 24 ho</description><pubDate>Fri, 21 Nov 2025 00:00:00 GMT</pubDate><category>Incidents</category></item><item><title>Getting Cooked by AI – A GameJam story</title><link>https://peteshima.com/2025/08/18/getting-cooked-by-ai-a-gamejam-story/</link><guid isPermaLink="true">https://peteshima.com/2025/08/18/getting-cooked-by-ai-a-gamejam-story/</guid><description>I have never participated in a Game Jam before so I thought I would give this one a try. I am not expecting to win a game jam, more to learn and have fun building some games. What is a game jam? From </description><pubDate>Mon, 18 Aug 2025 00:00:00 GMT</pubDate><category>Building</category></item><item><title>Job Posting Review: Grinding Gear Games – Web Programmer</title><link>https://peteshima.com/2025/08/07/job-posting-review-grinding-gear-games-web-programmer/</link><guid isPermaLink="true">https://peteshima.com/2025/08/07/job-posting-review-grinding-gear-games-web-programmer/</guid><description>In Job Posting Review we take a quick, or sometimes detailed, look at a public job posting and comment on it. Since job postings come and go, a copy is listed here where we break it down. If you are t</description><pubDate>Thu, 07 Aug 2025 00:00:00 GMT</pubDate><category>Job Review</category></item><item><title>Will AI Agents Transform Senior Engineer Career Peaks?</title><link>https://peteshima.com/2025/08/04/ai-agents-transforming-senior-engineer-career-peaks/</link><guid isPermaLink="true">https://peteshima.com/2025/08/04/ai-agents-transforming-senior-engineer-career-peaks/</guid><description>There was a chart drawn for me that I’ve also drawn for many folks over the years. It looks something like this: Career Impact Over Time This is someone’s impact over time in their career working as a</description><pubDate>Mon, 04 Aug 2025 00:00:00 GMT</pubDate><category>Career</category></item><item><title>Exploring Vibe Coding: Building Glitchjack</title><link>https://peteshima.com/2025/07/31/exploring-vibe-coding-building-glitchjack/</link><guid isPermaLink="true">https://peteshima.com/2025/07/31/exploring-vibe-coding-building-glitchjack/</guid><description>Over the last few days I have been vibe coding the game Glitchjack, a black jack inspired game with all sorts of glitches. Right now it is pretty basic with only generating a random deck on each game </description><pubDate>Thu, 31 Jul 2025 00:00:00 GMT</pubDate><category>Building</category></item><item><title>Reliability in Games (RIG) Series</title><link>https://peteshima.com/2025/07/26/reliability-in-games-series/</link><guid isPermaLink="true">https://peteshima.com/2025/07/26/reliability-in-games-series/</guid><description>Game launches and reliability over the past decade have had a troubled past. If you look at most major games they have had an outage or degraded experience at launch. There is almost an expectation fr</description><pubDate>Sat, 26 Jul 2025 00:00:00 GMT</pubDate><category>Reliability</category></item><item><title>Reliability in Games: High Level Issues</title><link>https://peteshima.com/2025/07/26/reliability-in-games-high-level-issues/</link><guid isPermaLink="true">https://peteshima.com/2025/07/26/reliability-in-games-high-level-issues/</guid><description>This is part 1 of a series on Reliability in Games(RIG). Where we explore why games have so many reliability problems. This focuses more on large online games rather than single player or board games.</description><pubDate>Sat, 26 Jul 2025 00:00:00 GMT</pubDate><category>Reliability</category></item><item><title>Destiny Back End Stability</title><link>https://peteshima.com/2023/08/07/18/</link><guid isPermaLink="true">https://peteshima.com/2023/08/07/18/</guid><description>Over the past few months Bungie put out a few posts on Destiny that detailed what they were doing on the back end and I finally got a chance to read through these. THIS WEEK AT BUNGIE – 05/18/2023THIS</description><pubDate>Mon, 07 Aug 2023 00:00:00 GMT</pubDate><category>Reliability</category></item><item><title>Monitoring Your Home Network - Updated</title><link>https://peteshima.com/2022/07/27/monitoring-your-home-network-updated/</link><guid isPermaLink="true">https://peteshima.com/2022/07/27/monitoring-your-home-network-updated/</guid><description>A practical, low-cost way to monitor your whole home network with InfluxDB 2.0 and Telegraf — ISP, wifi, and device health across multiple sites, with automatic alerting.</description><pubDate>Wed, 27 Jul 2022 00:00:00 GMT</pubDate><category>Building</category></item><item><title>On Incident Assumptions</title><link>https://peteshima.com/2020/03/16/on-incident-assumptions/</link><guid isPermaLink="true">https://peteshima.com/2020/03/16/on-incident-assumptions/</guid><description>The world is on fire If you have been through a large number of high severity events, you’ve probably found yourself working with a mental model based on graphs, logs, or other telemetry – except the </description><pubDate>Mon, 16 Mar 2020 00:00:00 GMT</pubDate><category>Incidents</category></item><item><title>Onboarding, On-Call and Learning</title><link>https://peteshima.com/2017/04/05/onboarding-on-call-and-learning/</link><guid isPermaLink="true">https://peteshima.com/2017/04/05/onboarding-on-call-and-learning/</guid><description>A while back in the hangops slack I offered to try and help anyone that I could in the #career_advice or #job_board channels. I am no expert on these topics but maybe I have some knowledge that will h</description><pubDate>Wed, 05 Apr 2017 00:00:00 GMT</pubDate><category>Career</category></item><item><title>Postmortem Review: Stack Exchange Outage (July 2016)</title><link>https://peteshima.com/2016/07/20/stack-exchange-outage-postmortem-review/</link><guid isPermaLink="true">https://peteshima.com/2016/07/20/stack-exchange-outage-postmortem-review/</guid><description>A review of Stack Exchange&apos;s July 20, 2016 outage postmortem — the regex that spiked CPU, the cascading load-balancer failure, and how they communicated it.</description><pubDate>Wed, 20 Jul 2016 00:00:00 GMT</pubDate><category>Incidents</category></item><item><title>Pete’s Terraform Tips</title><link>https://peteshima.com/2016/06/10/petes-terraform-tips/</link><guid isPermaLink="true">https://peteshima.com/2016/06/10/petes-terraform-tips/</guid><description>My Terraform experience started with a single Terraform environment which is over a year old in production with over 3000 Terraform applies and over 4500 versions of state. The single environment grew</description><pubDate>Fri, 10 Jun 2016 00:00:00 GMT</pubDate><category>Building</category></item><item><title>Postmortem Review: GitHub Outage (January 2016)</title><link>https://peteshima.com/2016/01/28/github-outage-postmortem-review/</link><guid isPermaLink="true">https://peteshima.com/2016/01/28/github-outage-postmortem-review/</guid><description>A review of GitHub&apos;s public postmortem for the January 28, 2016 outage — what they communicated, how, and what made their incident response build customer trust.</description><pubDate>Thu, 28 Jan 2016 00:00:00 GMT</pubDate><category>Incidents</category></item><item><title>5 Lessons Learned in Ops</title><link>https://peteshima.com/2016/01/21/5-lessons-learned-in-ops/</link><guid isPermaLink="true">https://peteshima.com/2016/01/21/5-lessons-learned-in-ops/</guid><description>Five lessons from years on operations teams: ask why, kill invisible work, own the negative space, build safety through process instead of policing, and optimize for both safety and speed.</description><pubDate>Thu, 21 Jan 2016 00:00:00 GMT</pubDate><category>Career</category></item><item><title>Response to Chef in 2012</title><link>https://peteshima.com/2012/01/07/response-to-chef-in-2012/</link><guid isPermaLink="true">https://peteshima.com/2012/01/07/response-to-chef-in-2012/</guid><description>A long response to &apos;Chef in 2012&apos; — the sysadmin-to-DevOps transition, testing infrastructure, cookbooks, and config-management adoption. (From the 2012 archives.)</description><pubDate>Sat, 07 Jan 2012 00:00:00 GMT</pubDate><category>Essay</category></item><item><title>Happy New Beards</title><link>https://peteshima.com/2012/01/01/happy-new-beards/</link><guid isPermaLink="true">https://peteshima.com/2012/01/01/happy-new-beards/</guid><description>A New Year&apos;s reflection from 2012 — moving back to Seattle, leaving Windows for Mac/Linux, and learning Ruby and Chef. (From the archives.)</description><pubDate>Sun, 01 Jan 2012 00:00:00 GMT</pubDate><category>Career</category></item><item><title>Riding the Unicorn</title><link>https://peteshima.com/2011/12/14/riding-the-unicorn/</link><guid isPermaLink="true">https://peteshima.com/2011/12/14/riding-the-unicorn/</guid><description>A field guide to running the Unicorn rack server in production — worker tuning, preloading, sockets and backlog, plus monitoring with Nagios and god. (From the 2011 archives.)</description><pubDate>Wed, 14 Dec 2011 00:00:00 GMT</pubDate><category>Building</category></item></channel></rss>