
Most people don’t think of documentation as a security control.
When we talk about defensive security, we usually talk about things we can point to:
- Firewalls
- EDR
- SIEMs
- Detection rules
- MFA
Documentation often sits in a different category. It feels administrative - something that comes on the side of security work, rather than security work itself.
Lately, I’ve been questioning that assumption.
Documentation seems to behave like a defensive control. It fails in many of the same ways, degrades over time, and requires maintenance. When it is missing during an incident, everyone notices immediately.
This is Roundup 0 of Blue Team Tech Writing, a series built around questions and observations from defensive security work. I’ll be exploring how documentation, communication, and technical writing influence the way blue teams operate.
A lot of those questions seem to lead back to the same place: documentation.
So that’s where I’ll start.
The Failure Mode Is Identical
The idea that pushed me toward this investigation wasn’t simply that documentation supports security work. Most things in a security program support security work in some way. What caught my attention was how documentation fails.
The failure patterns feel familiar.
| Security Control | Documentation |
|---|---|
| Requires maintenance | Requires maintenance |
| Degrades over time | Degrades over time |
| Often ignored when working | Often ignored when working |
| Tested during incidents | Tested during incidents |
| Failure becomes obvious under pressure | Failure becomes obvious under pressure |
An incident response playbook can look complete during a review. A security runbook can exist in a wiki and satisfy every documentation requirement. Everything appears fine until someone actually needs it.
Then reality starts asking questions:
- Does this process still exist?
- Is this screenshot still accurate?
- Is this escalation path correct?
- Does anyone still have these permissions?
- Who owns this system now?
The incident didn’t create those problems - it simply exposed them.
What interests me most is that neither security controls nor documentation usually announce their decline. They tend to fail quietly.
Knowledge Has a Single Point of Failure
Another pattern I’ve noticed has less to do with technology and more to do with people. Most teams rely on some amount of undocumented knowledge.
For example:
| Question | Common Answer |
|---|---|
| Who owns this system? | “Ask Sarah.” |
| How do we investigate this alert? | “John knows the process.” |
| Where is that credential stored? | “I think Mike set that up.” |
| Why was this exception approved? | “I’m not sure, but Alex would know.” |
Sometimes, this works. The problem is that incidents change the operating environment.
During an incident:
- People are tired.
- Time is compressed.
- Information is incomplete.
- Decisions carry more weight.
- Key personnel may be unavailable.
Under those conditions, tribal knowledge becomes harder to access.
This is where incident response documentation starts looking less like administration and more like infrastructure. A runbook doesn’t replace expertise, but it can preserve expertise. A playbook doesn’t eliminate judgment, but it can reduce uncertainty.
At a minimum, it gives responders somewhere to start.
And the more I think about it, the more that sounds like the purpose of many defensive controls in the first place: reducing uncertainty when conditions are at their worst.
What the Frameworks Already Know
The major incident response frameworks already understand that documentation is operational infrastructure, not administrative overhead. They just don’t always use that language.
NIST’s Cybersecurity Framework calls out documentation as part of the “Detect” and “Respond” functions. Specifically, it expects organizations to have documented processes for incident detection, analysis, and response. NIST 800-61 (Computer Security Incident Handling Guide) goes further: it requires playbooks, procedures, and communication plans. These aren’t suggestions. They’re control requirements.
CIS Controls takes a similar line. Control 19 (Incident Response Management) explicitly requires documented procedures for incident response.
What’s interesting is that both frameworks recognize documentation degrades. NIST 800-61 includes maintenance schedules. CIS Controls includes testing. Neither framework seems to assume documentation will simply exist and remain current. They build degradation into the model.
But here’s the gap in how frameworks are typically implemented: they treat documentation as a checkbox. “Do you have a playbook? Yes.” “Is it current? We think so.” The frameworks require documentation to exist and to be tested, but they don’t give teams much guidance on how to write documentation that survives contact with reality. That’s where the practitioner work begins.
The Practicioner Gap
In my experience, the gap between what frameworks require and what teams actually use comes down to a single problem: documentation written by people who have never had to use it during an incident.
When a runbook is created during a calm period, it often reflects how someone thinks the process works, not how it actually works under pressure.
More critically, documentation written without pressure testing doesn’t account for the cognitive load of incident response. A perfectly complete playbook can still fail if it’s written in a way that assumes people have time to parse it. During an incident, responders need to scan, not read. They need clear decision trees, not narrative explanations. They need to know what to do in the first thirty seconds.
The practitioner gap is the space between “documentation that satisfies compliance” and “documentation that actually gets used when everything is on fire.”
There’s another gap, too: most blue teamers aren’t trained as writers. They’re security professionals who’ve been asked to document their work. The result is often technically accurate but operationally unclear. Runbooks that are correct but slow to parse. Playbooks that cover the process but miss the pressure points.
This is where incident response documentation stops looking like a writing problem and starts looking like a design problem. How do you architect documentation so that it’s maintainable, scannable, and pressure-tested? How do you make it something teams will actually refer to when conditions are worst?
That’s what Blue Team Tech Writing is here to explore.
What Blue Team Tech Writing Is Building
This publication is built around a single premise: documentation is a defensive control, and it deserves the same rigor, testing, and maintenance that we apply to technical defenses.
That means we’re going to be exploring:
- Patterns and architectures for incident response documentation that survive real incidents and degrade gracefully
- Testing strategies that reveal whether documentation will actually work when you need it
- Design principles for runbooks, playbooks, and procedures that account for cognitive load and time pressure
- Maintenance frameworks that keep documentation current without requiring constant rewriting
- Case studies from real incidents: what documentation worked, what failed, and why
Each roundup will examine a specific aspect of documentation as it functions in defensive security work. Some will be theoretical. Some will be tactical. All of them will start from the same place: the observation that documentation fails like a security control fails, and that’s not a metaphor—it’s an architectural problem we can solve.
The frameworks already know documentation matters. The practitioners already understand the gap. What’s missing is a systematic approach to treating documentation as infrastructure, not as administration.
That’s what we’re building here.