Safeguard 18.1: Establish and Maintain a Penetration Testing Program

9 min readLast verified: February 2026By Isaac SilvermanOur methodology

Safeguard 18.1 requires you to run a defined, repeatable penetration testing program that is scoped to your real attack surface, performed by qualified testers, and tracked through remediation to closure with retained evidence. To operationalize it fast, document program rules (scope, cadence triggers, independence, ROE), execute tests against prioritized systems, and prove follow-up.

Key takeaways:

Write a penetration testing program standard that defines scope, triggers, independence, and reporting aligned to your asset inventory.
Treat remediation as part of the control: findings need owners, deadlines, retesting, and closure evidence.
Keep audit-ready artifacts: test plans, rules of engagement, reports, tickets, retest results, and management review.

The safeguard 18.1: establish and maintain a penetration testing program requirement is about disciplined validation, not “running a pen test once.” A credible program shows that you (1) know what you’re testing, (2) test it in a controlled, authorized way, (3) learn from the results, and (4) fix what matters with proof.

For a Compliance Officer, CCO, or GRC lead, the fastest path is to convert “penetration testing” into a control you can explain in one page to auditors and operators: what triggers a test, what environments are in scope, what methods are allowed, who can perform it, how you approve it, and how findings move from report to remediation to retest. Then you run it and retain clean evidence.

CIS Controls v8 positions penetration testing under the broader goal of testing and validating security defenses, so your program should tie directly to your enterprise risk assessment, your asset inventory, and the systems that matter (internet-facing services, sensitive data environments, identity, and key third parties). (CIS Controls v8; CIS Controls Navigator v8)

Regulatory text

Excerpt (provided): “CIS Controls v8 safeguard 18.1 implementation expectation (Establish and Maintain a Penetration Testing Program).” (CIS Controls v8; CIS Controls Navigator v8)

Operator interpretation: You must define and run a penetration testing program as an ongoing security practice. Auditors will look for two things: (1) a documented program (design) and (2) executed tests with remediation tracking (operation). The easiest way to fail 18.1 is to have a single penetration test report with no clear scope logic, no authorization trail, and no evidence that findings were fixed or retested. (CIS Controls v8; CIS Controls Navigator v8)

Plain-English interpretation

A penetration testing program is a governed process to simulate real attacker behavior against your systems, identify exploitable weaknesses, and drive verified fixes. “Program” means it is planned, repeatable, approved, and measured. It is broader than a vulnerability scan: it includes human-led attack paths, chaining misconfigurations, and validating whether controls actually stop an attack.

If you run tests but cannot show how scope was chosen, who approved testing, how testers were qualified, and how results were remediated, you do not have an operational program.

Who it applies to (entity and operational context)

This safeguard applies to enterprises and technology organizations adopting CIS Controls v8 as a security baseline, including:

SaaS and tech companies with production environments, CI/CD, APIs, and cloud control planes.
Regulated enterprises using CIS Controls as part of a control mapping exercise (for example, to support broader security and assurance objectives).
Organizations with meaningful external exposure: internet-facing apps, remote access, identity providers, customer portals, and sensitive data environments.
Organizations relying on third parties that host, operate, or materially impact your systems (for example, managed service providers, cloud providers, app security firms). Pen tests may include third-party hosted components, but scope and permission must be explicit.

Operationally, 18.1 touches Security, Engineering, IT, and GRC. GRC typically owns the program standard and evidence model; Security/Engineering owns execution and remediation.

What you actually need to do (step-by-step)

1) Define the penetration testing program standard (your “control design”)

Create a short program document (often 2–6 pages) with these required decisions:

A. Scope model (what gets tested)

Tie scope to your asset inventory and data flows.
Define in-scope categories such as: internet-facing apps/APIs, cloud control plane, external network perimeter, internal “crown jewels,” and high-risk third-party integrated services.
Define out-of-scope categories and why (then track them as risk acceptances, not silent exclusions).

B. Triggers (when you test) Specify what causes a test. Common triggers you can adopt as policy language:

Material application releases or architecture changes
Major network boundary changes
Significant IAM changes (SSO/IdP migrations)
New critical third-party integrations or hosting model changes
Post-incident validation testing

Keep triggers in plain operational terms so engineering can recognize them and open a request.

C. Independence and qualifications (who tests)

Decide when testing must be independent from the system owner (often for production or high-risk systems).
Define minimum tester qualifications (firm credentials, prior experience, or internal separation of duties).
Require conflict-of-interest controls when using third parties.

D. Rules of Engagement (ROE) requirements Your ROE template should require:

Written authorization and dates/times
In-scope targets (hosts, apps, APIs, cloud accounts)
Testing methods allowed (e.g., social engineering allowed or prohibited)
Data handling rules (no customer data extraction; redact secrets in reports)
Stop conditions and escalation path
Evidence retention expectations

E. Reporting and severity model Standardize what a report must include: attack narrative, affected assets, reproduction steps, business impact, recommended fixes, and mapping to your ticketing workflow.

F. Remediation and retest requirements Define required actions:

Assign an owner for every finding
Set remediation expectations based on severity (you set the timelines as internal standards)
Retest criteria and what counts as closure
Risk acceptance path with approvers and expiration

This is where many programs break. A penetration test without closure mechanics becomes a shelf document.

2) Build an annual (or rolling) test plan from real risk

Create a test plan that lists:

Target systems and rationale (risk ranking, external exposure, sensitive data)
Testing type (web app, API, cloud configuration attack paths, internal lateral movement)
Test method (black/gray/white box as you define internally)
Dependencies (test accounts, staging vs prod constraints)

Make sure the plan is traceable to your asset inventory and risk assessment so scope looks intentional during an audit. (CIS Controls v8; CIS Controls Navigator v8)

3) Execute tests with clean governance

Operational checklist for each engagement:

Open a penetration test request record (ticket or GRC workflow).
Finalize scope and ROE; obtain written approvals from system owner and Security.
Confirm logging/monitoring expectations (so SecOps doesn’t treat the test as an uncontrolled incident).
Run testing, collect evidence, and hold daily/weekly check-ins for critical findings.
Deliver a final report and management readout for high-risk tests.

4) Convert findings into trackable work (tickets) with accountability

Minimum mechanics:

Each finding becomes a ticket with: owner, affected system, severity, recommended fix, and due date.
Link tickets back to the pen test engagement record.
Track exceptions: if a finding isn’t fixed, capture a formal risk acceptance with approver and expiration.

This is where Daydream fits naturally: use it to map Safeguard 18.1 to a documented control operation and set recurring evidence capture so you can prove the program ran, not just that a report exists. (CIS Controls v8; CIS Controls Navigator v8)

5) Retest and close (prove validation)

Require one of these closure methods for each finding:

Retest by the tester (preferred for critical paths)
Internal validation with documented steps and evidence (screenshots, logs, configuration diffs)
Compensating control acceptance with documented rationale and expiration

6) Program governance: measure, review, improve

Run a periodic management review that covers:

What was tested vs plan
Open findings and overdue items
Themes (recurring root causes like IAM misconfig, authz gaps, secrets management)
Updates needed to standards, SDLC gates, or monitoring

Required evidence and artifacts to retain

Auditors typically want design evidence plus operational proof. Retain:

Program design

Penetration Testing Program Standard (scope model, triggers, independence, ROE, reporting, remediation, retest)
ROE template and approval workflow description
Test plan (annual/rolling) tied to asset inventory/risk

Per engagement

Signed/approved ROE (or equivalent approval record)
Final report (with scope, methodology summary, findings)
Evidence of tester qualifications/engagement agreement (internal assignment memo or third-party SOW)
Communication artifacts for critical findings (readout notes, emails, meeting minutes)

Remediation and closure

Ticket export or screenshots showing assignment, progress, and closure
Retest results or validation evidence
Risk acceptances with approver, rationale, and expiration date

Governance

Management review notes and action items
Metrics dashboard (qualitative is fine if you avoid unsourced numbers)

Common exam/audit questions and hangups

A good examiner line of questioning is predictable:

“Show me your penetration testing policy/standard.”
Hangup: you only have a vendor report and no internal standard.
“How did you choose scope?”
Hangup: scope is “whatever the vendor tested last time,” not tied to inventory/risk.
“Who authorized testing, and where is the ROE?”
Hangup: informal approvals in chat with no retention.
“How do you ensure remediation happens?”
Hangup: findings live in PDFs with no tickets, owners, or closure criteria.
“How do you handle third-party hosted systems?”
Hangup: you assume your provider’s testing covers you, but you cannot show what was tested and what you received.

Frequent implementation mistakes and how to avoid them

Mistake	Why it fails audits	Fix
Treating a one-time pen test as a “program”	No repeatability, triggers, or governance	Publish a program standard + rolling plan, then run it
No ROE or weak authorization	Creates operational and legal risk; evidence gap	Require written ROE approvals for each engagement
No linkage from findings to tickets	No proof of remediation	Enforce “no finding left behind” ticketing workflow
Testing only perimeter IPs	Misses apps, APIs, cloud control plane	Define scope categories and map to inventory
Closing findings without retest evidence	Control not validated	Define closure criteria and collect proof

Enforcement context and risk implications

No public enforcement cases are provided in the supplied source catalog for this safeguard. Practically, the risk is less about “failing CIS” and more about predictable downstream outcomes: repeated control failures, unknown attack paths, and weak defensibility after an incident because you cannot show you tested and remediated known weaknesses. (CIS Controls v8; CIS Controls Navigator v8)

Practical 30/60/90-day execution plan

First 30 days (stand up the control)

Draft and approve the Penetration Testing Program Standard (scope model, triggers, independence, ROE, remediation, retest).
Publish ROE template and approval workflow.
Build your initial test inventory: list priority systems from asset inventory and risk input.
Configure evidence capture in your GRC system (Daydream or equivalent): engagement record, artifact checklist, and remediation linkage. (CIS Controls v8; CIS Controls Navigator v8)

Day 31–60 (run and operationalize)

Execute at least one high-value test (internet-facing app/API or cloud control plane path).
Convert findings into tickets with owners and due dates; start remediation work.
Run a formal readout with engineering leadership; document decisions and exceptions.

Day 61–90 (close the loop and stabilize)

Retest critical fixes and document closure evidence.
Hold a management review: plan vs actual, open items, systemic themes.
Update the next-cycle test plan based on lessons learned (new scope targets, improved ROE, updated triggers).

Frequently Asked Questions

Do we have to use an external third party for penetration testing?

CIS v8 Safeguard 18.1 calls for a program, not a specific sourcing model. If you use internal testers, document independence, qualifications, and separation of duties so results are credible. (CIS Controls v8; CIS Controls Navigator v8)

Can vulnerability scanning count as penetration testing?

A vulnerability scan is useful input, but it rarely satisfies a penetration testing program by itself because it does not validate exploit chains and real attacker paths. Keep both: scanning for breadth, pen testing for depth and validation. (CIS Controls v8; CIS Controls Navigator v8)

What systems should be first in scope?

Start with internet-facing applications and APIs, identity and access paths, and environments that store or process sensitive data. If you can’t justify scope, tie it back to your asset inventory and risk assessment. (CIS Controls v8; CIS Controls Navigator v8)

How do we prove remediation for auditors without sharing sensitive exploit details?

Keep the full report restricted, and provide auditors a controlled evidence pack: finding list, linked tickets, closure notes, and retest confirmation. Redact exploit strings and secrets, but preserve traceability from report to remediation. (CIS Controls v8; CIS Controls Navigator v8)

How do we handle penetration testing when production testing is risky?

Use strict ROE constraints, time windows, stop conditions, and safe testing techniques, and consider staging tests plus targeted production validation. Document why constraints exist and how you still validate real exposure. (CIS Controls v8; CIS Controls Navigator v8)

What does “maintain” mean in practice for this requirement?

“Maintain” means you have a repeatable lifecycle: defined triggers, a living test plan, executed engagements, tracked remediation, and periodic governance review with retained evidence. If any step is ad hoc, write it down and turn it into workflow. (CIS Controls v8; CIS Controls Navigator v8)

Frequently Asked Questions

Do we have to use an external third party for penetration testing?

Can vulnerability scanning count as penetration testing?

What systems should be first in scope?

How do we prove remediation for auditors without sharing sensitive exploit details?

How do we handle penetration testing when production testing is risky?

What does “maintain” mean in practice for this requirement?

Authoritative Sources

Operationalize this requirement

Map requirement text to controls, owners, evidence, and review workflows inside Daydream.

See Daydream

Regulatory text

Plain-English interpretation

Who it applies to (entity and operational context)

What you actually need to do (step-by-step)

1) Define the penetration testing program standard (your “control design”)

2) Build an annual (or rolling) test plan from real risk

3) Execute tests with clean governance

4) Convert findings into trackable work (tickets) with accountability

5) Retest and close (prove validation)

6) Program governance: measure, review, improve

Required evidence and artifacts to retain

Common exam/audit questions and hangups

Frequent implementation mistakes and how to avoid them

Enforcement context and risk implications

Practical 30/60/90-day execution plan

First 30 days (stand up the control)

Day 31–60 (run and operationalize)

Day 61–90 (close the loop and stabilize)

Frequently Asked Questions

Do we have to use an external third party for penetration testing?

Can vulnerability scanning count as penetration testing?

What systems should be first in scope?

How do we prove remediation for auditors without sharing sensitive exploit details?

How do we handle penetration testing when production testing is risky?

What does “maintain” mean in practice for this requirement?

Frequently Asked Questions

Authoritative Sources

Related Resources

Operationalize this requirement