SOC2

SOC 2 Continuous Monitoring: Maintaining Compliance Year-Round

SOC 2 compliance doesn't end when you get your report. Learn how to build a continuous monitoring program that keeps your controls effective and makes annual audits painless.

GT

GRCTrail Team

SOC 2 Continuous Monitoring Guide

The biggest mistake SaaS companies make after receiving their first SOC 2 report is treating it as a completed project. The report arrives, the team celebrates, and compliance fades into the background until the next audit cycle begins β€” at which point three months of frantic preparation ensues, evidence gaps are discovered, and control failures that could have been caught in January surface in October.

SOC 2 Type II examines the operating effectiveness of your controls over the entire observation period β€” typically 12 months. An access review that was completed in every month except July is an audit finding. A vulnerability that went unpatched for 90 days because nobody was tracking the remediation SLA is an audit finding. A vendor whose SOC 2 report expired six months ago without renewal is an audit finding. Continuous monitoring catches these issues when they are correctable, not when they are evidence of failure.

This guide covers what to monitor, how to build a monitoring system that scales with your SaaS company, the metrics that matter, and how to handle the inevitable moments when monitoring reveals that a control is not working.

Why Continuous Monitoring Matters for SOC 2

Type II covers the full observation period. A Type II audit does not evaluate your controls on the day the auditor arrives. It evaluates whether those controls operated effectively throughout the entire observation period. If your MFA enforcement was disabled for two weeks in March while you migrated identity providers, that gap will appear in your report β€” either as an exception or as a qualified opinion, depending on severity. Continuous monitoring ensures you know about gaps when they occur, not when the auditor finds them.

Controls must work 365 days per year. A quarterly access review that was conducted in Q1, Q2, and Q4 but missed in Q3 is not 75% compliant. It is a control failure for the entire quarter. SOC 2 does not grade on a curve. Either the control operated as designed during the observation period, or it did not. Continuous monitoring verifies that controls are functioning according to their defined cadence.

Early detection enables remediation. When monitoring detects a control failure in real time, you have the opportunity to remediate it, document the remediation, and demonstrate to your auditor that the failure was an isolated event rather than a systemic problem. A control that failed once but was detected and corrected within days is categorized very differently in an audit report than a control that failed silently for months.

Audit preparation drops from months to days. SaaS companies with mature continuous monitoring programs do not have an β€œaudit prep” phase. Evidence is collected continuously, control status is visible in real time, and the auditor receives a complete evidence package on day one. This reduces audit duration, auditor fees, and the operational disruption caused by pulling engineers into evidence collection.

What to Monitor β€” Control Categories

Your monitoring program must cover every control domain in your SOC 2 scope. The specific controls vary by organization, but the following categories apply to virtually every SaaS company.

Access Controls

Access control failures are among the most common SOC 2 findings. Monitoring must cover the full lifecycle of user access.

User provisioning and deprovisioning timeliness. When an employee is hired, how quickly do they receive appropriate access? More critically, when an employee is terminated, how quickly is access revoked? SOC 2 expects deprovisioning within a defined SLA β€” typically within 24 hours of termination. Monitor your identity provider and HR system for gaps between termination date and access revocation. Your policies and procedures should define the expected SLA, and your monitoring should verify compliance.

MFA enforcement across all systems. Multi-factor authentication is a baseline SOC 2 control. Monitor for users or service accounts that do not have MFA enabled, and for systems that allow authentication without MFA. Pay particular attention to newly provisioned accounts (MFA may not be enforced by default in all systems) and service accounts (which are often excluded from MFA policies but should not have interactive login capability).

Quarterly access reviews completed and documented. Access reviews verify that users still require the access they have been granted. Monitor that reviews are conducted on schedule, that reviewers actually evaluate each access entry (not just rubber-stamp the list), and that identified revocations are executed. Track the percentage of access entries reviewed and the percentage of flagged entries remediated.

Privileged access usage and justification. Production database access, cloud console admin access, and root/sudo usage should be monitored continuously. Every instance of privileged access should have a corresponding justification β€” a ticket, an incident, a maintenance window. Unexplained privileged access is a red flag for auditors and a potential indicator of compromise.

Service account management. Service accounts often accumulate excessive permissions over time and lack the monitoring applied to human accounts. Track service account inventory, permissions, key rotation schedules, and usage patterns.

Change Management

Change management controls ensure that modifications to your systems are authorized, tested, and deployed safely.

All code changes have peer review before deployment. Monitor your code repository for merges to production branches that bypassed required reviews. GitHub branch protection rules, GitLab merge request approvals, and similar controls should be enforced at the platform level β€” but monitoring verifies they have not been circumvented or temporarily disabled.

No direct production access. Engineers should deploy code through your CI/CD pipeline, not by SSH-ing into production servers. Monitor for direct production access events and ensure each one has a documented justification (emergency change, incident response, etc.). If your architecture requires direct production access for certain operations, document those exceptions and monitor that access is limited to approved scenarios.

Change approval workflows functioning. For changes that require approval beyond code review (infrastructure changes, configuration changes, database migrations), verify that approvals are obtained before deployment. Monitor your ticketing system and deployment pipeline for changes deployed without the required approvals.

Emergency change procedures followed correctly. Emergency changes β€” hotfixes deployed outside normal processes due to an active incident β€” are acceptable, but they require retrospective documentation and review. Monitor for emergency changes and verify that each one has a corresponding incident record, post-deployment review, and management acknowledgment.

System Operations

System operations monitoring covers the ongoing health and security of your infrastructure.

Vulnerability scanning cadence met. Your risk assessment likely identified vulnerability management as a key control. Monitor that scans are running on schedule β€” weekly for infrastructure, on every build for application dependencies, and as defined for container images. A missed scan is a missed detection opportunity.

Critical and high vulnerabilities remediated within SLA. Finding vulnerabilities means nothing if they are not fixed. Define remediation SLAs by severity (e.g., Critical: 7 days, High: 30 days, Medium: 90 days) and monitor compliance. Track the age of every open vulnerability and alert when SLAs are approaching or breached.

Endpoint protection deployed across all devices. Every employee device that accesses corporate systems or customer data should have endpoint protection installed and updated. Monitor your endpoint management platform for devices missing protection, devices with outdated signatures, or devices that have not checked in recently.

Backup completion and integrity verification. Backups must complete successfully on their defined schedule, and they must be tested to confirm recoverability. Monitor for backup failures and track the last successful restore test for each critical system. A backup that has never been tested is not a backup β€” it is a hope.

Uptime and performance against SLAs. If your SOC 2 includes the Availability criterion, monitor service uptime and performance metrics against your published SLAs. Track incidents, downtime minutes, and SLA compliance rates monthly.

Vendor Management

Your SOC 2 compliance depends partly on your vendors’ compliance. Monitoring must extend to your third-party ecosystem.

Vendor SOC 2 reports current. Track the expiration dates of your critical vendors’ SOC 2 reports and flag renewals that are overdue. A vendor whose report expired six months ago may have experienced control degradation that affects your data. See our vendor management guide for the full vendor assessment framework.

New vendors assessed before onboarding. Every new vendor that will access, store, or process customer data must be assessed before they go live. Monitor your procurement and vendor onboarding process to ensure security assessments are completed β€” not skipped because the integration is β€œurgent.”

Vendor changes monitored. Track changes in vendor security posture β€” breaches reported in the news, changes to their SOC 2 scope, or modifications to their sub-processor list. These changes may affect the risk profile you documented in your risk register.

Training and Awareness

People are both your strongest control and your greatest vulnerability. Training effectiveness must be monitored.

Security awareness training completion rates. Track which employees have completed required training and which are overdue. A 95% completion rate means 5% of your workforce has not received security training β€” and auditors will ask about the gap. Target 100% completion within the defined timeframe (typically within 30 days of hire and annually thereafter).

Policy acknowledgments current. Employees should acknowledge key policies (information security, acceptable use, data handling) annually. Monitor acknowledgment rates and follow up on outstanding acknowledgments before the audit window.

Phishing simulation results. If you run phishing simulations (and you should), track click rates, reporting rates, and trends over time. Improving trends demonstrate that your training program is effective. Worsening trends indicate a problem that needs attention before your auditor asks about it.

Building Your Monitoring System

Effective monitoring requires both automated checks and periodic human reviews. Neither is sufficient alone.

Automated Monitoring

Automation is the backbone of continuous monitoring. Manual checks do not scale, and they are prone to the same human errors they are meant to detect.

Integrate with your existing tools. Your monitoring system should pull data from the tools your team already uses:

  • Cloud provider β€” AWS Config Rules, AWS Security Hub, GCP Security Command Center, Azure Policy. These evaluate your cloud configuration against security baselines continuously.
  • Identity provider β€” Okta, Azure AD, Google Workspace. Pull user status, MFA enrollment, group membership, and last login data.
  • Code repository β€” GitHub, GitLab, Bitbucket. Monitor branch protection rules, review requirements, and merge activity.
  • Ticketing system β€” Jira, Linear, Asana. Track change approvals, incident records, and remediation task completion.
  • Endpoint management β€” Jamf, Intune, CrowdStrike. Monitor device compliance, protection status, and encryption.
  • Vulnerability scanner β€” Qualys, Rapid7, Snyk, Dependabot. Pull scan results, vulnerability counts, and remediation timelines.

Set up automated checks for critical controls:

  • MFA status for all users (daily)
  • Access review completion tracking (weekly)
  • Vulnerability scan execution (per schedule)
  • Backup completion status (daily)
  • Branch protection rule status (daily)
  • Endpoint protection coverage (daily)
  • Certificate expiration (daily)
  • Cloud configuration compliance (continuous)

Alert on deviations from expected behavior. Configure alerts for conditions that indicate a control is not operating as designed: MFA disabled for a user account, a merge to production without required approvals, a backup failure, or a vulnerability scan that did not execute on schedule. Route alerts to the appropriate owner with sufficient context to take action.

Periodic Reviews

Automated monitoring catches control failures in real time. Periodic reviews catch systemic issues, trends, and process gaps that individual alerts miss.

Monthly reviews:

  • Access review completion for the month β€” were all scheduled reviews conducted?
  • Vulnerability status β€” open vulnerability count by severity, SLA compliance rate, aging analysis
  • Evidence completeness β€” are all expected evidence artifacts being collected and stored?
  • Incident review β€” were all incidents handled according to procedures? (link to incident response)
  • Alert review β€” are alert volumes reasonable? Are false positive rates acceptable? Is alert fatigue developing?

Quarterly reviews:

  • Risk register update β€” have new risks emerged? Have existing risk scores changed? (link to risk assessment)
  • Vendor review β€” are vendor SOC 2 reports current? Have any vendors experienced security events?
  • Policy review β€” are policies still accurate and reflective of current practices?
  • Metrics trend analysis β€” are key compliance metrics improving, stable, or degrading?

Annual reviews:

  • Full risk assessment β€” comprehensive reassessment of all risks, not just incremental updates
  • Policy refresh β€” formal review and approval of all security and compliance policies
  • Control framework review β€” evaluate whether the current control set is sufficient given changes in the threat landscape, your architecture, and your business
  • Training program evaluation β€” assess training effectiveness and update content

Evidence Collection Integration

Continuous monitoring should automatically generate the audit evidence your auditor will request. This is not a separate workstream β€” it is a natural output of your monitoring activities.

Every automated check produces a timestamped record. When your system verifies that MFA is enforced for all users at 2:00 AM on March 11, that check result is evidence. When your access review tracking shows that Q1 reviews were completed by all managers on March 15, that is evidence. Store these records in a centralized, tamper-evident system.

Map evidence to control objectives. Each piece of evidence should be tagged with the SOC 2 control it supports. When your auditor requests evidence for CC6.1 (logical access controls), you should be able to produce a complete evidence set with a single query β€” not spend days searching through email, Slack, and shared drives.

Store evidence in a centralized, auditor-accessible system. Scattered evidence β€” some in Google Drive, some in Confluence, some in email archives β€” is operationally painful and creates risk of gaps. Use a centralized compliance platform where all evidence is stored, organized by control, and accessible to your auditor.

Metrics and Dashboards

What gets measured gets managed. Define key compliance metrics, track them over time, and make them visible to leadership.

Key SOC 2 compliance metrics to track:

  • Control effectiveness rate β€” Percentage of controls operating as designed. Target: 100%. Reality: track every deviation, no matter how minor, and trend toward 100% over time.
  • Mean time to remediate findings β€” How quickly do you fix control failures once detected? A low MTTR demonstrates that your monitoring is effective and your team is responsive.
  • Evidence collection completeness β€” Percentage of required evidence artifacts that are collected and current. Gaps here indicate monitoring blind spots.
  • Access review completion rate β€” Percentage of scheduled access reviews completed on time. This is one of the most frequently tested controls.
  • Vulnerability remediation SLA compliance β€” Percentage of vulnerabilities remediated within defined SLAs, broken down by severity.
  • Training completion rate β€” Percentage of employees current on required training.
  • Vendor assessment currency β€” Percentage of critical vendors with current risk assessments and SOC 2 reports.

Dashboard for management visibility. Build a compliance dashboard that gives your CISO, CTO, or VP of Engineering a real-time view of your compliance posture. The dashboard should make it immediately obvious where things are green (operating as expected), yellow (approaching a deadline or threshold), and red (a control failure or SLA breach requiring attention).

Trend analysis. A single point-in-time metric tells you where you are. A trend tells you where you are heading. Track all key metrics monthly and analyze trends quarterly. Improving trends demonstrate program maturity to auditors. Degrading trends are early warning signals that require investigation before they become findings.

Handling Monitoring Failures

When monitoring detects that a control is not operating as designed β€” and it will β€” your response process determines whether the failure becomes an audit finding or a demonstration of program effectiveness.

Document the failure immediately. Record what happened, when it was detected, and what the potential impact is. Do not wait to understand root cause before documenting β€” capture the initial detection facts and update the record as investigation progresses.

Investigate root cause. Determine why the control failed. Was it a one-time human error (an engineer forgot to add a reviewer to a PR)? A systemic process gap (the onboarding checklist does not include MFA enrollment)? A technical failure (the automated check was not running due to an expired API token)? Root cause determines the appropriate remediation.

Remediate and document the fix. Fix the immediate failure and address the root cause to prevent recurrence. Document both actions with timestamps. For example: β€œMFA was not enforced for user X due to a configuration error in Okta group assignment. User’s MFA was enabled on March 11 at 14:30 UTC. Okta group assignment automation was updated to prevent recurrence on March 12 at 10:00 UTC.”

Assess whether it impacts the SOC 2 report. Not every control failure becomes an audit exception. Isolated incidents that were promptly detected and remediated are typically described in the report as operating effectively with a noted deviation. Systemic failures or prolonged gaps are more likely to result in exceptions or a qualified opinion. If you are uncertain about the impact, discuss it with your auditor proactively.

Proactively disclose to your auditor if material. If a control failure is significant β€” affecting multiple customers, persisting for an extended period, or involving a core security control β€” disclose it to your auditor before they discover it during testing. Proactive disclosure demonstrates transparency and gives you the opportunity to present the failure alongside your remediation. Auditors respond far more favorably to honest disclosure than to discovering failures that appear to have been concealed.

Update the risk register. Every control failure is relevant to your risk assessment. Did the failure reveal a risk you had not identified? Was a risk scored too low? Does the failure indicate that a control is not as effective as you assumed? Update your risk register to reflect what you learned.

Common Mistakes

Monitoring only during audit season. SaaS companies that ramp up monitoring three months before the audit and scale it back afterward defeat the purpose of continuous monitoring entirely. A Type II observation period covers 12 months. Gaps in any month are visible in the evidence β€” or more precisely, the absence of evidence during those months tells the auditor exactly when you were not monitoring.

Too many alerts causing alert fatigue. A monitoring system that generates hundreds of alerts per day is a monitoring system that gets ignored. Tune your alerts ruthlessly. Every alert should require action. If an alert fires regularly and the response is always β€œignore,” the alert threshold is wrong. Alert fatigue is a security risk β€” the real issue gets buried in noise, and your team stops paying attention.

No defined remediation SLAs. Detecting a control failure is only half the job. If there is no defined timeline for remediation, failures linger unresolved. Define SLAs for remediation by severity: critical control failures remediated within 24 hours, high within one week, moderate within 30 days. Track compliance with these SLAs as a key metric.

Not tracking metrics over time. A snapshot metric tells you today’s status. A trend tells you whether your program is improving or degrading. Without historical metrics, you cannot identify patterns (access review completion rates dropping each quarter), demonstrate improvement to auditors, or justify investments in additional tooling or headcount.

Monitoring control existence but not effectiveness. Verifying that a WAF is deployed is not the same as verifying that the WAF is blocking malicious traffic. Verifying that an access review was conducted is not the same as verifying that inappropriate access identified during the review was actually revoked. Mature monitoring programs check not just whether controls exist but whether they are producing the intended security outcomes.

How GRCTrail Helps

GRCTrail transforms continuous monitoring from a manual, spreadsheet-driven exercise into an automated system that runs in the background while your team focuses on building product.

  • Automated control monitoring with integrations that connect to your cloud provider, identity provider, code repository, and other tools to continuously verify control operation without manual checks
  • Compliance dashboard with real-time control status showing every control as green, yellow, or red β€” so your leadership team has visibility into compliance posture at any time, not just during audit prep
  • Automated evidence collection and storage that captures timestamped evidence from every monitoring check and organizes it by SOC 2 control, eliminating the end-of-year evidence scramble
  • Alert management with escalation workflows that route control failures to the right owner with context, track remediation SLAs, and escalate unresolved issues before they become audit findings
  • Audit-ready reporting at any time β€” generate a complete evidence package for your auditor on demand, covering any date range within your observation period

Get started with GRCTrail β†’

#soc-2 #continuous-monitoring #compliance #saas #security #automation