Service Level Agreements
Overview
Service Level Agreements (SLAs) define a cloud service provider's (CSP's) commitments for availability, performance, and security, and they form a critical part of any cloud contract.
Learning Objectives
- Understand SLA components
- Define appropriate service levels
- Implement SLA monitoring
- Manage SLA violations
- Apply ISO 27017 SLA requirements
SLA Components
Core Elements
| Component | Description | Example |
|---|---|---|
| Availability | Uptime commitment | 99.9% monthly |
| Performance | Service response time (latency) | API < 200ms |
| Support | Time to first response on a ticket | Critical < 1 hour |
| Data Protection | Backup/recovery | RPO 1 hour, RTO 4 hours |
| Remedies | Compensation | Service credits |
Availability SLA
Uptime Calculations
| SLA Level | Downtime/Year | Downtime/Month | Downtime/Week |
|---|---|---|---|
| 99% | 3.65 days | 7.2 hours | 1.68 hours |
| 99.5% | 1.83 days | 3.6 hours | 50.4 minutes |
| 99.9% | 8.76 hours | 43.2 minutes | 10.1 minutes |
| 99.95% | 4.38 hours | 21.6 minutes | 5.04 minutes |
| 99.99% | 52.6 minutes | 4.32 minutes | 1.01 minutes |
| 99.999% | 5.26 minutes | 25.9 seconds | 6.05 seconds |
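The downtime figures above follow from multiplying the length of the period by the fraction of time the service is allowed to be unavailable. The sketch below reproduces that arithmetic, assuming a 365-day year and a 30-day month (the period lengths that match the table). The names `PERIODS` and `allowed_downtime` are illustrative and not part of any standard.

```python
from datetime import timedelta

# Period lengths assumed for this sketch: 365-day year, 30-day month.
PERIODS = {
    "year": timedelta(days=365),
    "month": timedelta(days=30),
    "week": timedelta(days=7),
}


def allowed_downtime(sla_percent: float, period: timedelta) -> timedelta:
    """Return the maximum downtime an availability target permits in a period."""
    if not 0 <= sla_percent <= 100:
        raise ValueError("sla_percent must be between 0 and 100")
    # Unavailable fraction = 1 - availability fraction.
    return period * (1 - sla_percent / 100)


if __name__ == "__main__":
    for level in (99.0, 99.9, 99.99):
        for name, period in PERIODS.items():
            minutes = allowed_downtime(level, period).total_seconds() / 60
            print(f"{level}% per {name}: {minutes:.2f} minutes")
```

Returning a `timedelta` rather than a bare number leaves the choice of display unit (days, hours, minutes) to the caller.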
ISO 27017 Control A.17.2.1
Availability of Information Processing Facilities
CSP Requirements:
- Define clear availability targets
- Implement redundancy
- Monitor availability continuously
- Report SLA compliance
- Provide compensation for violations
SLA Monitoring
Key Metrics
Performance Dashboard
┌─────────────────────────────────┐
│ Current Month SLA: 99.97%       │
├─────────────────────────────────┤
│ Uptime: 99.97% ✓                │
│ API Latency: 145ms avg ✓        │
│ Error Rate: 0.01% ✓             │
│ Support Response: 45min avg ✓   │
└─────────────────────────────────┘
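A dashboard like this one boils down to comparing measured values against the committed targets. The following is a minimal sketch of that comparison, using the example targets from the Core Elements table (99.9% uptime, API latency under 200ms, critical support response under 1 hour). The metric names, the `Target` class, and the 12.96 minutes of downtime are assumptions chosen for illustration; no error-rate target is checked because the source does not state one.

```python
import operator
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Target:
    """One SLA commitment: met when compare(measured, limit) is True."""
    limit: float
    compare: Callable[[float, float], bool]


# Limits taken from the Core Elements examples; key names are hypothetical.
TARGETS = {
    "uptime_percent": Target(99.9, operator.ge),
    "api_latency_ms": Target(200.0, operator.lt),
    "support_response_min": Target(60.0, operator.lt),
}


def availability_percent(total_minutes: float, downtime_minutes: float) -> float:
    """Measured availability as a percentage of the reporting period."""
    if total_minutes <= 0:
        raise ValueError("total_minutes must be positive")
    return 100 * (total_minutes - downtime_minutes) / total_minutes


def evaluate(measured: dict[str, float]) -> dict[str, bool]:
    """Map each measured metric to True (target met) or False (violated)."""
    return {
        name: TARGETS[name].compare(value, TARGETS[name].limit)
        for name, value in measured.items()
    }


if __name__ == "__main__":
    # Assumed sample: 12.96 minutes of downtime in a 30-day month.
    uptime = availability_percent(total_minutes=30 * 24 * 60, downtime_minutes=12.96)
    report = evaluate({
        "uptime_percent": uptime,
        "api_latency_ms": 145.0,
        "support_response_min": 45.0,
    })
    for name, ok in report.items():
        print(f"{name}: {'met' if ok else 'VIOLATED'}")
```

In practice the measured values would come from a monitoring system; hard-coding them here keeps the sketch self-contained.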
SLA Remedies
Service Credits Table
| Monthly Availability | Service Credit |
|---|---|
| 99.9% to < 99.95% | 10% |
| 99.0% to < 99.9% | 25% |
| 95.0% to < 99.0% | 50% |
| < 95.0% | 100% |
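Because the credit tiers nest, a measured availability should receive the credit of the lowest threshold it falls below. The sketch below applies the tiers from the table in that order. It assumes the credit is a percentage of the monthly fee, which is a common arrangement but is not stated in this section; the function names and the 2,000 fee are hypothetical.

```python
# (threshold, credit %) pairs from the table, sorted lowest threshold first
# so the most severe matching tier is found before the milder ones.
CREDIT_TIERS = [
    (95.0, 100),
    (99.0, 50),
    (99.9, 25),
    (99.95, 10),
]


def service_credit_percent(availability: float) -> int:
    """Return the credit percentage owed for a measured availability."""
    for threshold, credit in CREDIT_TIERS:
        if availability < threshold:
            return credit
    return 0  # at or above 99.95%: no credit is due


def service_credit_amount(availability: float, monthly_fee: float) -> float:
    """Credit amount, assuming credits are a share of the monthly fee."""
    return monthly_fee * service_credit_percent(availability) / 100


if __name__ == "__main__":
    for measured in (99.97, 99.93, 99.5, 98.0, 90.0):
        credit = service_credit_percent(measured)
        amount = service_credit_amount(measured, monthly_fee=2000.0)
        print(f"{measured}% availability: {credit}% credit ({amount:.2f})")
```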
Key Takeaways
- SLAs define clear service expectations
- Availability targets must be realistic
- Monitoring validates SLA compliance
- Remedies compensate for violations
- Transparency builds trust
Self-Assessment
- What is the difference between 99.9% and 99.99% uptime?
- What should an SLA include?
- How are service credits typically calculated?
- What ISO 27017 control addresses availability?
- Why is SLA monitoring important?