Operational Service Objectives

Service Level Objectives (SLOs)

RuntimeHQ is designed to provide dependable runtime-state delivery for customer-facing production applications during outages, degraded conditions, and planned maintenance events.

✓ 99.9% Monthly Uptime✓ Factual Communication✓ 4-Hour Critical Support✓ SRE Operational Integrity

Runtime-State API Availability

Availability Target

• Runtime-state API target availability: 99.9% monthly uptime

Scope

Availability targets apply strictly to edge resolved outputs:

  • Runtime-state API endpoints
  • Runtime-state fetch operations
  • SDK runtime-state resolution requests

Exclusions

Uptime calculations exclude:

  • Customer application-level outages
  • Third-party cloud infrastructure outages outside RuntimeHQ control
  • Scheduled, communicated maintenance windows
  • Global internet or client DNS connectivity failures

Incident Response Objectives

Incident Acknowledgement

  • Critical operational incidents affecting edge delivery are acknowledged internally as quickly as possible after automated detection logs trigger.

Customer Communications

  • Customer-impacting RuntimeHQ incidents are communicated through verified official status channels.
  • Major operational disruptions are updated periodically until final mitigation and resolution reports are compiled.

Runtime-State Integrity Priority

During active internal platform incidents, we prioritize SRE mitigations in this sequence:

1. Edge State Delivery
2. Operational Consistency
3. API Uptime
4. Audit History Preservation

Planned Maintenance Policy

Maintenance Scheduling

  • Planned maintenance is scheduled during low-traffic operational windows to minimize potential integration impacts.

Notice Target

  • Advance notice is provided for planned maintenance expected to impact organizational dashboard controls.

Notifications Content

Maintenance alerts outline:

• Expected duration• Target systems impacted• Admin console impacts• SRE mitigation guidance

Emergency Maintenance

  • Performed immediately without notice when critical operational stability, system availability, or network security requires urgent action.

Support Response Targets

Operational support requests are actively monitored during standard business hours. Initial response targets are managed dynamically by priority:

Critical Operational IssuesWithin 4 Hours

Core edge state delivery degradations, API outages, or administrative lockout events.

High Priority IssuesWithin 1 Day

Scheduled maintenance scheduling problems, targeting issues, or sandbox credential bugs.

General QuestionsWithin 2 Days

Integration SDK questions, schema designs, account administration, or billing inquiries.

Incident Communication

Communication Principles

We provide calm, factual status metrics without marketing adjectives, ensuring clear SRE impact visibility.

Incident Update Content

Updates detail affected applications, operational impacts, active mitigations, and expected timing for the next status sweep.

Post-Incident Review

Major platform incidents undergo a detailed post-incident review to improve recovery procedures and edge reliability.

Operational Retention

Audit History

Operational logs are preserved to support incident visibility and SRE timeline reconstructions.

Archived Records

Archived application configs and active keys are preserved internally to safeguard audit trace integrity.

Data Preservation Standards

Audit history databases are append-only to prevent tampering or accidental history pruning during active reviews.

Service Scope Limits

To maintain absolute focus on edge reliability, RuntimeHQ strictly does not provide general operations tooling:

✖ No Incident Paging or alerts
✖ No Developer Responder coordination
✖ No Infrastructure observability monitoring
✖ No Alert management systems
✖ No Customer IT support ticketing

RuntimeHQ strictly focuses on propagating runtime state variables securely at the Edge.

SLA Evolution

Service Commitments Evolve

Operational objectives and support commitments are expected to evolve alongside infrastructure maturity, enterprise adoption scales, and customer operational requirements. Updated commitments will be communicated through official RuntimeHQ channels.

FAQ

Service Commitments FAQ

Review key answers regarding uptime targets, maintenance notification windows, and API availability scopes.

What uptime target does RuntimeHQ provide?
RuntimeHQ targets 99.9% monthly API availability for production runtime-state APIs.
How quickly does RuntimeHQ communicate operational incidents?
RuntimeHQ aims to acknowledge critical production-impacting incidents within defined operational response windows.
Does RuntimeHQ provide planned maintenance notifications?
Yes. RuntimeHQ aims to provide advance notice for planned maintenance activities whenever operationally feasible.
Does RuntimeHQ guarantee uninterrupted service?
No infrastructure platform can guarantee uninterrupted operation. RuntimeHQ focuses on dependable operational runtime-state delivery and operational transparency.
How are RuntimeHQ outages communicated?
RuntimeHQ communicates operational incidents through official customer communication channels and operational support workflows.
Does RuntimeHQ provide enterprise SLAs?
Enterprise SLA arrangements may be discussed for larger operational environments.
Are API rate limits part of the SLA?
RuntimeHQ may apply rate limiting and operational protections to preserve infrastructure stability.
Does RuntimeHQ provide disaster recovery protections?
RuntimeHQ maintains operational backup and infrastructure recovery practices designed to support service continuity.
Enterprise SLOs

Evaluate high-availability operational runtime states

Connect your platforms with reliable edge state controls. Set up a dedicated technical discussion to discuss regional architectures, replication metrics, and SLA coverage.

Book technical discussion

No sales pitch. Connect directly with our platform SREs to evaluate integration feasibility.

runtimehq-sre-shell
$ echo $SRE_EMAIL
Recommended Context

To help us prepare for our discussion, please share some context on your current outage banner workflow, such as:

  • Hardcoded banner deployments
  • Toggling CMS content manually
  • Editing feature flags during active incidents