Reliability Operations, Reimagined

Monitor faster, resolve smarter, and show trust in real time.

Alertum combines uptime checks, incident response, escalation routing, and status communication in one visual command center, so teams move from signal to action without context switching.

Latency p95 142ms
Error intake synced
Escalation SLA 99.4%
Status page green

Live Reliability Pulse

Global Service Health

Uptime
99.97%
api-gateway
Recovered · 2m ago
checkout-flow
Investigating · 6m ago
worker-jobs
Stable · 9m ago
Monitoring types
5

Website, API, SSL, Ping, TCP

Incident severities
4

Critical, High, Medium, Low

Incident states
4

Open, Acknowledged, Resolved, Reopened

Analytics windows
5

1h, 24h, 7d, 30d, 365d

Native integrations
10

Chat, webhook, and API channels

Plans
4

Free, Solo, Team, Business

Platform Modules

Visual Coverage Across Reliability Work

Open module deep dive

Monitors

Track website availability, API health, SSL expiration, ICMP reachability, and TCP port availability.

  • Status code, response-time, and payload assertions
  • Flexible check intervals and failure/recovery thresholds
Faster outage detection · Consistent endpoint health baselines
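As an illustration of how assertions and failure/recovery thresholds might interact, here is a minimal Python sketch. It is not Alertum's implementation: the class name `ThresholdTracker`, the function `check_passes`, the default thresholds (3 failures, 2 recoveries), and the specific assertions (2xx status, under 500 ms, body containing "ok") are all assumptions made for the example.

```python
from dataclasses import dataclass


def check_passes(status_code: int, response_ms: float, body: str) -> bool:
    """Assumed assertions: 2xx status, response under 500 ms, body contains 'ok'."""
    return 200 <= status_code < 300 and response_ms < 500 and "ok" in body


@dataclass
class ThresholdTracker:
    """Hypothetical up/down tracker driven by consecutive check results."""

    failure_threshold: int = 3   # assumed: consecutive failures before "down"
    recovery_threshold: int = 2  # assumed: consecutive successes before "up"
    status: str = "up"
    _streak: int = 0             # consecutive results that contradict `status`

    def record(self, check_passed: bool) -> str:
        # A result "contradicts" the current status when it is a failure
        # while up, or a success while down.
        contradicts = check_passed if self.status == "down" else not check_passed
        if not contradicts:
            self._streak = 0
            return self.status
        self._streak += 1
        limit = (
            self.recovery_threshold
            if self.status == "down"
            else self.failure_threshold
        )
        if self._streak >= limit:
            self.status = "up" if self.status == "down" else "down"
            self._streak = 0
        return self.status
```

Requiring several consecutive contradicting results before flipping the status is one common way to keep a single slow or failed check from opening an incident.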

Incidents

Detect, classify, and resolve incidents from monitoring failures, integrations, and journey errors.

  • Severity levels and assignment workflows
  • Automatic and manual grouping support
Lower alert noise · Clear ownership during incidents

Heartbeats

Confirm that cron jobs, workers, and scheduled tasks are running by expecting a ping within each interval plus a grace window.

  • Missed heartbeat detection
  • Dedicated ping endpoint per heartbeat
Early job-failure visibility · Reduced silent data pipeline failures
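The interval-plus-grace rule can be sketched in a few lines of Python. This is an illustrative reading of the rule, not Alertum's code: the function name `heartbeat_missed` and the choice to treat a ping as missed only strictly after the deadline are assumptions.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def heartbeat_missed(
    last_ping: datetime,
    interval: timedelta,
    grace: timedelta,
    now: Optional[datetime] = None,
) -> bool:
    """Return True once the next ping is overdue.

    Assumed rule: the deadline is last_ping + interval + grace, and the
    heartbeat counts as missed only after that deadline has passed.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    return now > last_ping + interval + grace
```

For a job expected every 5 minutes with a 1-minute grace window, a ping that arrives 6 minutes after the previous one would still be on time under this reading.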

Synthetic Journeys

Run multi-step synthetic flows to validate end-to-end user journeys, not only single endpoints.

  • HTTP, assert, and wait steps
  • Frequency and max-duration controls
Business-flow verification · Root-cause clues for user-impacting issues

Status Pages

Publish service health externally with dedicated status pages linked to selected monitors.

  • Public and private visibility modes
  • Monitor and incident communication surface
Higher customer trust · Lower inbound support load

Maintenance Windows

Schedule planned downtime, set clear windows, and communicate expected impact in advance.

  • Upcoming, active, and completed state handling
  • Duration-aware maintenance timeline
Predictable change windows · Reduced false-positive incident noise
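The three maintenance states listed above follow naturally from a window's start and end times. A minimal Python sketch, assuming a hypothetical `maintenance_state` helper and assuming the window is active from its start time up to, but not including, its end time:

```python
from datetime import datetime, timezone


def maintenance_state(start: datetime, end: datetime, now: datetime) -> str:
    """Classify a maintenance window relative to `now`.

    Boundary handling is an assumption: start is inclusive, end is exclusive.
    """
    if end <= start:
        raise ValueError("maintenance window must end after it starts")
    if now < start:
        return "upcoming"
    if now < end:
        return "active"
    return "completed"
```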

On-Call Scheduling

Manage rotation schedules and maintain 24/7 ownership for incident response.

  • Current and next on-call visibility
  • Weekly timeline and rotation forecasting
Always-defined responders · Stronger escalation reliability

Escalation Policies

Define structured, multi-level routing with timed delays and channel-based notification steps.

  • Targets: users and on-call schedules
  • Channels include Email, Slack, PagerDuty, Webhook, SMS, and phone call
Less manual paging · Consistent incident response paths
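To make multi-level routing with timed delays concrete, here is a small Python sketch of how a policy could be modeled. The names `EscalationLevel`, `EXAMPLE_POLICY`, and `due_levels` are hypothetical, the delays are invented for the example, and the rule that acknowledging an incident stops further escalation is an assumption rather than documented behavior.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EscalationLevel:
    delay_minutes: int         # minutes after the incident opens before notifying
    targets: tuple[str, ...]   # users or on-call schedules
    channels: tuple[str, ...]  # e.g. Email, Slack, PagerDuty, Webhook, SMS, phone call


# Hypothetical two-level policy: page the on-call schedule immediately,
# then a named user after 10 minutes.
EXAMPLE_POLICY = [
    EscalationLevel(0, ("schedule: primary on-call",), ("Slack", "Email")),
    EscalationLevel(10, ("user: team lead",), ("SMS", "phone call")),
]


def due_levels(
    policy: list[EscalationLevel], minutes_open: int, acknowledged: bool
) -> list[EscalationLevel]:
    """Return the levels whose delay has elapsed for an open incident.

    Assumption: once the incident is acknowledged, no level is due.
    """
    if acknowledged:
        return []
    return [level for level in policy if level.delay_minutes <= minutes_open]
```

With this policy, an unacknowledged incident that has been open for 5 minutes has only the first level due; at 10 minutes both levels are due.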

Integrations + Error Intake

Connect Alertum to chat tools, webhooks, and APIs, and ingest app errors directly as incidents.

  • Ingestion endpoint authenticated with a team API key
  • Payload context: project, service, environment, group
Unified alert distribution · Faster triage from app-level events
Incident Orchestration

Signal to Resolution Timeline

1. Detect

Monitors, heartbeats, journeys, and integrations create incident signals.

Owner: Monitoring system
Output: New incident with source context

2. Qualify

Teams set severity, group related incidents, and assign ownership.

Owner: First responder
Output: Prioritized and scoped incident

3. Route

Escalation policy levels notify users, schedules, and integration channels.

Owner: Escalation engine
Output: Right people notified at the right time

4. Resolve

Responders coordinate actions, update status, and verify recovery.

Owner: On-call and service owners
Output: Incident resolved with timeline history

5. Communicate

Status pages and maintenance context communicate customer-facing updates.

Owner: Operations and support
Output: Transparent external communication
Read full operations model
Governance and Access

Security by Design

Roles: Granular Access
Identity: Password + 2FA
Coverage: 24/7 Response

  • Role-based team access with Editor, Administrator, and Consultant roles
  • Account security settings with password management and optional two-factor authentication
  • Audit logs, incident exports, and reporting features based on plan entitlements
  • Private and white-label status page capabilities available by plan
  • Granular escalation target management (users and on-call schedules)

Error Intake Context
status, message, project, service, environment, group, severity, details.
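A payload carrying these context fields could be assembled as in the Python sketch below. The field names come from the list above and the severity values from the four incident severities; everything else is an assumption, including the helper name `build_error_payload`, the JSON encoding, the exact casing of severity values, and which fields are optional. The endpoint URL and the way the team API key is sent are not specified here, so the sketch stops at building the request body.

```python
import json

# Field names as listed in the Error Intake Context.
FIELDS = (
    "status", "message", "project", "service",
    "environment", "group", "severity", "details",
)
# Assumed accepted values, taken from the four incident severities.
SEVERITIES = {"Critical", "High", "Medium", "Low"}


def build_error_payload(**context: object) -> str:
    """Serialize known context fields to JSON, rejecting unknown ones."""
    unknown = set(context) - set(FIELDS)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    severity = context.get("severity")
    if severity is not None and severity not in SEVERITIES:
        raise ValueError(f"unsupported severity: {severity!r}")
    # Emit fields in the documented order, skipping any that were omitted.
    return json.dumps({name: context[name] for name in FIELDS if name in context})
```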

Integration Catalog

Native Connectors

Start with your workspace
Slack · Chat · Incident updates in channels
Google Chat · Chat · Team space notifications
Telegram · Chat · Mobile-first alert delivery
Discord · Chat · Operational notification rooms
Teams · Chat · Microsoft ecosystem alerting
Webhook · Webhook · Custom event routing
Splunk · API · Event analytics and search
Pushbullet · Chat · Device and channel pushes
Pushover · Chat · Real-time app notifications
New Relic · API · Alert ingestion via webhook

Plan Structure

Scale by Team Maturity, Not Guesswork

Compare all plans
Free

Launch monitoring fundamentals

1 seat included

Best for: New projects and initial reliability setup

  • Core monitoring and incident workflows
  • Single-team baseline for early-stage operations
  • Upgrade path into deeper reliability tooling
Solo

Paid plan for individual operators

1 seat included, +EUR 6 per extra seat

Best for: Independent operators and small workloads

  • Expanded operational limits over Free
  • Designed for single-operator ownership
  • Additional team seats available with per-seat pricing
Team
Popular

Cross-team operational coordination

3 seats included, +EUR 7 per extra seat

Best for: Growing teams with production ownership

  • Shared incident workflows across teammates
  • Error Intake capability for direct event ingestion
  • Greater scale for monitors, integrations, and reporting
Business

Advanced reliability operations

5 seats included, +EUR 8 per extra seat

Best for: Critical services and larger operations teams

  • On-call scheduling and enterprise response coordination
  • Highest capacity for teams and advanced features
  • Built for mature, always-on service organizations
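The per-seat figures above lend themselves to a quick calculation. The Python sketch below computes only the extra-seat charge, because base plan prices and the billing period are not stated here. The `PLANS` table mirrors the included-seat counts and extra-seat prices listed for each plan; the helper name `extra_seat_cost_eur` is hypothetical, and treating Free as having no purchasable extra seats is an assumption based on no price being listed for it.

```python
from typing import Optional

# (included seats, EUR per extra seat); None means no extra-seat price is listed.
PLANS: dict[str, tuple[int, Optional[int]]] = {
    "Free": (1, None),
    "Solo": (1, 6),
    "Team": (3, 7),
    "Business": (5, 8),
}


def extra_seat_cost_eur(plan: str, seats: int) -> int:
    """Return the extra-seat charge in EUR for a team of `seats` people."""
    included, per_extra = PLANS[plan]
    extra = max(0, seats - included)
    if extra == 0:
        return 0
    if per_extra is None:
        raise ValueError(f"{plan} lists no extra-seat price")
    return extra * per_extra
```

For example, a five-person team on the Team plan has two seats beyond the three included, so the extra-seat charge is 2 × EUR 7 = EUR 14, while the same team fits within the five seats included in Business.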

Plan Feature Matrix

Every row below is loaded from backend plan configuration and marked with tick/X by plan.

Detail Pages

Go Deeper Into Each Operating Layer

Final Layer

Operate Reliability with One Visual Command Center

Build one shared workflow for monitoring, incidents, escalations, and customer trust communication.