What is Observability?
Observability is the ability to understand what's happening inside a system by looking at what it produces: metrics, logs, traces, and events. It's the difference between knowing that something is broken and knowing why.
The Four Pillars
Monitoring vs Observability
Monitoring                              Observability
──────────────────────────────────────  ──────────────────────────────────────
"Is the server up?"                     "Why is the checkout page slow?"
Predefined checks                       Explore any question
Dashboards you built in advance         Query any data on the fly
Alerts on known thresholds              AI detects unknown anomalies
Separate tools for metrics/logs/traces  One platform, one query language
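The contrast above can be sketched in a few lines of Python. This is only an illustration, not how Dynatrace works internally: the `EVENTS` records, their field names, and both helper functions are hypothetical. The first function answers one question fixed in advance; the second groups the raw data by whatever fields you choose at query time.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical request events; the field names are illustrative only.
EVENTS = [
    {"service": "checkout", "region": "eu", "duration_ms": 1200, "status": 200},
    {"service": "checkout", "region": "eu", "duration_ms": 1100, "status": 200},
    {"service": "checkout", "region": "us", "duration_ms": 180, "status": 200},
    {"service": "search", "region": "eu", "duration_ms": 90, "status": 200},
    {"service": "search", "region": "us", "duration_ms": 110, "status": 500},
]


def server_is_up(events):
    """Monitoring-style check: a fixed question decided in advance."""
    return any(e["status"] < 500 for e in events)


def mean_duration_by(events, *keys):
    """Observability-style query: group raw events by any fields, chosen on the fly."""
    groups = defaultdict(list)
    for e in events:
        groups[tuple(e[k] for k in keys)].append(e["duration_ms"])
    return {group: mean(values) for group, values in groups.items()}


print(server_is_up(EVENTS))                            # the predefined check
print(mean_duration_by(EVENTS, "service"))             # which service is slow?
print(mean_duration_by(EVENTS, "service", "region"))   # ...and in which region?
```

The predefined check reports that the system is up, yet says nothing about why checkout feels slow. Slicing the same events by service and then by region narrows the slowdown to checkout traffic in one region, a question nobody had to anticipate when the data was collected.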
Why Dynatrace?
Most observability tools require you to instrument code, build dashboards, and manually correlate data. Dynatrace is different:
- Automatic discovery: OneAgent finds every host, process, service, and dependency without configuration
- Smartscape: a real-time topology map showing how everything connects
- Davis AI: automatically detects anomalies and finds root causes across the full stack
- Grail: one data lakehouse for metrics, logs, traces, events, and entities
- DQL: one query language for everything (no more switching between 9 tools)
What You'll Learn in This Course
Track       What You'll Master                              Modules
──────────  ──────────────────────────────────────────────  ────────
Foundation  Platform architecture, navigation, first data   00-03
Query       DQL for metrics, logs, entities, notebooks      04-08
Monitor     Dashboards, alerting, SLOs, synthetic           09-12
Automate    Workflows, Terraform, IAM, extensions           13-16
Master      Security, business analytics, apps, maturity    17-20
💡 No prior Dynatrace knowledge required. By the end, you'll be able to monitor any application, write DQL queries, build dashboards, set up alerts, automate workflows, and assess platform maturity. These skills align with the Dynatrace Associate Certification.
Try it: Open your Dynatrace environment → press Ctrl+K → type "Services" → explore the auto-detected services. Click any service to see its response time, failure rate, and throughput. That's observability in action.