Databricks Troubleshooting
Structured path from core incident triage to SQL, Delta Lake, security, and cost controls.
0% completed
0 / 12 lessons complete
Module 1: Foundations and Core Failures
Module 2: Performance and Data Reliability
Module 3: SQL, Security, and Operations
Self-Check
Which page is your first stop when a Databricks job is slow?
Spark UI gives stage-level and task-level clues for skew, shuffle pressure, and executor bottlenecks.
Try It Snippet
-- Find the most expensive jobs by duration
SELECT job_id, run_id, state, duration_ms
FROM monitoring.job_runs
ORDER BY duration_ms DESC
LIMIT 20;