Job Description
Key Responsibilities
Platform Design & Architecture
Define and evolve the architecture of the observability platform, integrating logs, metrics, traces, events, and alerts
Establish reference implementations and patterns for integrating observability into cloud-native and monolithic applications
Evaluate and integrate best-in-class telemetry tools (e.g., OpenTelemetry, Prometheus, New Relic, Grafana, Elastic, Splunk)
Governance & Standards
Define enterprise-wide observability standards and maturity models (instrumentation guidelines, SLOs/SLIs, retention policies)
Drive instrumentation consistency across services through libraries, SDKs, and developer onboarding assets
Embed observability standards into CI/CD pipelines, golden paths, and developer enablement frameworks
Platform Engineering & Operations
Build and maintain core observability infrastructure as internal platform services
Ensure the observability platform is highly available, scalable, co...