Staff Production Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff Production Engineer** is a senior individual contributor in the **Cloud & Infrastructure** organization responsible for ensuring that production systems are **reliable, scalable, secure, cost-efficient, and operable** under real-world conditions. This role combines deep systems engineering with operational excellence, focusing on reducing operational risk and toil while improving service health, incident response maturity, and deployment safety.
Staff Observability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A Staff Observability Engineer is a senior individual contributor in Cloud & Infrastructure responsible for designing, evolving, and operating the organization’s observability capabilities—metrics, logs, traces, profiling, alerting, and service-level measurement—so engineering teams can build and run reliable systems. The role focuses on platform-level enablement (tooling, standards, automation, and best practices) rather than owning a single service, while still participating deeply in incident response and reliability improvements for critical systems.
Staff Network Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Staff Network Engineer is a senior individual contributor responsible for designing, building, and operating resilient network connectivity across cloud and hybrid environments while improving reliability, security, and delivery velocity through automation and standardization. This role exists to ensure the company’s products, internal platforms, and engineering teams have dependable, performant, and secure network foundations that scale with growth and change.
Staff Network Automation Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Staff Network Automation Engineer is a senior individual contributor in the Cloud & Infrastructure organization responsible for designing, building, and scaling automation systems that make network provisioning, configuration, validation, and operations reliable, fast, and repeatable. The role blends deep networking fundamentals with software engineering practices (version control, CI/CD, testing, observability) to deliver “network as code” capabilities across data center, cloud, and edge environments.
Staff Monitoring Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A **Staff Monitoring Engineer** is a senior individual contributor in Cloud & Infrastructure who designs, standardizes, and continuously improves the company’s monitoring and observability capabilities across infrastructure and applications. The role exists to ensure the organization can detect issues early, diagnose them quickly, and prevent recurrence—at scale and with predictable operational quality.
Staff Linux Systems Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff Linux Systems Engineer** is a senior individual contributor (IC) responsible for the reliability, security, and performance of Linux-based compute platforms that underpin production services, internal developer platforms, and core business systems. This role designs and evolves standards, automation, and operating practices for fleets of Linux hosts across on-prem, cloud, and hybrid environments, with a strong focus on resilience, observability, and operational excellence.
Staff Kubernetes Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Staff Kubernetes Engineer is a senior individual contributor responsible for designing, evolving, and operating Kubernetes-based platforms that enable engineering teams to deliver software safely, reliably, and efficiently at scale. This role blends deep Kubernetes expertise with platform engineering practices, cloud infrastructure design, and strong operational leadership in incident response, resilience, and continuous improvement.
Staff Infrastructure Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A Staff Infrastructure Engineer is a senior individual contributor (IC) responsible for designing, building, and operating the foundational cloud and on-prem infrastructure that enables software teams to deliver reliable, secure, and scalable products. The role combines deep technical expertise with cross-team technical leadership, focusing on platform reliability, operational excellence, and long-term infrastructure strategy.
Staff DevOps Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff DevOps Engineer** is a senior individual contributor in the **Cloud & Infrastructure** department responsible for designing, scaling, and governing the cloud platforms and delivery pipelines behind software delivery, with accountability for their reliability, security, and operability. This role focuses on **platform enablement**: building standardized, self-service infrastructure and CI/CD capabilities that allow product engineering teams to ship safely and quickly.
Staff Cloud Native Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A **Staff Cloud Native Engineer** is a senior individual contributor (IC) who designs, builds, and continuously improves the cloud-native foundations that enable engineering teams to ship reliable software quickly and safely. This role is accountable for the technical direction and hands-on delivery of platform capabilities such as Kubernetes orchestration, infrastructure-as-code, CI/CD enablement, service-to-service networking, observability, and reliability practices.
Staff Cloud Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff Cloud Engineer** is a senior individual contributor in the **Cloud & Infrastructure** department responsible for designing, building, and evolving the company’s cloud platform capabilities so product engineering teams can deliver secure, reliable, and cost-effective services at scale. The role exists to translate business and engineering goals (speed, availability, compliance, cost) into **repeatable cloud patterns, automation, and platform guardrails** that reduce operational toil and risk.
SRE Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **SRE Engineer** (Site Reliability Engineer) is a hands-on reliability practitioner responsible for keeping production systems **available, performant, scalable, and cost-effective** while enabling frequent, safe software delivery. This role applies software engineering approaches to operational problems, using automation, observability, and reliability design patterns to reduce incidents and accelerate recovery when they occur.
Site Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A Site Reliability Engineer (SRE) ensures that customer-facing and internal services remain reliable, performant, secure, and cost-effective at scale by applying software engineering to operations. This role exists to reduce operational risk, improve service availability, and create leverage through automation, observability, and disciplined incident/problem management.
Senior Systems Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Systems Reliability Engineer** is a senior individual contributor in the **Cloud & Infrastructure** organization responsible for ensuring that production systems are **reliable, resilient, observable, performant, and cost-effective** at scale. This role blends deep systems engineering with SRE practice: defining service reliability targets (SLOs), strengthening operational readiness, driving automation, and leading complex incident response to protect customer experience and revenue.
Senior Storage Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Senior Storage Engineer designs, implements, and operates enterprise-grade storage and data protection platforms that underpin application availability, performance, and recoverability across on-premises and cloud environments. This role exists to ensure that data services (block, file, object, backup, and replication) are reliable, secure, cost-effective, and scalable—while meeting evolving product and engineering demands.
Senior SRE Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior SRE Engineer** is an experienced individual contributor responsible for designing, improving, and operating the reliability practices, platforms, and automation that keep customer-facing services available, performant, and cost-effective. This role blends software engineering with systems engineering, with a focus on **SLOs/SLIs, error budgets, observability, incident response, toil reduction, and resilient architecture** across cloud and infrastructure layers.
Senior Site Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Site Reliability Engineer (SRE)** ensures that customer-facing and internal cloud services are **reliable, performant, resilient, and cost-effective** at scale. This role applies software engineering principles to operations—designing reliability into systems through automation, observability, incident management rigor, and continuous improvement.
Senior Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Reliability Engineer** is a senior individual contributor in the **Cloud & Infrastructure** organization responsible for ensuring production services meet defined reliability, availability, performance, and recoverability targets. This role designs and operates reliability mechanisms (SLOs, error budgets, observability, automation, incident response, resilience engineering) to reduce customer-impacting outages and improve operational efficiency at scale.
Senior Production Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A **Senior Production Engineer** is a senior individual contributor in the Cloud & Infrastructure organization responsible for ensuring that production systems are **reliable, scalable, secure, and cost-efficient** while enabling fast, safe delivery of software changes. The role blends software engineering, systems engineering, and operational excellence to reduce downtime, improve performance, and increase developer velocity through automation and well-defined production practices.
Senior Observability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A **Senior Observability Engineer** designs, builds, and operates the monitoring, logging, tracing, and alerting capabilities that enable engineering teams to **detect, diagnose, and resolve production issues quickly** while meeting reliability and performance objectives. The role sits at the intersection of platform engineering, SRE/operations, and software engineering, translating system behavior into actionable signals and standards that scale across teams and services.
Senior Network Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Senior Network Engineer designs, builds, and operates reliable, secure, and scalable network connectivity across cloud and on-prem environments to enable product delivery, internal engineering productivity, and enterprise-grade service reliability. This role balances deep hands-on engineering (routing/switching, WAN, firewalls, load balancing, DNS, connectivity) with operational excellence (monitoring, incident response, change management, capacity planning) and modern automation practices (Infrastructure as Code, configuration management, CI/CD integration).
Senior Network Automation Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Network Automation Engineer** is a senior individual contributor in the **Cloud & Infrastructure** organization responsible for designing, building, and operating automation systems that provision, configure, validate, and continuously manage network infrastructure at scale. The role bridges traditional network engineering and modern software engineering practices (NetDevOps), enabling safe, repeatable, and observable network change through code, pipelines, and policy-driven controls.
Senior Monitoring Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Senior Monitoring Engineer designs, implements, and continuously improves the organization’s monitoring and observability capabilities across cloud infrastructure, platforms, and production services. This role ensures that engineering teams can detect incidents early, diagnose issues quickly, and measure reliability through actionable metrics, logs, traces, and service-level objectives (SLOs).
Senior Linux Systems Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Linux Systems Engineer** is a senior individual contributor responsible for the reliability, security, performance, and lifecycle management of Linux-based compute platforms that power production services, internal engineering systems, and core infrastructure. This role designs and operates scalable Linux environments across on-premises and cloud, automates system configuration and fleet operations, and hardens platforms to meet uptime and security requirements.
Senior Kubernetes Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Senior Kubernetes Engineer designs, builds, secures, and operates Kubernetes platforms that reliably run production workloads at scale. This role exists to provide a standardized, automated, and supportable container orchestration foundation—so application teams can ship faster while meeting enterprise expectations for availability, security, cost, and compliance.
Senior Infrastructure Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Senior Infrastructure Engineer designs, builds, and operates reliable, secure, and scalable infrastructure platforms that enable product engineering teams to ship and run software with confidence. This role is accountable for improving availability, performance, and operational efficiency across cloud and/or hybrid environments, while reducing risk through automation, standardization, and strong operational controls.
Senior DevOps Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior DevOps Engineer** is a senior individual contributor in the **Cloud & Infrastructure** department responsible for building, operating, and continuously improving the platforms, automation, and operational practices that enable engineering teams to deliver software safely, quickly, and reliably. This role designs and runs cloud infrastructure, CI/CD systems, observability, and operational controls that reduce lead time and change risk while improving availability and performance.
Senior Cloud Native Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Cloud Native Engineer** designs, builds, and operates cloud-native platforms and runtime capabilities that enable application teams to ship secure, scalable, reliable software with high delivery velocity. This role sits in the **Cloud & Infrastructure** department and focuses on modern infrastructure engineering: containers, Kubernetes, service networking, infrastructure-as-code, CI/CD enablement, observability, and reliability practices.
Senior Cloud Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Cloud Engineer** designs, builds, and operates secure, reliable, and cost-efficient cloud infrastructure that enables product engineering teams to deliver software quickly and safely. This role is accountable for production-grade cloud foundations (networking, compute, identity, observability, automation) and for evolving them into scalable internal platforms and patterns.
Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Reliability Engineer ensures that cloud-based services and the infrastructure they run on are available, performant, resilient, and recoverable under real-world conditions—including failures, traffic spikes, deployments, and dependency issues. This role blends software engineering, operational excellence, and systems thinking to reduce customer-impacting incidents, improve mean time to restore (MTTR), and raise the reliability baseline through automation and engineering standards.
