Senior Observability Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Observability Specialist** is a senior individual contributor responsible for designing, implementing, and continuously improving the organization’s observability capabilities across cloud infrastructure and production applications. This role ensures that engineering, SRE, and operations teams can reliably detect, understand, and resolve issues using high-quality telemetry (metrics, logs, traces, profiling, and synthetics) aligned to user experience and business outcomes.
Senior Cloud Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Cloud Specialist** is a senior individual contributor responsible for designing, implementing, securing, and operating cloud infrastructure capabilities that enable product engineering teams to deliver reliable services at scale. This role combines deep cloud platform expertise with operational excellence, ensuring cloud environments are resilient, compliant, cost-effective, and automation-first.
Senior Cloud Migration Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Senior Cloud Migration Specialist** plans and executes the end-to-end migration of applications, data, and infrastructure from on‑premises or legacy hosting environments to public cloud and cloud-native platforms. This role combines hands-on engineering with migration strategy, risk management, and stakeholder leadership to deliver reliable cutovers while improving security posture, scalability, and cost efficiency.
Observability Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Observability Specialist** designs, implements, and continuously improves the telemetry, monitoring, alerting, and incident insight capabilities that enable engineering and operations teams to run reliable, performant, and cost-effective services. This role turns raw signals (metrics, logs, traces, events, synthetics, user experience signals) into **actionable operational intelligence**—reducing downtime, accelerating diagnosis, and improving customer experience.
Lead Observability Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Lead Observability Specialist is a senior individual-contributor (IC) and technical leader within Cloud & Infrastructure responsible for designing, operating, and continuously improving the organization’s observability capabilities—metrics, logs, traces, events, and user-experience signals—to ensure services are reliable, performant, and cost-effective. This role establishes standards and patterns for instrumentation, alerting, dashboards, and SLOs/SLIs, and partners with engineering and operations teams to reduce incident impact and accelerate detection and recovery.
Lead Cloud Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Lead Cloud Specialist** is a senior individual contributor (IC) who designs, implements, and continuously improves the organization’s cloud infrastructure and platform capabilities to ensure secure, reliable, cost-effective, and scalable delivery of software services. This role combines deep technical expertise across cloud services with practical operational leadership—setting standards, guiding delivery teams, and owning critical cloud outcomes without necessarily being a people manager.
Lead Cloud Migration Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Lead Cloud Migration Specialist** is a senior individual contributor who plans and drives complex application, data, and infrastructure migrations from on-premises or hosted environments into public cloud and hybrid cloud platforms. The role combines deep technical migration expertise with program-level orchestration—ensuring migrations are secure, reliable, cost-aware, and aligned to platform standards and business outcomes.
Cloud Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Cloud Specialist is a hands-on infrastructure specialist responsible for building, operating, and continuously improving cloud environments that host enterprise applications and services. The role ensures cloud platforms are secure, reliable, cost-effective, and aligned to engineering and business needs through strong operational discipline, automation, and stakeholder partnership.
Cloud Migration Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Cloud Migration Specialist** plans and executes the technical and operational work required to move applications, data, and infrastructure from on‑premises or legacy hosting into a public cloud, private cloud, or hybrid environment. The role focuses on **migration delivery excellence**—reducing risk, maintaining service continuity, and achieving target-state performance, security, and cost objectives.
Associate Observability Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Associate Observability Specialist helps ensure production systems are measurable, diagnosable, and reliable by supporting the implementation and day-to-day operations of logging, metrics, tracing, alerting, and dashboards across cloud and infrastructure platforms. This role exists to reduce time-to-detect and time-to-resolve incidents, improve service reliability, and enable engineering teams to make evidence-based decisions using high-quality telemetry. The business value is improved uptime, lower incident cost, faster troubleshooting, and more predictable customer experience through consistent observability practices.
Associate Cloud Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Associate Cloud Specialist** is an early-career, hands-on cloud operations and enablement role responsible for supporting the reliability, security, and cost-effective operation of cloud environments (IaaS/PaaS) under the guidance of senior cloud engineers or a cloud platform team. The role focuses on executing well-defined operational tasks—provisioning and managing cloud resources, responding to alerts and incidents, maintaining infrastructure-as-code (IaC) changes, and keeping documentation and runbooks accurate—while building foundational cloud engineering capability.
Associate Cloud Migration Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Associate Cloud Migration Specialist** supports the planning, execution, and stabilization of application and infrastructure migrations from on-premises or hosted environments to public cloud platforms (most commonly AWS and/or Azure) under the guidance of senior migration and cloud platform leaders. The role focuses on repeatable migration activities—discovery support, dependency capture, environment provisioning tasks, data transfer coordination, testing support, cutover checklists, and post-migration verification—while ensuring adherence to security, reliability, and change management controls.
DevOps and SRE Transformation Leader: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The DevOps and SRE Transformation Leader is accountable for designing and driving an enterprise-wide transformation in how software is delivered and operated—moving teams toward modern DevOps, Site Reliability Engineering (SRE), and platform engineering practices. The role establishes reliability standards (SLOs/SLIs), accelerates delivery through automation and paved roads, and institutionalizes operational excellence via incident management, observability, and continuous improvement.
Cloud and Infrastructure Leader: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Cloud and Infrastructure Leader** is accountable for the strategy, reliability, security, scalability, and cost-efficiency of the company’s cloud platforms and underlying infrastructure services. This role leads the teams and operating model that deliver core platform capabilities—compute, networking, storage, Kubernetes/container platforms, CI/CD enablement, observability, identity, and foundational security controls—so product and engineering teams can ship software quickly and safely.
Systems Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A **Systems Reliability Engineer (SRE)** designs, builds, and operates the reliability mechanisms that keep cloud platforms, infrastructure services, and production systems stable, performant, and recoverable. The role blends software engineering, systems engineering, and operations to reduce toil, prevent incidents, and shorten recovery time when failures occur.
Storage Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Storage Engineer designs, implements, and operates enterprise storage capabilities that reliably serve application, platform, and data workloads across on-premises and cloud environments. This role exists to ensure storage services meet performance, availability, scalability, security, and cost objectives—while enabling engineering teams to ship products without storage becoming a constraint or risk. The Storage Engineer creates business value by reducing downtime and incident impact, improving data protection and recovery posture, standardizing storage services, and optimizing spend through right-sizing, tiering, and automation.
Staff Systems Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Staff Systems Reliability Engineer (SRE) is a senior individual contributor in Cloud & Infrastructure responsible for ensuring that production systems are reliable, performant, secure, and cost-efficient at scale. This role blends deep systems engineering with operational excellence, using automation, observability, and engineering best practices to reduce toil and improve service resilience.
Staff Storage Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff Storage Engineer** is a senior individual contributor responsible for designing, evolving, and operating enterprise-grade storage platforms that reliably serve production workloads across cloud, on-prem, and hybrid environments. This role ensures storage services meet performance, availability, data protection, security, and cost objectives, while enabling engineering teams to ship products faster with predictable, self-service infrastructure.
Staff SRE Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff SRE Engineer** is a senior individual contributor responsible for improving the reliability, scalability, performance, and operational maturity of production systems through a combination of software engineering, systems engineering, and operational leadership. This role focuses on building resilient platforms, establishing reliability standards (SLIs/SLOs/error budgets), and enabling product engineering teams to ship changes safely and repeatedly.
Staff Site Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A **Staff Site Reliability Engineer (SRE)** is a senior individual contributor responsible for ensuring that critical cloud and infrastructure-backed services are **reliable, scalable, secure, and cost-effective**. The role blends software engineering with systems engineering to reduce operational risk, improve service health, and enable product teams to deliver changes safely at high velocity.
Staff Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Staff Reliability Engineer is a senior individual contributor in the Cloud & Infrastructure organization responsible for ensuring that critical production systems are reliable, scalable, performant, and cost-effective. This role blends deep systems engineering with operational excellence, leading reliability strategy across multiple services or platforms while enabling product engineering teams to ship safely at high velocity.
Staff Production Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff Production Engineer** is a senior individual contributor in the **Cloud & Infrastructure** organization responsible for ensuring that production systems are **reliable, scalable, secure, cost-efficient, and operable** under real-world conditions. This role combines deep systems engineering with operational excellence, focusing on reducing operational risk and toil while improving service health, incident response maturity, and deployment safety.
Staff Observability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A Staff Observability Engineer is a senior individual contributor in Cloud & Infrastructure responsible for designing, evolving, and operating the organization’s observability capabilities—metrics, logs, traces, profiling, alerting, and service-level measurement—so engineering teams can build and run reliable systems. The role focuses on platform-level enablement (tooling, standards, automation, and best practices) rather than owning a single service, while still participating deeply in incident response and reliability improvements for critical systems.
Staff Network Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Staff Network Engineer is a senior individual contributor responsible for designing, building, and operating resilient network connectivity across cloud and hybrid environments while improving reliability, security, and delivery velocity through automation and standardization. This role exists to ensure the company’s products, internal platforms, and engineering teams have dependable, performant, and secure network foundations that scale with growth and change.
Staff Network Automation Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Staff Network Automation Engineer is a senior individual contributor in the Cloud & Infrastructure organization responsible for designing, building, and scaling automation systems that make network provisioning, configuration, validation, and operations reliable, fast, and repeatable. The role blends deep networking fundamentals with software engineering practices (version control, CI/CD, testing, observability) to deliver “network as code” capabilities across data center, cloud, and edge environments.
Staff Monitoring Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A **Staff Monitoring Engineer** is a senior individual contributor in Cloud & Infrastructure who designs, standardizes, and continuously improves the company’s monitoring and observability capabilities across infrastructure and applications. The role exists to ensure the organization can detect issues early, diagnose them quickly, and prevent recurrence—at scale and with predictable operational quality.
Staff Linux Systems Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff Linux Systems Engineer** is a senior individual contributor (IC) responsible for the reliability, security, and performance of Linux-based compute platforms that underpin production services, internal developer platforms, and core business systems. This role designs and evolves standards, automation, and operating practices for fleets of Linux hosts across on-prem, cloud, and hybrid environments, with a strong focus on resilience, observability, and operational excellence.
Staff Kubernetes Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The Staff Kubernetes Engineer is a senior individual contributor responsible for designing, evolving, and operating Kubernetes-based platforms that enable engineering teams to deliver software safely, reliably, and efficiently at scale. This role blends deep Kubernetes expertise with platform engineering practices, cloud infrastructure design, and strong operational leadership in incident response, resilience, and continuous improvement.
Staff Infrastructure Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
A Staff Infrastructure Engineer is a senior individual contributor (IC) responsible for designing, building, and operating the foundational cloud and on-prem infrastructure that enables software teams to deliver reliable, secure, and scalable products. The role combines deep technical expertise with cross-team technical leadership, focusing on platform reliability, operational excellence, and long-term infrastructure strategy.
Staff DevOps Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path
The **Staff DevOps Engineer** is a senior individual contributor in the **Cloud & Infrastructure** department responsible for designing, scaling, and governing the reliability, security, and operability of cloud platforms and delivery pipelines that power software delivery. This role focuses on **platform enablement**—building standardized, self-service infrastructure and CI/CD capabilities that allow product engineering teams to ship safely and quickly.
