Senior Observability Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Senior Observability Specialist** is a senior individual contributor responsible for designing, implementing, and continuously improving the organization’s observability capabilities across cloud infrastructure and production applications. This role ensures that engineering, SRE, and operations teams can reliably detect, understand, and resolve issues using high-quality telemetry (metrics, logs, traces, profiling, and synthetics) aligned to user experience and business outcomes.

Read More

Senior Cloud Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Senior Cloud Specialist** is a senior individual contributor responsible for designing, implementing, securing, and operating cloud infrastructure capabilities that enable product engineering teams to deliver reliable services at scale. This role combines deep cloud platform expertise with operational excellence, ensuring cloud environments are resilient, compliant, cost-effective, and automation-first.

Read More

Senior Cloud Migration Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Senior Cloud Migration Specialist** plans and executes the end-to-end migration of applications, data, and infrastructure from on‑premises or legacy hosting environments to public cloud and cloud-native platforms. This role combines hands-on engineering with migration strategy, risk management, and stakeholder leadership to deliver reliable cutovers while improving security posture, scalability, and cost efficiency.

Read More

Observability Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Observability Specialist** designs, implements, and continuously improves the telemetry, monitoring, alerting, and incident insight capabilities that enable engineering and operations teams to run reliable, performant, and cost-effective services. This role turns raw signals (metrics, logs, traces, events, synthetics, user experience signals) into **actionable operational intelligence**—reducing downtime, accelerating diagnosis, and improving customer experience.

Read More

Lead Observability Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Lead Observability Specialist is a senior individual-contributor (IC) and technical leader within Cloud & Infrastructure responsible for designing, operating, and continuously improving the organization’s observability capabilities—metrics, logs, traces, events, and user-experience signals—to ensure services are reliable, performant, and cost-effective. This role establishes standards and patterns for instrumentation, alerting, dashboards, and SLOs/SLIs, and partners with engineering and operations teams to reduce incident impact and accelerate detection and recovery.

Read More

Lead Cloud Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Lead Cloud Specialist** is a senior individual contributor (IC) who designs, implements, and continuously improves the organization’s cloud infrastructure and platform capabilities to ensure secure, reliable, cost-effective, and scalable delivery of software services. This role combines deep technical expertise across cloud services with practical operational leadership—setting standards, guiding delivery teams, and owning critical cloud outcomes without necessarily being a people manager.

Read More

Lead Cloud Migration Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Lead Cloud Migration Specialist** is a senior individual contributor who plans and drives complex application, data, and infrastructure migrations from on-premises or hosted environments into public cloud and hybrid cloud platforms. The role combines deep technical migration expertise with program-level orchestration—ensuring migrations are secure, reliable, cost-aware, and aligned to platform standards and business outcomes.

Read More

Cloud Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Cloud Specialist is a hands-on infrastructure specialist responsible for building, operating, and continuously improving cloud environments that host enterprise applications and services. The role ensures cloud platforms are secure, reliable, cost-effective, and aligned to engineering and business needs through strong operational discipline, automation, and stakeholder partnership.

Read More

Cloud Migration Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Cloud Migration Specialist** plans and executes the technical and operational work required to move applications, data, and infrastructure from on‑premises or legacy hosting into a public cloud, private cloud, or hybrid environment. The role focuses on **migration delivery excellence**—reducing risk, maintaining service continuity, and achieving target-state performance, security, and cost objectives.

Read More

Associate Observability Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Associate Observability Specialist helps ensure production systems are measurable, diagnosable, and reliable by supporting the implementation and day-to-day operations of logging, metrics, tracing, alerting, and dashboards across cloud and infrastructure platforms. This role exists to reduce time-to-detect and time-to-resolve incidents, improve service reliability, and enable engineering teams to make evidence-based decisions using high-quality telemetry. The business value is improved uptime, lower incident cost, faster troubleshooting, and more predictable customer experience through consistent observability practices.

Read More

Associate Cloud Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Associate Cloud Specialist** is an early-career, hands-on cloud operations and enablement role responsible for supporting the reliability, security, and cost-effective operation of cloud environments (IaaS/PaaS) under the guidance of senior cloud engineers or a cloud platform team. The role focuses on executing well-defined operational tasks—provisioning and managing cloud resources, responding to alerts and incidents, maintaining infrastructure-as-code (IaC) changes, and keeping documentation and runbooks accurate—while building foundational cloud engineering capability.

Read More

Associate Cloud Migration Specialist: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Associate Cloud Migration Specialist** supports the planning, execution, and stabilization of application and infrastructure migrations from on-premises or hosted environments to public cloud platforms (most commonly AWS and/or Azure) under the guidance of senior migration and cloud platform leaders. The role focuses on repeatable migration activities—discovery support, dependency capture, environment provisioning tasks, data transfer coordination, testing support, cutover checklists, and post-migration verification—while ensuring adherence to security, reliability, and change management controls.

Read More

DevOps and SRE Transformation Leader: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The DevOps and SRE Transformation Leader is accountable for designing and driving an enterprise-wide transformation in how software is delivered and operated—moving teams toward modern DevOps, Site Reliability Engineering (SRE), and platform engineering practices. The role establishes reliability standards (SLOs/SLIs), accelerates delivery through automation and paved roads, and institutionalizes operational excellence via incident management, observability, and continuous improvement.

Read More

Cloud and Infrastructure Leader: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Cloud and Infrastructure Leader** is accountable for the strategy, reliability, security, scalability, and cost-efficiency of the company’s cloud platforms and underlying infrastructure services. This role leads the teams and operating model that deliver core platform capabilities—compute, networking, storage, Kubernetes/container platforms, CI/CD enablement, observability, identity, and foundational security controls—so product and engineering teams can ship software quickly and safely.

Read More

Systems Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

A **Systems Reliability Engineer (SRE)** designs, builds, and operates the reliability mechanisms that keep cloud platforms, infrastructure services, and production systems stable, performant, and recoverable. The role blends software engineering, systems engineering, and operations to reduce toil, prevent incidents, and shorten recovery time when failures occur.

Read More

Storage Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Storage Engineer designs, implements, and operates enterprise storage capabilities that reliably serve application, platform, and data workloads across on-premises and cloud environments. This role exists to ensure storage services meet performance, availability, scalability, security, and cost objectives—while enabling engineering teams to ship products without storage becoming a constraint or risk. The Storage Engineer creates business value by reducing downtime and incident impact, improving data protection and recovery posture, standardizing storage services, and optimizing spend through right-sizing, tiering, and automation.

Read More

Staff Systems Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Staff Systems Reliability Engineer (SRE) is a senior individual contributor in Cloud & Infrastructure responsible for ensuring that production systems are reliable, performant, secure, and cost-efficient at scale. This role blends deep systems engineering with operational excellence, using automation, observability, and engineering best practices to reduce toil and improve service resilience.

Read More

Staff Storage Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Staff Storage Engineer** is a senior individual contributor responsible for designing, evolving, and operating enterprise-grade storage platforms that reliably serve production workloads across cloud, on-prem, and hybrid environments. This role ensures storage services meet performance, availability, data protection, security, and cost objectives, while enabling engineering teams to ship products faster with predictable, self-service infrastructure.

Read More

Staff SRE Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Staff SRE Engineer** is a senior individual contributor responsible for improving the reliability, scalability, performance, and operational maturity of production systems through a combination of software engineering, systems engineering, and operational leadership. This role focuses on building resilient platforms, establishing reliability standards (SLIs/SLOs/error budgets), and enabling product engineering teams to ship changes safely and repeatedly.

Read More

Staff Site Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

A **Staff Site Reliability Engineer (SRE)** is a senior individual contributor responsible for ensuring that critical cloud and infrastructure-backed services are **reliable, scalable, secure, and cost-effective**. The role blends software engineering with systems engineering to reduce operational risk, improve service health, and enable product teams to deliver changes safely at high velocity.

Read More

Staff Reliability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Staff Reliability Engineer is a senior individual contributor in the Cloud & Infrastructure organization responsible for ensuring that critical production systems are reliable, scalable, performant, and cost-effective. This role blends deep systems engineering with operational excellence, leading reliability strategy across multiple services or platforms while enabling product engineering teams to ship safely at high velocity.

Read More

Staff Production Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Staff Production Engineer** is a senior individual contributor in the **Cloud & Infrastructure** organization responsible for ensuring that production systems are **reliable, scalable, secure, cost-efficient, and operable** under real-world conditions. This role combines deep systems engineering with operational excellence, focusing on reducing operational risk and toil while improving service health, incident response maturity, and deployment safety.

Read More

Staff Observability Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

A Staff Observability Engineer is a senior individual contributor in Cloud & Infrastructure responsible for designing, evolving, and operating the organization’s observability capabilities—metrics, logs, traces, profiling, alerting, and service-level measurement—so engineering teams can build and run reliable systems. The role focuses on platform-level enablement (tooling, standards, automation, and best practices) rather than owning a single service, while still participating deeply in incident response and reliability improvements for critical systems.

Read More

Staff Network Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Staff Network Engineer is a senior individual contributor responsible for designing, building, and operating resilient network connectivity across cloud and hybrid environments while improving reliability, security, and delivery velocity through automation and standardization. This role exists to ensure the company’s products, internal platforms, and engineering teams have dependable, performant, and secure network foundations that scale with growth and change.

Read More

Staff Network Automation Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Staff Network Automation Engineer is a senior individual contributor in the Cloud & Infrastructure organization responsible for designing, building, and scaling automation systems that make network provisioning, configuration, validation, and operations reliable, fast, and repeatable. The role blends deep networking fundamentals with software engineering practices (version control, CI/CD, testing, observability) to deliver “network as code” capabilities across data center, cloud, and edge environments.

Read More

Staff Monitoring Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

A **Staff Monitoring Engineer** is a senior individual contributor in Cloud & Infrastructure who designs, standardizes, and continuously improves the company’s monitoring and observability capabilities across infrastructure and applications. The role exists to ensure the organization can detect issues early, diagnose them quickly, and prevent recurrence—at scale and with predictable operational quality.

Read More

Staff Linux Systems Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Staff Linux Systems Engineer** is a senior individual contributor (IC) responsible for the reliability, security, and performance of Linux-based compute platforms that underpin production services, internal developer platforms, and core business systems. This role designs and evolves standards, automation, and operating practices for fleets of Linux hosts across on-prem, cloud, and hybrid environments, with a strong focus on resilience, observability, and operational excellence.

Read More

Staff Kubernetes Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The Staff Kubernetes Engineer is a senior individual contributor responsible for designing, evolving, and operating Kubernetes-based platforms that enable engineering teams to deliver software safely, reliably, and efficiently at scale. This role blends deep Kubernetes expertise with platform engineering practices, cloud infrastructure design, and strong operational leadership in incident response, resilience, and continuous improvement.

Read More

Staff Infrastructure Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

A Staff Infrastructure Engineer is a senior individual contributor (IC) responsible for designing, building, and operating the foundational cloud and on-prem infrastructure that enables software teams to deliver reliable, secure, and scalable products. The role combines deep technical expertise with cross-team technical leadership, focusing on platform reliability, operational excellence, and long-term infrastructure strategy.

Read More

Staff DevOps Engineer: Role Blueprint, Responsibilities, Skills, KPIs, and Career Path

The **Staff DevOps Engineer** is a senior individual contributor in the **Cloud & Infrastructure** department responsible for designing, scaling, and governing the reliability, security, and operability of cloud platforms and delivery pipelines that power software delivery. This role focuses on **platform enablement**—building standardized, self-service infrastructure and CI/CD capabilities that allow product engineering teams to ship safely and quickly.

Read More