Slide 1
Most trusted JOB oriented professional program
DevOps Certified Professional (DCP)

Take your first step into the world of DevOps with this course, which will help you to learn about the methodologies and tools used to develop, deploy, and operate high-quality software.

Slide 2
DevOps to DevSecOps – Learn the evolution
DevSecOps Certified Professional (DSOCP)

Learn to automate security into a fast-paced DevOps environment using various open-source tools and scripts.

Slide 2
Get certified in the new tech skill to rule the industry
Site Reliability Engineering (SRE) Certified Professional

A method of measuring and achieving reliability through engineering and operations work – developed by Google to manage services.

Slide 2
Master the art of DevOps
Master in DevOps Engineering (MDE)

Get enrolled for the most advanced and only course in the WORLD which can make you an expert and proficient Architect in DevOps, DevSecOps and Site Reliability Engineering (SRE) principles together.

Slide 2
Gain expertise and certified yourself
Azure DevOps Solutions Expert

Learn about the DevOps services available on Azure and how you can use them to make your workflow more efficient.

Slide 3
Learn and get certified
AWS Certified DevOps Professional

Learn about the DevOps services offered by AWS and how you can use them to make your workflow more efficient.

previous arrow
next arrow

Interview Questions & Answers Sets on SRE

Spread the Knowledge

What are the differences between SRE and DevOps?

Google: “One could view DevOps as a generalization of several core SRE principles to a wider range of organizations, management structures, and personnel.”

What SRE team is responsible for?

Google: “the SRE team is responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their services”

What is an error budget?

Atlassian: “An error budget is the maximum amount of time that a technical system can fail without contractual consequences.”

What do you think about the following statement: “100% is the only right availability target for a system”

Wrong. No system can guarantee 100% availability as no system is safe from experiencing zero downtime. Many systems and services will fall somewhere between 99% and 100% uptime (or at least this is how most systems and services should be).

What are MTTF (mean time to failure) and MTTR (mean time to repair)? What these metrics help us to evaluate?

  • MTTF (mean time to failure) other known as uptime, can be defined as how long the system runs before if fails.
  • MTTR (mean time to recover) on the other hand, is the amount of time it takes to repair a broken system.
  • MTBF (mean time between failures) is the amount of time between failures of the system.

What is the role of monitoring in SRE?

Google: “Monitoring is one of the primary means by which service owners keep track of a system’s health and availability”

Facebook Notice for EU! You need to login to view and post FB Comments!
Rajesh Kumar