Training Course: Advanced Observability and Site Reliability Engineering

Advanced Observability and Site Reliability Engineering is a comprehensive course designed to equip professionals with the knowledge and skills necessary to excel in modern IT environments characterized by microservices, cloud-native architectures, and distributed systems.

REF: IT3254337

DATES: 9 - 13 Jun 2025

CITY: London (UK)

FEE: 5200 £

All Dates & Locations

Introduction

Advanced Observability and Site Reliability Engineering is a comprehensive course designed to equip professionals with the knowledge and skills necessary to excel in modern IT environments characterized by microservices, cloud-native architectures, and distributed systems. This course merges the principles of Observability with the practices of Site Reliability Engineering (SRE), offering a holistic approach to building resilient, scalable, and secure systems. Participants will explore cutting-edge techniques, tools, and methodologies to enhance observability, streamline incident management, and foster a culture of reliability within their organizations.

 

Objectives

  • Develop a practical understanding of Observability and its significance in modern IT landscapes.
  • Explore the three pillars of Observability and their application in microservices-based containerized environments.
  • Implement open Telemetry standards for seamless distributed tracing and innovation.
  • Understand and apply the Observability Maturity Model for measuring practical observability.
  • Integrate full-stack Observability and distributed tracing into DevSecOps practices.
  • Utilize AI Ops to transition from reactive to proactive incident management.
  • Implement Network and Container-level Observability with a focus on security.
  • Learn about Time-based Topology and its role in Observability for distributed environments.
  • Address data issues and build a clean Observability pipeline using DataOps principles.
  • Incorporate DevSecOps wisdom into Observability practices.
  • Apply Observability practices for DevOps and SRE to enhance system reliability and performance.

 

Course Outlines 

Day 1

 Introduction to Advanced Observability and SRE

  • Introduction to Advanced Observability and SRE
  • Fundamentals of Observability

Day 2

 Open Source for Observability and Service Maps

  • Leveraging Open Source for Observability
  • Service Maps, Topology, and DataOps

Day 3

 AIOps, Security, and Networking

  • AIOps and Observability
  • Security and Networking in Observability

Day 4

 Incident Response, Chaos Engineering, and SRE Principles

  • Incident Response and Chaos Engineering
  • SRE Principles and Execution

Day 5

 Hands-on Exercises and Certification Preparation

  • Review of key concepts from previous modules
  • Hands-on exercises applying Observability and SRE principles
  • Certification exam preparation and practice

Training Course: Advanced Observability and Site Reliability Engineering

Advanced Observability and Site Reliability Engineering is a comprehensive course designed to equip professionals with the knowledge and skills necessary to excel in modern IT environments characterized by microservices, cloud-native architectures, and distributed systems.

REF: IT3254337

DATES: 9 - 13 Jun 2025

CITY: London (UK)

FEE: 5200 £

Request a Call?

*
*
*
*
*
BlackBird Training Center