Site Reliability Engineering Foundation

Programme Code
D199A
Domain
Cloud Infrastructure
Software Engineering
Level
Foundation
Learning Partner(s)
NTUC LearningHub
Duration
2 Days
Format
E-learning
Job Roles
ICT&SS Professional

Overview

Learn the principles and practices essential for your organisation to scale critical services reliably and economically.

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. Its key objective is to create ultra-scalable and highly reliable distributed software systems.

Introducing a site-reliability dimension requires organisational re-alignment, a new focus on engineering and automation, as well as the adoption of a range of new working paradigms.

Key Takeaways

At the end of this programme, you will be able to:
  • Discover the history of SRE and its emergence at Google
  • Understand the inter-relationship of SRE with DevOps and other popular frameworks
  • Understand the underlying principles behind SRE
  • Understand Service Level Objectives (SLOs) and their user focus
  • Understand Service Level Indicators (SLIs) and the modern monitoring landscape
  • Identify error budgets and the associated error budget policies
  • Understand toil and its effect on an organisation’s productivity
  • Identify some practical steps that can help to eliminate toil
  • Understand observability as a means of indicating the health of a service
  • Understand SRE tools, automation techniques and the importance of security
  • Apply anti-fragility principles, including the approach to failure and failure testing
  • Understand the organisational impact of introducing SRE

Who Should Attend

Please refer to the job roles section.

Prerequisites

  • Prior knowledge of DevOps, which can be gained by attending: IT14A05 - DevOps Foundation.
  • It is recommended that you have prior working experience or knowledge in IT software development or IT industry operations.

What To Bring

  • Hardware and Software
  • This programme will be conducted as a Virtual Live Class (VLC) via the Zoom platform. You must have a Zoom account and a laptop or desktop with “Zoom Client for Meetings” installed. This can be downloaded from https://zoom.us/download.
  • Please ensure that your computer or laptop meets the following requirements:
    • Operating system: Windows 10 or macOS (64-bit)
    • Processor/CPU: 1.8 GHz, 2-core Intel Core i3 or higher
    • Minimum 20 GB hard disk space
    • Minimum 8 GB RAM
    • Webcam (the camera must be turned on for the entire duration of the class)
    • Microphone
    • Internet connection: wired or wireless broadband
    • The latest version of Zoom, installed on your computer or laptop before the class
  • A wired internet connection is recommended, as it provides a more stable and reliable connection.
  • Dual monitors are recommended to improve your training experience, so that you can take part in hands-on exercises while staying engaged with your instructor.