Jobiglo

No results.

Site Reliability Engineering Manager

Lucid Motors Middle East · Riyad

New
Senior 🇬🇧 English
Kubernetes Prometheus Grafana Oracle Cloud Infrastructure AWS Terraform ArgoCD Jenkins Maven Docker GitLab Helm Monitoring Observability

Job description

About the role

Lucid Motors' Cloud team is looking for a Senior Site Reliability Engineering (SRE) Manager to own the reliability, scalability, and operational excellence of its cloud infrastructure and production services. The role blends hands‑on technical leadership with people management, ensuring high availability while building and mentoring a high‑performing SRE team.

Key responsibilities

  • Own availability, performance and reliability of cloud services deployed in KSA.
  • Define and track SRE best practices, including SLIs, SLOs, SLAs and error budgets.
  • Lead architecture and governance of disaster‑resilient systems and validate DR strategies.
  • Drive capacity planning, auto‑scaling and performance tuning on Kubernetes platforms.
  • Manage monitoring, observability and alerting with Prometheus, Grafana and logging tools.
  • Lead incident response, impact assessment and root‑cause analysis for complex production issues.
  • Mentor and grow a team of SRE engineers, handling hiring, onboarding and on‑call rotations.
  • Oversee production environments on Oracle Cloud Infrastructure (OCI) and AWS.
  • Govern Infrastructure‑as‑Code using Terraform and configuration‑management tools.
  • Define CI/CD strategy and implement pipelines with ArgoCD, Jenkins, Maven, Docker and GitLab.
  • Ensure secure, reliable deployment of microservices and data pipelines on Kubernetes using Helm.

Required profile

  • Proven experience leading SRE or reliability teams in a cloud‑native environment.
  • Strong people‑management skills with a track record of coaching and performance feedback.
  • Deep understanding of reliability engineering concepts such as SLIs/SLOs and error budgets.
  • Experience designing highly available, disaster‑resilient architectures.
  • Ability to drive incident response and post‑mortem processes.

Required skills

  • Kubernetes
  • Prometheus
  • Grafana
  • Oracle Cloud Infrastructure (OCI)
  • AWS
  • Terraform
  • ArgoCD
  • Jenkins
  • Maven
  • Docker
  • GitLab
  • Helm
  • CI/CD pipeline design
  • Monitoring and observability

Questions fréquentes

Le salaire n'est pas communiqué publiquement par le recruteur. Vous pouvez postuler et négocier directement avec Lucid Motors Middle East.
Cliquez sur "Postuler maintenant" en haut de la page. Vous pouvez importer votre CV en 1 clic — Jobiglo extrait automatiquement vos informations et postule pour vous.

Why are you reporting this job?

Thank you for your report. We will review this job.

Apply in 30 seconds

Enter your email to apply. An account will be created automatically.

By continuing, you accept our terms of use.

Already have an account? Login

Published 1 day ago

Expires 1 month from now

8 views · 0 applications

Boost your chances

Upload your CV — we will match you with relevant openings.

Analyzing your CV...

Lucid Motors Middle East

Riyad