Senior Site Reliability Engineer (SRE)
Jobgether · Eastern Region
Job description
About the role
We are seeking a Senior Site Reliability Engineer to join Wikimedia Enterprise’s global team. In this role, you will design, operate, and evolve the highly available API and data infrastructure that powers the worldwide reuse of Wikimedia content.
Key responsibilities
- Define, track, and continuously improve SLOs, SLIs, and error budgets for critical services.
- Design and enhance observability systems including metrics, logging, and distributed tracing.
- Participate in incident response, on‑call rotations, and post‑incident reviews to drive continuous improvement.
- Build and maintain CI/CD and GitOps pipelines enabling secure, automated, and reliable deployments.
- Implement infrastructure‑as‑code and automation‑first practices to reduce operational toil.
- Design and operate scalable cloud infrastructure across production environments.
- Drive capacity planning, performance optimization, and resilience testing, including chaos engineering.
- Improve developer experience by enabling self‑service infrastructure and streamlined workflows.
- Collaborate with security, software, and release engineering teams to embed reliability and security best practices.
- Optimize infrastructure cost and efficiency using FinOps principles without compromising availability.
- Develop and maintain operational metrics such as MTTR, MTTD, and incident frequency.
- Contribute to platform engineering initiatives that standardize infrastructure across teams.
- Mentor peers and promote best practices in SRE, automation, and systems reliability.
Required profile
- 5+ years of experience in SRE, DevOps, or infrastructure engineering.
- Strong expertise in site reliability engineering, distributed systems, and cloud infrastructure.
- Proficiency with infrastructure‑as‑code tools such as Terraform and/or Ansible.
- Programming experience in at least one language, such as Python or Go.
- Proactive, collaborative mindset with a focus on automation and continuous improvement.
Required skills
- Terraform
- Ansible
- Python
- Go
- CI/CD pipelines
- GitOps
- Cloud infrastructure
- Observability (metrics, logging, tracing)
- Chaos engineering
- FinOps
Published 2 days ago
Expires 1 month from now