Senior Site Reliability Engineer
Jobgether · Eastern Region
Job description
About the role
We are seeking a Senior Site Reliability Engineer to ensure the reliability, scalability and performance of a globally‑used knowledge platform. The role is fully remote and part of a distributed engineering team that values transparency, open‑source collaboration and continuous learning.
Key responsibilities
- Perform day‑to‑day operations and DevOps responsibilities across large‑scale public‑facing infrastructure, including deployment, configuration, maintenance, and troubleshooting.
- Manage and optimise configuration and deployment systems using tools such as Puppet and Kubernetes.
- Automate infrastructure provisioning, service deployment, and operational workflows to improve reliability and efficiency.
- Collaborate with product and engineering teams to design scalable architectures and ensure systems operate reliably under global traffic loads.
- Participate in a 24/7 on‑call rotation, handling incident response, system alerts, troubleshooting and post‑incident reviews.
- Conduct root‑cause analysis of production incidents and implement preventive measures to improve system stability.
- Contribute to system monitoring, observability and performance optimisation initiatives.
- Mentor engineers and share operational expertise within a distributed, cross‑functional team.
- Work asynchronously with global teams while ensuring clear and effective technical communication.
Required profile
- 6+ years of experience in Site Reliability Engineering, DevOps, or infrastructure operations within complex distributed systems.
- Strong proficiency in Linux systems administration, troubleshooting and performance tuning.
- Experience with scripting languages such as Python, Bash, Go or Ruby for automation and tooling.
- Hands‑on experience with configuration management tools such as Puppet or Ansible.
- Solid understanding of distributed systems, caching technologies and system optimisation techniques.
- Experience with Linux package management on Debian‑based systems.
- Proven track record of automating operational processes and identifying improvement opportunities.
- Experience participating in incident response, post‑mortems and reliability engineering practices.
Required skills
- Linux
- Python
- Bash
- Go
- Ruby
- Puppet
- Ansible
- Kubernetes
- Debian (package management)
Questions fréquentes
Why are you reporting this job?
Apply in 30 seconds
Enter your email to apply. An account will be created automatically.
By continuing, you accept our terms of use.
Already have an account? Login
Published 3 days ago
Expires 1 month from now
17 views · 0 interested
Boost your chances
Upload your CV — we will match you with relevant openings.
Analyzing your CV...
Jobgether
Eastern Region
Related job offers
-
Developer Community Manager
Jobgether Eastern Region -
Software Engineer: Frontend
Jobgether Eastern Region -
Senior Site Reliability Engineer (SRE)
Jobgether Eastern Region -
Senior Specialist, Digital Transformation
Environment Fund | صندوق البيئة Riyad -
Low-Code / Full-Stack Automation Developer
Sohoby IT Solutions Arabie saoudite