Site Reliability Engineer (Remote)

Site Reliability Engineer (Remote)

joveo ai

April 26, 2026June 10, 2026LagosNigeria
Job Description
Job Overview:
We are hiring a Site Reliability Engineer to own the availability, performance, and scalability of Joveo's production systems. You will apply software engineering principles to infrastructure and operations - reducing toil, improving observability, and keeping our platform at the reliability levels our clients depend on.

Duties and Responsibilities:
Define and maintain SLOs, SLIs, and error budgets for critical services; Lead incident response, blameless postmortems, and reliability improvements; Build internal tooling and automation to reduce operational toil; Partner with engineering teams to bake reliability into system design; Implement and evolve observability stacks — metrics, logs, and traces; Manage on-call rotations and build scalable incident runbooks.

Required Qualifications:
Strong software engineering background with SRE or production ops experience; Proficiency in Python, Go, or similar for automation and tooling; Experience with observability platforms (Datadog, New Relic, Prometheus/Grafana); Deep understanding of distributed systems, failure modes, and reliability patterns; Experience with Kubernetes, container orchestration, and cloud-native infrastructure; Strong incident management skills and a calm, structured approach to outages.

Additional Notes:
Joveo is an equal opportunity employer. We are committed to building an inclusive workplace and welcome applications from all qualified individuals regardless of race, color, ethnicity, nationality, gender, gender identity or expression, sexual orientation, age, religion, disability, marital status, or any other characteristic protected by applicable law. All hiring decisions are made solely on the basis of qualifications, skills, and demonstrated ability.

Apply now
Similar Jobs