Site Reliability Engineer

Welcome to the Jungle

We tried out the four-day working week in 2019 for five months, between June and October. After the trial ended, the company decided to implement the four-day week, without impacting salaries.

Only considering candidates eligible to work in Paris, France ⚠️

The position

Job description

The Site Reliability Engineer (SRE) is responsible for designing, implementing, and maintaining scalable infrastructure and systems that ensure the reliability, performance, and security of our production environments. This hands-on position bridges the gap between development and operations, applying software engineering principles to infrastructure and operational challenges.

This role involves close collaboration with Development teams, Security teams, and other stakeholders to establish and maintain robust systems, implement automation, and drive operational excellence through SLOs (Service Level Objectives) and observability.

Additionally, you plays a key role in incident management, capacity planning, and implementing infrastructure as code practices across the organization.

Key Responsibilities

Technical Leadership & System Design

  • Collaborate with Development teams & Security on infrastructure architecture, deployment strategies, and operational requirements.
  • Design and implement monitoring, alerting, and observability solutions.
  • Drive infrastructure as code initiatives and maintain deployment automation.
  • Establish and track SLOs/SLIs to measure and improve service reliability.
  • Implement security best practices and maintain compliance requirements.
  • Design and maintain disaster recovery and backup strategies.

Operational Excellence & Process Implementation

  • Lead incident response and manage on-call rotations.
  • Develop and maintain runbooks and documentation for operational procedures.
  • Ensure proper logging and monitoring across all systems.
  • Drive automation initiatives to reduce manual operations.
  • Maintain and improve SRE practices across the organization.

Cross-team Collaboration & Knowledge Sharing

  • Work with development teams to implement operational readiness requirements.
  • Collaborate with Security teams on infrastructure security measures.
  • Provide technical mentorship to junior SREs and developers on operational best practices.
  • Lead knowledge sharing sessions and documentation efforts.
  • Partner with Engineering Managers to improve development workflows and tools.

Preferred experience
  • You have at least 5 years of infrastructure/systems engineering experience and want to maintain a strong hands-on technical focus.
  • You’re comfortable:
    • Building and maintaining large-scale distributed systems.
    • Managing incident response according to SLA.
    • Implementing automation and self-healing systems.
    • Developing utility scripts and functions.
    • Working in both French and English, in a remote context.
  • It’s not required, but having experience with our tech stack (Elixir, React.js) is a significant advantage.
  • You have strong problem-solving skills and can troubleshoot complex systems issues.
  • You’re reliability-focused: passionate about building resilient systems, measuring and improving reliability through data-driven approaches, and establishing sustainable operational practices.
  • You demonstrate excellent communication skills and can effectively collaborate with various technical and non-technical stakeholders. Deep dive in our stack:
    • Our main cloud provider is AWS ;
    • We use Kubernetes as our container orchestrator ;
    • Our Infrastructure-as-Code is managed with Terraform and Terragrunt ;
    • We use ArgoCD and CircleCI as our integration and deployment tools ;
    • We use OpenTelemetry & Datadog to monitor our platforms ;
    • Our applications runs on GNU/Linux systems, like Debian And if you’re not expert in all of those previous fields, you can still join us, we love sharing our knowledge.
Curious for more? Sign up now.

Sign up for free to uncover the job details—missions, preferred experience, and more.

Sign up

Already have an account? Sign in!

Automatically Apply to Remote Engineering Jobs!

Let our copilot automatically search & apply to remote jobs from all across the web.

Try it now
Welcome to the Junglewelcometothejungle.com

The multiplatform media company that aims to inspire every individual to thrive in their professional lives.

Working Week

We tried out the four-day working week in 2019 for five months, between June and October. After the trial ended, the company decided to implement the four-day week, without impacting salaries.

  • Mon
  • Tue
  • Wed
  • Thu
  • 🏖️
    Fri

Our Vacation Policy

Our vacation policy is 30 days

  • 30 days
  • 52 Fridays
  • 82 days off per year

Remote Working Policy

We work hybrid remotely from our offices in Europe (Paris, Prague and London)

Company Benefits

  • Health insurance
  • 401(k) company contribution
  • Generous parental leave
  • Equipment allowance
  • Professional Development Budget
  • Mental Health Support

Desirable Skills and Experience

Share this job:

Report incorrect data

Let us know if the job has expired