
Site Reliability Engineer - Apprenticeship

PARIS, 75
16 days ago

# Site Reliability Engineer

  • Apprenticeship
  • Paris
  • Work-study (alternance)
  • We help companies build their recruitment strategy by sharing their story through employer branding, enabling them to attract, engage, and retain talent who share their values.
  • We guide candidates to their future teams through immersive job listings and support them throughout their job search with a personalized candidate experience.

The SRE Intern will join the Platform Team to discover and contribute to the infrastructure and systems that ensure the reliability, performance, and security of our production environments. Under the mentorship of experienced SRE engineers, this internship bridges learning and hands-on contribution, applying software engineering principles to real infrastructure and operational challenges.

This role involves close collaboration with the SRE team, Development teams, and other stakeholders to contribute to automation, observability improvements, and infrastructure-as-code practices. The intern will progressively gain autonomy on well-scoped projects while learning incident management, capacity planning, and reliability engineering fundamentals in a production context.

The SRE Intern will report to the **Platform Engineering Manager** and is integrated within the **Platform Team**.

---

## Key Responsibilities

### Technical Contribution & Learning

  • Participate alongside **Development teams** in infrastructure discussions, deployment processes, and operational requirements.
  • Contribute to **monitoring, alerting**, and observability improvements (dashboards, alerts, log hygiene).
  • Write and review **Terraform / Terragrunt modules** under supervision, learning Infrastructure-as-Code best practices.
  • Contribute to **disaster recovery** documentation and backup verification procedures.

### Operational Excellence & Automation
  • Shadow and progressively contribute to **incident response** efforts, learning root cause analysis methodology.
  • Develop and improve **runbooks and documentation** for operational procedures.
  • Help ensure proper **logging and monitoring** coverage across systems.
  • Contribute to **automation initiatives** to reduce manual operations (scripts, tooling, pipeline improvements).
  • Learn and apply **SRE practices** (SLOs, error budgets, toil reduction) in day-to-day work.

### Cross-team Collaboration & Knowledge Building
  • Work with development teams to understand and support **operational readiness** requirements.
  • Collaborate with the SRE team on **infrastructure security** measures.
  • Participate in **knowledge sharing sessions** and team rituals.
  • Document learnings, contribute to the team's **knowledge base**, and share findings with peers.
  • Partner with team members to improve **developer experience** through tooling and documentation.

---

## Candidate Profile
  • You are a **student** in a Computer Science / Engineering program, looking for a **5-to-6-month internship** (an internship agreement, or convention de stage, is required).
  • You have solid fundamentals in systems and want to develop a strong hands-on technical focus in infrastructure and reliability.
  • Let's show you our stack! You don't need to master it, but **familiarity with or curiosity about these tools is expected:**
      + Our main cloud provider is **AWS**;
      + We use **Kubernetes** as our container orchestrator;
      + Our Infrastructure-as-Code is managed with **Terraform and Terragrunt**;
      + We use **Argo CD and CircleCI** as our integration and deployment tools;
      + We use **OpenTelemetry & Datadog** to monitor our platforms;
      + Our applications run on **GNU/Linux** systems, like Debian.

  • You're comfortable with or eager to learn:
      + Working with **Linux/Unix systems.**
      + Understanding **distributed systems** fundamentals and cloud architectures.
      + Writing **scripts** (Bash, Python or equivalent) to automate tasks.
      + Learning **incident response** practices and structured troubleshooting.
      + Working in both **French and English**, in a hybrid/remote context.
  • It's not required, but having touched our tech stack (Ruby, Elixir, React.js) or contributed to personal/open-source infra projects is a significant advantage.
  • You have **strong problem-solving skills** and a methodical approach to understanding how systems work.
  • You're **reliability-curious**: genuinely interested in how production systems run, how failures happen, and how to build resilient infrastructure.

Step 1: A 30-minute interview with **Lilia**, Talent Acquisition Apprentice

Step 2: A 45-minute interview focused on job skills assessment and values with **Nicolas**, Senior Site Reliability Engineer

Step 3: A 1-hour values interview with **Pascal**, Platform Engineering Manager

Good luck!


Company: Starther
Publishing platform: WHATJOBS