Signaler

Site Reliability Engineer - Apprenticeship

PARIS, 75

il y a 16 jours

# Site Reliability Engineer

Apprenticeship
Paris
Alternance
We help companies build their recruitment strategy by sharing their story through employer branding, enabling them to attract, engage, and retain talent who share their values.
We guide candidates to their future teams through immersive job listings and support them throughout their job search with a personalized candidate experience.

The SRE Intern will join the Platform Team to discover and contribute to the infrastructure and systems that ensure the reliability, performance, and security of our production environments. Under the mentorship of experienced SRE engineers, this internship bridges learning and hands-on contribution, applying software engineering principles to real infrastructure and operational challenges.

This role involves close collaboration with the SRE team, Development teams, and other stakeholders to contribute to automation, observability improvements, and infrastructure-as-code practices. The intern will progressively gain autonomy on well-scoped projects while learning incident management, capacity planning, and reliability engineering fundamentals in a production context.

The SRE Intern will report to the **Platform Engineering Manager** and is integrated within the **Platform Team**.---Key Responsibilities :### Technical Contribution & Learning

Participate alongside **Development teams** in infrastructure discussions, deployment processes, and operational requirements.
Contribute to **monitoring, alerting**, and observability improvements (dashboards, alerts, log hygiene).
Write and review **Terraform / Terragrunt modules** under supervision, learning Infrastructure-as-Code best practices.
Contribute to **disaster recovery** documentation and backup verification procedures.### Operational Excellence & Automation
Shadow and progressively contribute to **incident response** efforts, learning root cause analysis methodology.
Develop and improve **runbooks and documentation** for operational procedures.
Help ensure proper **logging and monitoring** coverage across systems.
Contribute to **automation initiatives** to reduce manual operations (scripts, tooling, pipeline improvements).
Learn and apply **SRE practices** (SLOs, error budgets, toil reduction) in day-to-day work.### Cross-team Collaboration & Knowledge Building
Work with development teams to understand and support **operational readiness** requirements.
Collaborate with the SRE team on **infrastructure security** measures.
Participate in **knowledge sharing sessions** and team rituals.
Document learnings, contribute to the team's **knowledge base**, and share findings with peers.
Partner with team members to improve **developer experience** through tooling and documentation.---## Profil recherché
You are a **student** in a Computer Science / Engineering program, looking for a **5-to-6-month internship** (convention de stage required).
You have solid fundamentals in systems and want to develop a strong hands-on technical focus in infrastructure and reliability.
Let's show you our stack ! You don't need to master it, but **familiarity or curiosity about these tools is expected:** + Our main cloud provider is **AWS**; + We use **Kubernetes** as our container orchestrator; + Our Infrastructure-as-Code is managed with **Terraform and Terragrunt**; + We use **Argo

CD and Circle

CI** as our integration and deployment tools; + We use **Open

Telemetry & Datadog** to monitor our platforms; + Our applications run on **GNU/Linux** systems, like Debian.

You're comfortable or eager to learn: + Working with **Linux/Unix systems.** + Understanding **distributed systems** fundamentals and cloud architectures. + Writing **scripts** (Bash, Python or equivalent) to automate tasks. + Learning **incident response** practices and structured troubleshooting. + Working in both **French and English**, in a hybrid/remote context.
It's not required, but having touched our tech stack (Ruby, Elixir, React.js) or contributed to personal/open-source infra projects is a significant advantage.
You have **strong problem-solving skills** and a methodical approach to understanding how systems work.
You're **reliability-curious**: genuinely interested in how production systems run, how failures happen, and how to build resilient infrastructure.

Step : A 30-minutes interview with **Lilia**, Talent Acquisition Apprentice

Step : A 45-minute interview focused on job skills assessment and value with **Nicolas,** Senior Site Reliability Engineer

Step : A 1h values interview with **Pascal,** Platform Engineering Manager

Good luck !

#J-18808-Ljbffr

Entreprise

Starther

Plateforme de publication

WHATJOBS

Offres pouvant vous intéresser

Staff Resilience Engineer (H/F/N)

PARIS, 75

il y a 21 jours

Platform Engineering Director

PARIS, 75

il y a 23 jours

Apprentice, Software Development

VALBONNE, 06

il y a 7 jours

Staff Resilience Engineer (H/F/N)

PARIS, 75

il y a 21 jours