Site Reliability Engineer - SRE
1 day ago Be among the first 25 applicants
OUR STORY: Join Scaleway and shape the sovereign cloud of tomorrow !
OUR STORY: Join Scaleway and shape the sovereign cloud of tomorrow !Since 1999, we have been designing secure, sustainable infrastructures aimed at supporting the most ambitious companies.Historically known for our dedicated servers (Dedibox), we made a strategic shift to cloud computing in 2015. Staying true to our principles of simplicity, flexibility, and technical excellence, we have become one of the leading players in Europe in the sector.With the rise of artificial intelligence, we have strengthened our commitment, supported by the Iliad Group, which is investing €3 billion to develop a serious, sovereign AI alternative to American and Asian giants.Every day, thanks to our rich catalog of products and services (bare metal, containerization, serverless, AI, etc.), Scaleway proudly serves 38,000 private and public sector clients, from Photoroom to Mistral AI, Golem AI, and ADEME.Our offices are located in Paris, Lille, Toulouse, Bordeaux, and Lyon.WHY WE NEED YOU?Our growth is driving us to strengthen our SRE team to support and scale our production environments.Your mission will be to build and maintain reliable, observable, and secure infrastructure in order to ensure optimal service availability for our customers around the world.YOUR FUTURE TEAMWe work in a collaborative and international environment where the diversity of Scalers, combined with a spirit of sharing, helps bring new projects to life every day, advancing our ambitions together.You will be part of a team of experienced Site Reliability Engineers. The team is responsible for maintaining and evolving core infrastructure and observability tools, supporting product teams, and improving the reliability of Scaleway’s services.YOUR DAILY ROUTINE
- Build and optimize tooling to automate monitoring, diagnosis, and remediation of production incidents
- Troubleshoot high-impact production issues in collaboration with other engineering teams
- Participate in an on-call rotation to handle incidents and ensure service continuity
- Implement and maintain observability solutions to monitor infrastructure and application health
- Contribute to infrastructure lifecycle management across different environments
- Promote and apply best practices in terms of stability, resiliency, scalability, and security
- Maintain clear technical documentation for tools and procedures
- Contribute to system and tool evolution based on production feedback
- Collaborate closely with development teams to ensure infrastructure readiness
- Participate in team rituals and knowledge-sharing initiatives
- Proactive and solution-oriented mindset
- Passion for automation and continuous improvement
- Strong collaboration and communication skills
- Ability to work independently and in a team
- Willingness to mentor and share knowledge
- Experience with Go, Python or Rust
- Strong scripting skills (Bash, Python)
- Hands-on experience with Linux systems (Ubuntu/Debian)
- Knowledge of networking (TCP/IP, DNS, BGP, load-balancing, IPv6, etc.)
- Experience in cloud environments and infrastructure (bare metal, VMs, containers, orchestrators)
- Familiarity with monitoring and logging tools (Prometheus, Grafana, Elastic, etc.)
- Comfortable with Infrastructure-as-Code (Ansible, Salt, AWX, etc.)
- Experience managing relational databases (PostgreSQL)
- Understanding of CI/CD pipelines (GitLab)
- Comfortable with English (written and spoken)
- Hybrid work: We offer up to 3 days of remote work per week
- Offices: Our offices are spacious, dynamic workspaces with bold design, conveniently located near public transport. Most of our offices feature outdoor spaces (terraces) and bike parking facilities
- Dining: Our chef provides a healthy meal service at the headquarters, and breakfast is available across all our sites year-round. Scalers working from regional sites enjoy a Swile card for lunches
- Well-being commitments: Whether it’s access to a gym, daycare places, or discounted services for caring services, Scaleway is committed to supporting Scalers in maintaining a balanced life
- Career & Mobility: Our managers value internal mobility, and opportunities to transition to other entities within the Iliad Group are accessible to all Scalers
- Discovery call with a recruiter (30 min)
- Interview with the manager to understand your technical skills and approach to the role (45 min)
- Technical interview to validate your expertise (1h)
- Interview with the Head of the Tribe to deepen your discussions and assess your fit with the team (45 min)
- HR interview to tour our offices and meet your future colleagues
Seniority level
Seniority level
Associate
Employment type
Employment type
Full-time
Job function
Job function
Engineering and Information TechnologyIndustries
Software Development
Referrals increase your chances of interviewing at Scaleway by 2x
Get notified about new Site Reliability Engineer jobs in Paris, Île-de-France, France .
Junior Site Reliability Engineer, AI Platform
Nanterre, Île-de-France, France 1 month ago
INGENIEUR SYSTEMES TRAITEMENT DE L'IMAGE (H/F)
Site Reliability Engineer (x/f/m) - Tech Foundations
Site Reliability Engineer (Observability) - remote friendly
Courbevoie, Île-de-France, France 2 weeks ago
Chiller Applied Systems Engineer - HVAC Solutions Europe (m/f/d)
Courbevoie, Île-de-France, France 2 weeks ago
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr