Site Reliability Engineer
PARIS, 75
il y a 11 jours
About the role:
As Site Reliability Engineer , you will be responsible for deploying and maintaining diagnostic software across multiple centers, ensuring system reliability, performance, observability and security. You will collaborate with development and support teams, automate infrastructure using tools like Kubernetes, Ansible, and Terraform, and document IT solutions while managing the full product lifecycle.
Position is based in Paris and can be offered remotely.
In particular, you will:
- Deploy diagnostic software products across multiple centers (hospitals or pathology laboratories), both on‑premise and on the cloud.
- Be familiar with a wide range of technologies (DevOps, GitOps, Open Source software) and infrastructure (from physical server via lower layers to Kubernetes architecture concepts & solutions).
- Provide detailed specifications for the proposed IT solutions including hosting specifications, network flow matrix, RACI and security and you will document them.
- Support the Customer Support team by providing primary operational support and engineering to centers at which products are deployed.
- Run the production environment by monitoring availability and taking a holistic view of system health, and manage the full product lifecycle (including decommissioning).
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
- Partner with development teams to improve services through rigorous testing and release procedures and to create sustainable systems and services through automation and uplifts.
- Provide operation support and engineering to centers for dataset import.
- Deploy data science environments with Sagemaker to meet the needs of data scientists.
About You
- 3+ years of industry experience with a Masters or BS degree in computer science, software engineering or an associated field.
- Expertise in Terraform and Kubernetes.
- Experience with Ansible.
- Knowledge of development collaboration tools (git & Jira).
- Experience in production environments: documentation/specification, tests & code versioning.
- Experience with dynamic resource management frameworks like Mesos, Kubernetes, or Yarn.
- Experience with distributed storage technologies like NFS, HDFS, Ceph, S3.
- Experience with AWS cloud computing services.
- Knowledge of other cloud computing services such as GCP, Azure, etc.
- Experience with monitoring and alerting tools like Prometheus.
- Experience with SageMaker.
- Knowledge of Linux/Unix-like environments.
- Knowledge of network and security.
- Experience in cyber security and applicable standards (in particular ISO 27001).
- Fluency in English.
Preferred qualifications / bonus skills
- Certification in Ansible, Terraform and/or Kubernetes.
- Experience in medical industry and applicable standards (in particular ISO 13485 and ISO 62304).
- Experience with a high-level programming language like Python, Rust or front-end programming languages.
- Experience in Agile Scrum Methodology.
- Language skills in French.
Entreprise
Waiv, formerly Owkin Dx
Plateforme de publication
WHATJOBS
Offres pouvant vous intéresser
PARIS, 75
il y a 13 jours
FRANCE
il y a 5 jours
LILLE, 59
il y a 13 jours
PARIS, 75
il y a 13 jours