Site Reliability Engineer (Freelance/CDI) - act digital
Job description

Service Description

We are looking for a Site Reliability Engineer (SRE) to join an Engineering team.

The goal of this role is to ensure the reliability, scalability, monitoring, and performance of on-premises services within the organization's product environment.

The role involves designing and implementing best practices for monitoring and observability while collaborating with cross-functional teams to improve systems, processes, uptime, and operational efficiency.

Responsibilities

  • Design and maintain monitoring infrastructure
  • Create custom dashboards, alerts, and visualization solutions
  • Implement distributed tracing and log aggregation systems
  • Establish monitoring best practices and SLI/SLO frameworks
  • Maintain security compliance for on-premises monitoring tools
  • Automate deployment and configuration management
  • Collaborate with development teams on application instrumentation
  • Participate in on-call rotations and incident response

Requirements

Core Technologies

  • Advanced Grafana
  • Prometheus (PromQL)
  • OpenTelemetry
  • Elasticsearch

Infrastructure

  • Linux system administration
  • Networking knowledge
  • On-premises infrastructure and security

Programming

  • Python
  • Bash
  • Go (for automation)

Experience

  • 3+ years of experience in monitoring or observability
  • 2+ years working with Grafana and Prometheus in production
  • Strong Linux administration experience
  • Proven experience managing on-premises infrastructure environments

Security

  • Knowledge of enterprise security practices
  • Understanding of compliance requirements

Additional Skills

  • Ability to balance technical trade-offs with business needs
  • Strong prioritization and problem-solving skills
  • Willingness to participate in 24/7 on-call rotations

Key Deliverables

  • Reduced MTTD / MTTR through effective monitoring
  • Comprehensive observability across systems
  • Automated monitoring, deployment, and infrastructure management
  • Security-compliant monitoring practices

Languages

  • English: C1 level
  • Additional languages are a plus (German, French, or Dutch)
