Observability & SRE Senior Engineer (w/m)
Chaque jour, nous fournissons des solutions innovantes pour améliorer la vie de millions de personnes, en connectant les employés, les entreprises et les commerçants dans le monde entier.
Nous savons qu'il y a des centaines de façons pour vous d'évoluer. Avec nous, vous développerez vos compétences dans un environnement multiculturel, stimulant et dynamique.
Osez rejoindre Edenred et préparez-vous à vous épanouir dans une entreprise internationale qui vous offrira des opportunités infinies.
La méritocratie fait partie de notre ADN.
Chez Edenred, nous reconnaissons, recrutons et développons tous les talents.
Venez comme vous êtes, dans le respect de votre singularité et contribuer à l'aventure Edenred avec nous.
Nous nous engageons à prévenir toute forme de discrimination et à proposer à tous nos candidats des opportunités égales indépendamment de leur genre et expression de genre, handicap, origine, croyance religieuse et orientation sexuelle ou tout autre critère.
OUR CONTEXT
For a very ambitious agile-at-scale program (Feature Teams organization), we are seeking talented and ambitious professionals to join our team based primarily in Paris and Bucharest as we accelerate our transformative digital and product journey.
We are fully embracing Cloud Native capabilities and best practices and operating as a multi-tribe Product team to better serve our customers (Clients, Merchants and Users) and our Edenred employees. We are leveraging Agility at scale principles and are leading the charge in establishing “FinTech Product” ways of working for Edenred.
YOUR ROLE
SRE is what you get when you treat operations as if it’s a software problem. Our mission is to protect, provide for, and progress the software and systems behind SmartER platform — Mobile App for our users, Web front end for our Clients & Merchant, Salesforce ecosystem for Customer Service & Sales Edenred employees, Card transactions, Order Mgt & Invoicing components, Open APIs with Payroll systems to name just a few — with an ever-watchful eye on their availability, latency, performance, and capacity.
In your role as a Senior Engineer - SRE and Observability within the Employee Benefits Platform Engineering team, you will be working on company-wide strategic initiatives to make all services Observable for the different key players - Product Owner, Cloud Platform team, Engineering lead, Infrastructure team, Customer Service - according to their needs. You will be responsible for implementing tooling and services that would be instrumental in establishing SRE practices for EDENRED Employee Benefits Business line, focusing on compliantly delivering out-of-the-box Observability and Incident management process. You will be constantly in communication with cross-functional product engineering teams acting as a domain expert in various SRE practices - system design, software platforms and frameworks, performance optimization and capacity planning among others.
Main responsibilities :
-
Take a leadership role to design, build and maintain Observability suite of tools and services and keep them up to date with market standards
-
Drive a product mindset for Observability, ensuring the different personas get Observability services consistently with their needs and areas of expertise
-
In particular support the dual approach of SLO (PO persona, End to End User experience centric, Trending) vs SRE (Tech persona, Component/Feature centric, Now in Prod).
-
Research and develop SRE standards and conventions and drive their adoption within the product teams
-
Closely collaborate with product engineering teams & Azure PaaS teams to guide system design improvements with a clear focus on observability, availability, scalability, and latency
-
Lead incident response efforts and influence and bring forth a blameless post mortem culture resulting in overall improved service resilience and decreased Mean time to recovery (MTTR)
-
Establish a strong “Actionable” observability culture by driving reliable alerting across the ecosystem and empowering the front line teams (Product Operations & Customer Care Operations) through timely and proactive signals
-
Work with the Employee Benefits CTO and Engineering lead to assess the needs of Software Engineering against current offerings
-
Act as the Technical referent for the Observability tool ecosystem (DataDog, Opentelemetry.io, Opsgenie, …), stays up to date on their roadmap and influence both SmartER and Vendor roadmaps to maximize value and cost of ownership
-
Participate in and occasionally lead the daily, weekly, sprint cycle team ceremonies and ensure efficient activities of the team aligned to Observability & SRE goals
-
Provide coaching to team members related to Observability best practice – extend to our internal partners when needed
-
Assess and size effort associated with work backlog and participate in grooming
-
Leading technical discussions and requirements analysis with Tribes – both PO and Engineering
-
Define best practices around making systems and services measurable and work with various teams to get those best practices applied
-
Collect, aggregate, and visualize the collected metrics to provide SmartER with actionable insights
-
Interview and participate in building an extended team to establish an Edenred community of practice in the Monitoring and Observability domains
-
Inform recommendations, including resourcing, of strategic projects to mature and improve the monitoring and observability platform
-
Lead proofs of concepts, engineering, and implementation projects
YOUR PROFILE
-
Master's degree – Technology / Computer science preferred
-
At least 3+ years of experience with building large scale distributed Cloud Native services and understanding how to make them reliable and scalable
-
Understanding Unix/Linux operating system and TCP/IP network fundamentals
-
Previous Background in both Systems and Software Engineering
-
Hands-on experience with SRE practices & Observability concepts (Traces, Metrics, Logs, Events), standards (Opentelemetry) and tooling (Datadog, Prometheus)
-
Hands on experience consolidating application and system logs at enterprise scale
-
Experience using cloud provider platforms (preferably MS Azure) and deploying and running distributed services on Kubernetes. You know your way around Terraform
-
Experience working in a remote team across multiple regions and time zones
-
You have experience with analyzing, troubleshooting and acting as a technical influencer for large-scale distributed systems
-
You should have experience in designing, automating, maintaining and optimizing observability platforms(logging, metric and tracing)
-
You are self motivated coupled with fluent English communication skills
-
You consistently demonstrate clear and concise written and verbal communication, with a “story telling” mindset
-
You are curious, eager to learn and grow both your technical and product skills
Apply now and Vibe with Us!