Gray

Staff Site Reliability Engineer

Paris, Île-de-France, France

Staff Site Reliability Engineer

  • Paris, Île-de-France, France
  • Full Time
View favourites

About Ledger

We’re a team of experts pushing the limits of what’s possible, united by our common goal to unlock true freedom through digital ownership, making technology accessible for all. We believe in a world where users, creators and enterprises manage their value with ownership and freedom. Our curiosity drives us to innovate, empowering individuals on a global scale. We believe change is constant, and our team moves forward as one, with a culture of problem-solving where every employee is empowered and supported to challenge tradition and create solutions. Our mission is simple: to make self-custody accessible and give people the keys to their own financial futures. If you want to make a true impact, we want you to join us at Ledger.

At Ledger, we’re proud to be the global platform for digital assets and Web3, with over 20% of the world’s crypto assets secured through our Ledger devices. With our headquarters in Paris, and offices in Vierzon, Grenoble, Montpellier, London, Portland, Geneva, Zurich and Central Singapore, we have a team of around 600 professionals developing a variety of products and services to enable individuals and companies to securely buy, store, swap, grow and manage crypto assets – including the Ledger hardware wallets line with more than 7.5 millions units already sold in 200 countries.

The team

To support our continued growth, we’re hiring a Site Reliability Engineer to join the Infrastructure team. Reporting to our SRE Manager, you will be a member of Ledger’s SRE team driving technology's transformation by launching new platforms, building tools, automating away complex issues, and integrating with the latest technology.

Site Reliability Engineers leverage their experience as software and systems engineers to ensure applications integrated by SRE are available, have full-stack observability and have continuous improvement through code and automation. We are looking for an experienced candidate in reliability engineering who thrives on and enjoys solving complex problems through innovation and impacting change at scale.

What you’ll be doing:

  • Participate in building a DevOps / SRE culture and enable the transition to modern infrastructure management and deployment practices; Participate in building the SRE team roadmap (vision and delivery accountability); while anticipating stakeholder needs, game-changing technologies emergence and challenge scope / deadlines;

  • You will bring a strong mixture of software engineering, operations, and systems engineering experience to the role, and you have experience in the integration of complex systems;

  • Perform integration of platform software components;

  • Participate to design and deliver solutions to improve the availability, scalability, latency, and efficiency of systems;

  • Influence and create standards & best practices in support of service level objectives;

  • Automate key SRE metrics including SLOs/SLAs and error budgets;

  • Provide expert support to our level-2/application support team, to troubleshoot priority incidents, and conduct post-mortems;

  • Apply analytics on past incidents and usage patterns to predict issues and take proactive actions;

  • Ensure control of technical debt and promote quality practices;

  • Follow SRE and chaos engineering approaches across all strategic systems to predict in coordination with Service Design and prevent outages and improve solution availability;

  • Design and conduct performance tests, identify the bottlenecks and opportunities for optimization;

What we’re looking for

  • 5+ years on cloud engineering at scale, on organizations operating SaaS solutions

  • Proficiency in working in Unix/Linux environments, Python, Terraform, Kubernetes, AWS cloud solutions and architectures, CI/CD tools, ArgoCD, Ansible, configuration management, Database management (postgres), API management etc.

  • Strong knowledge on observability practices, with experience implementing and managing Logging, Monitoring and Alerting framework with solutions such as Datadog or Prometheus/Grafana.

  • Experience of cross-functional work and the ability to demonstrate a collaborative approach with regards to building key relationships across the organization and define projects scope, goals, plan and deliverables

  • “Customer focused” with the ability to identify and understand both internal and external customer's needs

  • Creative problem-solving and analysis skills with an ability to identify develop and implement solutions to meet the needs of the business

  • Excellent presentation and written communication Ability to deal with ambiguity, high level of pressure and rapidly changing environments

At Ledger, we are dedicated to continually investing in our employees which is why we offer more than just salaries; we provide comprehensive compensation packages that include a wide range of benefits. Here are some of the benefits you can look forward to:

  • Flexible work options - Our hybrid policy allows employees to work from home up to 3 times per week

  • Health & Wellness support - Health and Life Insurance.

  • Financial growth opportunities - Employees can become shareholders in Ledger as well as other financial benefits depending on your country of work.

  • Commuter allowance - Ledger offers a commuter allowance to contribute to your preferred means of transportation.

  • Learning & Development - A comprehensive suite of training solutions providing a personalised learning experience for every employee.

For regionally specific benefits, your Talent Acquisition contact will be able to provide you with more information.

We’re committed to building an inclusive hiring process. If you need any adjustments or accommodations, just let us know, we’ll do our best to support you.