Senior Site Reliability Engineer

Related keywords: mental health remote jobdevops remote jobremote job software engineer

Role Overview

The Senior Site Reliability Engineer position is a challenging role situated in the core Platform/SRE team of Heidi Health, a company aimed at improving production reliability and incident response. This position is pivotal for ensuring the seamless operation of production environments and requires a combination of technical expertise and analytical skills to address various issues related to system reliability and performance.

Key Responsibilities

The successful candidate will be responsible for participating in on-call and incident response activities, which are critical for maintaining production standards. This role also involves:

  • Improving operational reliability by identifying and rectifying potential system failures.
  • Owning parts of the production environment, meaning that you will have direct responsibility for specific components, contributing to their enhancement and stability.
  • Strengthening observability, which involves implementing better monitoring and alerting solutions to preemptively identify issues before they become critical.
  • Actively working to reduce operational toil, thereby streamlining processes to make them more efficient.
  • Supporting safe change policies, ensuring that all modifications to the environment are practical and risk-averse.
  • Contributing to operational practices that drive overall system health and reliability.

Why It Might Be a Fit

This position is ideal for an individual with 3–6+ years of experience in SRE, DevOps, Platform, or other operations-heavy engineering roles. Candidates should have an established background in:

  • Supporting production systems and successfully navigating on-call rotations.
  • Problem-solving skills capable of debugging live systems under pressure, a key requirement in the high-stakes environment of an SRE role.
  • Familiarity with operating cloud infrastructure, specifically AWS, as it will be integral in this role.
  • Experience with Kubernetes and containerized workloads, as knowledge in these areas can significantly enhance operational responsiveness.
  • Infrastructure as code experience using tools like Terraform or similar solutions, which is becoming the gold standard in modern engineering practices.
  • Proficiency in various monitoring and alerting tools such as Datadog or Prometheus, ensuring a reliable feedback loop about system performance.
  • Some level of scripting or automation experience, ideally in languages such as Python or Bash, will aid in automating repetitive tasks and improving efficiency.

Benefits

To motivate and support its employees effectively, Heidi Health offers an array of attractive benefits, including:

  • Equity from day one, allowing you to own part of the company and share in its success.
  • Comprehensive medical and dental coverage through Bupa, ensuring that health needs are met without financial burden.
  • 24/7 mental health, coaching, and well-being support through Sonder, emphasizing a commitment to the mental health of employees.
  • A £100/month Healthy Heidi’s stipend, promoting a healthy lifestyle.
  • Global parental leave policies and fertility support, with a one-off payment of £7,000, recognizing the diverse needs of employees' families.
  • Learning & development budget of £700 per year, reflecting the importance of continuous education and upskilling.
  • A home office budget of £500, assisting employees in creating a productive working environment.
  • Recharge days post major milestones, which promotes work-life balance and recognizes hard work during stressful periods.
  • Flexibility in work arrangements, including the option to work from anywhere for up to 4 weeks per year, adapting to the modern approach to work.
  • Clinical leave policies allowing for 10 days of leave per year for qualifying clinical roles, highlighting the company’s commitment to its medical staff.
  • A hybrid environment policy, where employees will spend 3 days in the office, striking a balance between in-person collaboration and remote work.

Conclusion

In summary, the Senior Site Reliability Engineer role at Heidi Health is an excellent opportunity for seasoned professionals in the field of site reliability engineering. This position not only allows you to enhance and ensure the reliability of production systems but also offers a comprehensive benefits package that emphasizes well-being, personal development, and job satisfaction. It is an attractive option for individuals looking to make significant contributions to a responsible and forward-thinking company.



This job offer was originally published on himalayas.app

Heidi Health

Ireland

Software development

Full-time

June 6, 2026

3 views

0 clicks on Apply Now


Similar job offers


This job offer summary has been generated using automated technology. While we strive for accuracy, it may not always fully capture the nuances and details of the original job posting. We recommend reviewing the complete job listing before making any decisions or applications.