Back to Jobs

Lead Site Reliability Engineer - Azure Cloud Platform Optimization and Reliability Expert

Remote, USA Full-time Posted 2025-11-03

Join Mercy's Innovative Team as a Lead Site Reliability Engineer

At Mercy, we're passionate about bringing to life a healing ministry through compassionate care and exceptional service. Our mission is clear, and we're committed to creating careers that match the unique gifts of unique individuals. We're a vibrant and supportive community that values professionalism, compassion, and advocacy. As a Lead Site Reliability Engineer, you'll have the opportunity to pioneer new models of care and transform the healthcare experience through advanced technology and innovative procedures.

About the Role

We're seeking an experienced Lead Site Reliability Engineer to join our team, focusing on the design, reliability, and scalability of our Azure Cloud platform. As a Lead SRE, you'll collaborate with our engineering team to develop creative solutions to operational problems, optimize new and existing systems, build infrastructure, and reduce work through automation. This role requires a strong and diverse skillset in relevant areas, including cloud computing, software development, and systems engineering.

Key Responsibilities:

  • Design, implement, and maintain scalable and reliable Azure Cloud platforms, ensuring high availability and performance.
  • Collaborate with cross-functional teams to develop and implement automation solutions, reducing manual work and improving efficiency.
  • Optimize cloud and on-prem hosted applications and services based on key performance metrics, ensuring cost-effective and secure operations.
  • Debug and optimize code, automate routine processes, and implement infrastructure-as-code solutions using tools like Azure Pipelines, Terraform, and PowerShell.
  • Lead and mentor junior engineers, providing guidance and support to help them develop their skills and expertise.
  • Participate in scale testing, disaster recovery, and capacity planning, ensuring business continuity and minimizing downtime.
  • Interact with diverse stakeholders, making decisions with a sense of urgency and resolving difficult situations.

Requirements and Qualifications

Essential Qualifications:

  • Bachelor's degree in a related field, specialized training, or equivalent work experience.
  • 7+ years of experience as a Site Reliability Engineer or comparable role, working with cloud (Azure preferred) and on-prem hosted solutions.
  • Proven experience optimizing cloud and on-prem hosted applications and services based on key performance metrics.
  • Ability to debug and optimize code, automate routine processes, and implement infrastructure-as-code solutions.

Preferred Qualifications:

  • Experience configuring Azure API Manager and App Services policies.
  • Familiarity with microservices architecture and container orchestration with Kubernetes.
  • Strong understanding and experience configuring cloud and on-prem technologies, building and optimizing CI/CD, and Infrastructure-as-Code.
  • Expertise with the full Microsoft stack, including AD, DHCP, DNS, DFS Namespace, Windows Servers, and Linux environments.
  • Good understanding of networking solutions, including Load Balancer, V-Net, Peering, etc.
  • Experience growing talent and leading less senior coworkers in developing skills in this competency.

What We Offer

At Mercy, we're committed to providing a supportive and inclusive work environment that values diversity and promotes growth. Here are just a few of the benefits we offer:

  • Competitive salary and comprehensive benefits package, including health, vision, and dental coverage.
  • Day-one comprehensive health, vision, and dental coverage.
  • PTO and employer-matched retirement funds.
  • A dynamic and supportive work environment that encourages collaboration and innovation.

Why Join Mercy?

At Mercy, we're passionate about creating a culture of compassion, professionalism, and advocacy. Here are just a few reasons why you might want to join our team:

  • Opportunities to pioneer new models of care and transform the healthcare experience.
  • A supportive and inclusive work environment that values diversity and promotes growth.
  • Collaborative and dynamic team with a strong sense of community.
  • Professional development opportunities and tuition reimbursement.

How to Apply

If you're a motivated and experienced Site Reliability Engineer looking for a new challenge, we encourage you to apply. Even if you feel you're not a perfect match, we'd still love to hear from you. We're looking for great people to join our friendly team and contribute to our mission of bringing to life a healing ministry through compassionate care and exceptional service.

Apply now and take the first step towards joining our innovative team as a Lead Site Reliability Engineer!

Apply for this job  

Similar Jobs