Principal SRE - AWS/Kubernetes at athenahealth

  • Job opening

Summary:

We are seeking a Principal Site Reliability Engineer to join our Kubernetes-focused team in Cloud Infrastructure Engineering (CIE). CIE ensures the continuous availability of the technologies and systems that are the foundation of athenahealth’s services. We are directly responsible for thousands of servers and petabytes of storage, and we handle thousands of web requests per second on a hybrid cloud architecture, all while sustaining growth at a meteoric rate. We enable an operating system for medical offices that removes administrative complexity, leaving doctors free to practice medicine. But enough about us, let’s talk about you.

You are an engineer with a passion for working hands-on with the latest and greatest cloud technology. You want to use the newest tech, but you want to do so in a large environment so you can see how that technology is used when scale matters. What’s most important to you is learning new things, and you’re seeking the best environment to enable that growth for yourself and your team.

The Principal Site Reliability Engineer is a highly skilled and experienced individual who is responsible for leading the design, implementation, and maintenance of our Kubernetes platform, while integrating it with the various services that form athenahealth’s technology stack. You are an expert in AWS technology offerings (including EKS, ALB, EC2, and IAM) and are familiar with building a cloud at scale while adhering to (and authoring) guardrails.

This role will be responsible for working with a team of engineers to develop and implement new features, troubleshoot problems, and ensure that our platform is scalable, reliable, and secure. The ideal candidate will have a deep understanding of Kubernetes and hybrid cloud computing, as well as strong leadership and communication skills.

Responsibilities:

Lead the design, implementation, and maintenance of our Kubernetes platform

Work with a team of engineers to develop and implement new features

Troubleshoot problems and ensure that our platform is scalable, reliable, and secure

Work with other teams to ensure that our platform meets the needs of our business and stakeholders

Establish standards and best practices for our stakeholders to better achieve success, specifically around Kubernetes & containers

Stay up-to-date on the latest cloud computing technologies and best practices

Mentor and train other engineers

Qualifications:

Demonstrated experience with Kubernetes and container-based orchestration

8+ years of experience with cloud computing (preferably AWS)

Familiarity with release engineering tooling such as Git, Jenkins, artifact management, Jira

Familiarity with open-source tooling such as Ansible, Puppet, Terraform, Prometheus, Grafana, Kibana

Strong understanding of software engineering principles and practices

Strong leadership and communication skills

Ability to work independently and as part of a team

Ability to work under pressure and meet deadlines

About athenahealth

Our Vision: To create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.

Our Location: This role will be based remotely. Our Global Headquarters is in Watertown, MA, just a few miles from Boston. We have three other US locations in Atlanta, Austin and Belfast, ME.

Our Culture: At athenahealth, our employees (or “athenistas”) are committed to making healthcare smarter. Our success is dependent on the diversity, collective spirit, and contributions of our people, clients and partners. We value teamwork and believe that the strength of our team comes from supporting each other and leveraging our specialized skills. If you are looking for a company that will enable you to work outside of your comfort zone to transform the healthcare ecosystem, athenahealth is the place for you.

Our Perks: Along with health & financial benefits, our athenistas are offered a variety of perks that promote employee wellbeing such as commuter support, collaborative workspaces and dog-friendly offices - just to name a few.


Location

Remote USA