TechJobBoard

Collective Health

Senior Site Reliability Engineer, Healthcare Cloud Infrastructure and Networking

at Collective Health

San Francisco, CA | Lehi, UT | Plano, TX, United States



At Collective Health, we’re transforming how employers and their people engage with their health benefits by seamlessly integrating cutting-edge technology, compassionate service, and world-class user experience design.

As a Sr. Site Reliability Engineer, you will be a key player in designing, building, and maintaining the cloud infrastructure that powers our healthcare applications. You will blend software engineering with systems and network administration expertise to solve complex multi-cloud connectivity and operational challenges. Your work will directly impact patient care by ensuring our services are always available and performant. You will be responsible for the availability, latency, performance, efficiency, monitoring, and emergency response of our production environment, with a special focus on meeting stringent healthcare compliance standards such as HIPAA, SOC 2, and HITRUST.

What you'll do:

  • Cloud Infrastructure Management:
    • Design, deploy, and manage scalable, secure, and highly available multi-cloud infrastructure on AWS and GCP.
    • Implement and manage Infrastructure as Code (IaC) using tools such as Terraform and Ansible to automate provisioning and configuration.
    • Manage containerized applications and orchestration platforms, primarily Kubernetes and Docker.
  • Cloud Networking Engineering:
    • Lead the architecture, implementation, and maintenance of secure cloud connectivity solutions to ensure compliant and high-throughput data exchange with external healthcare partners.
    • Design, implement, and maintain highly available and secure cloud network topologies (e.g., VPCs, subnets, routing tables, and peering) across multiple regions and multiple cloud technologies.
    • Expertly configure and manage cloud load balancing (e.g., ALB, NLB, GCLB) and DNS services (e.g., Route 53, Cloud DNS) for optimal traffic distribution and low latency.
    • Own the end-to-end lifecycle of TLS/SSL termination, key rotation, and certificate management across all load balancers to enforce stringent security postures.
    • Design and enforce network segmentation and Zero Trust Architecture principles at the network layer to secure Protected Health Information (PHI).
    • Perform network performance analysis and troubleshooting for latency, throughput, and connectivity issues, specifically within the cloud provider's network infrastructure.
  • Site Reliability & Automation:
    • Develop and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to maintain and improve system reliability.
    • Automate manual operational tasks, from deployments and scaling to incident response and recovery.
    • Conduct blameless post-mortems and root cause analyses to prevent recurrence of incidents.
    • Participate in an on-call rotation to respond to production issues and drive them to resolution.
  • Monitoring & Application Support:
    • Build and maintain robust monitoring, logging, and alerting systems using tools like Prometheus, Grafana, or the ELK stack.
    • Work closely with software development teams to improve the reliability and performance of applications.
    • Manage CI/CD pipelines to ensure safe, automated, and efficient software releases.
    • Ensure all systems and processes are compliant with healthcare regulations and security best practices (HIPAA, SOC 2).

To be successful in this role, you'll need:

  • Required Qualifications
    • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
    • 8+ years of experience in a Network Engineering or Cloud Engineering role.
    • Strong proficiency with AWS and GCP cloud providers.
    • Deep expertise in network observability tools and designing systems for continuous network compliance auditing against HIPAA / HITRUST standards.
    • Expert-level proficiency in VPC design, advanced routing protocols, IP Address Management (IPAM), and Container Network Interface (CNI) configuration.
    • Hands-on experience with containerization and orchestration technologies such as Kubernetes and Docker.
    • Solid experience with Infrastructure as Code tools such as Terraform and Ansible.
    • Proficiency in scripting and/or programming languages such as Python, Go, and Bash.
    • Experience in capacity planning, cost analysis, and justification of architecture and design proposals.
    • Experience working in a regulated industry (e.g., healthcare, finance) with a strong understanding of security and compliance requirements.
  • Preferred Qualifications
    • Multi-cloud and multi-site networking experience with AWS and GCP.
    • Experience with Service Mesh technologies (e.g., Istio, Linkerd) to manage and secure inter-service communication.
    • Relevant certifications: AWS Certified Advanced Networking or GCP Professional Cloud Network Engineer.

Pay Transparency Statement 

This is a hybrid position based out of one of our offices: San Francisco, CA; Lehi, UT; or Plano, TX. Hybrid employees are expected to be in the office two days per week.

The actual pay rate offered within the range will depend on factors including geographic location, qualifications, experience, and internal equity. In addition to the salary, you will be eligible for stock options and benefits like health insurance, 401k, and paid time off. Learn more about our benefits at https://jobs.collectivehealth.com/benefits/.

San Francisco, CA Pay Range
$168,000 - $210,000 USD
Lehi, UT Pay Range
$134,500 - $168,000 USD
Plano, TX Pay Range
$147,800 - $185,500 USD

Why Join Us?

  • Mission-driven culture that values innovation, collaboration, and a commitment to excellence in healthcare
  • Impactful projects that shape the future of our organization
  • Professional development through internal mobility, mentorship programs, and courses tailored to your interests
  • Flexible work arrangements and a supportive work-life balance

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Collective Health is committed to providing support to candidates who require reasonable accommodation during the interview process. If you need assistance, please contact recruiting-accommodations@collectivehealth.com.

Privacy Notice

For more information about why we need your data and how we use it, please see our privacy policy: https://collectivehealth.com/privacy-policy/.


© 2023 TechJobBoard. All rights reserved.