SRE Architect
The team of experts providing analytical services to healthcare clients is looking for a great, long-term SRE Architect.
You will join an international team of first-class professionals who are passionate about creating products that improve the quality of medical services.
The company offers exposure to a variety of industries and technologies, room to grow as a professional, time in projects to learn new skills and an opportunity to work with phenomenal coworkers, some of the best people on the planet.
The SRE Architect will be responsible for leading the SRE function across the enterprise. This role involves designing and delivering robust, scalable, and secure GCP-native solutions. The ideal candidate has extensive experience with GCP, microservices, SRE practices, and leadership, defining company-wide policies, standards, and best practices.
Responsibilities:
- Design, implement, and maintain scalable, reliable, and secure GCP-native solutions
- Proactively monitor system performance, anticipating potential issues, and implementing solutions
- Design and execute DR strategies for GCP-native solutions to ensure business
continuity - Collaborate with development and operations teams to ensure alignment of goals and efficient workflows
- Create and maintain comprehensive solutions documentation, including C4 models, deployment diagrams, network diagrams, and component views.
- Provide technical guidance and mentorship to team members on cloud provisioning practices
- Implement and advocate for SRE best practices, including monitoring, alerting, incident response, and capacity planning
- Design and implement SLI/SLOs to monitor and enhance system reliability and
scalability. - Collaborate with development teams to integrate the latest technology advancements, ensuring reliability and scalability
- Navigate complex corporate environments, aligning technical operations with broader business goals
- Apply security best practices throughout the software development lifecycle to ensure secure and compliant software releases.
- Troubleshoot and resolve complex technical issues promptly to maintain system
reliability and performance
Fundamentals:
- 7+ years in Site Reliability Engineering.
- 3+ years with GCP, including designing and delivering cloud infrastructure architecture solutions.
- Extensive experience with GCP networking.
- Proficient in solutions documentation, including C4 models, deployment diagrams, network diagrams, and component views.
- Skilled in GCP-native CI/CD pipelines.
- Experienced in designing and executing DR strategies for GCP-native solutions.
- Proficient in designing and implementing SLI/SLO/SLAs
- Expertise in designing GCP-native microservice architectures.
- Strong background in using Terraform to manage infrastructure within GCP.
- Strong analytical and strategic thinking skills.
- Knowledgeable in security best practices with diligence in ensuring the security of software releases.
- Proven ability to collaborate effectively across organizational boundaries, build
relationships, and share ideas to achieve broad organizational goals. - Excellent problem-solving, negotiation, and organizational skills.
- Outstanding communication skills.
Pros:
- Experience in the Healthcare business domain
- Understanding of HIPAA Compliance
- Experience with Python
- Experience with PHP
Technical Stack:
- GCP in general
- GCP Cloud Build
- OLTP: Google Cloud Spanner
- OLAP: Google Cloud BigQuery, BigLake
- Google Kubernetes Engine
- DataFlow
- Python
- Cloud Composer
- Looker Studio
- Google Pub/Sub
Benefits:
- Flexible working hours;
- Remote work;
- Interesting projects to work on;
- Exposure to a variety of industries and technologies.