Your Impact
As a contributor in the SRE organization, you are passionate about delivering solutions to the real-time problems our mission-critical cloud native services encounter. You are also obsessed about achieving the high quality and reliability our customers demand. You will work closely not only with the SRE division, but your technical deliverables will reach the entire engineering organization to enable product teams to continuously deliver features on the vanguard of innovation.
What You’ll Do
Location: Remotely from Canada
Reports to: Manager, Site Reliability Engineering
Direct Reports: None
- Build robust, easy-to-use foundational platforms and tools that enable engineering teams to provision services rapidly, consistently, and securely.
- Exemplify cloud-native site reliability best practices.
- Write code that is performant, maintainable, clear, and concise.
- Employ strong problem-solving skills, with the ability to debug problems in cloud native distributed systems.
- Influence and educate the engineering organization to adopt new and improved architectural patterns.
- Provide robust documentation for use by engineers to promote self-service
- Take calculated risks, champion new ideas, and cultivate your craft.
What You Bring
- 5+ years of applicable experience
- Experience with Kubernetes is the must
- Experience operating cloud platforms such as Azure, AWS, or similar.
- Experience utilizing container technologies like Docker or similar.
- Experience using scripting languages such as Python, Golang or similar.
- Experience utilizing CI/CD platforms to automate provisioning infrastructure, software builds, tests, and releases.
- Experience using observability tools such as APM, logging, and metrics to assist with debugging issues.
- Experience using Infrastructure as Code tools for provisioning infrastructure such as Terraform, Cloudformation, or similar.
- Empathy to support the needs of software engineers
#LI-Remote