Full Time Permanent opportunity
Location: Anywhere in Canada (ability to support EST hours).
System Reliability Engineer
Our client is one of the largest independent information technology services firm, and still growing!
They are currently expanding their System Reliability Engineering team that helps one of thier key clients deploy, manage, troubleshoot, and enhance their developer tooling platform, servicing over 2000 developers.
As a System Reliability Engineer, you will be responsible for designing, implementing, and supporting a verity of developer productivity tools that include Ansible Tower, GitLab, Artifactory and SonarQube. The technology stack used to manage the platform includes Ansible, Terraform, Python, Prometheus, Splunk, ELK and PageDuty.
You will build automation solutions to provision and validate infrastructure and help debug and resolve problems. You will help to improve operational performance by focusing on user experience, effectively assessing and managing risk, and minimizing the impact of failures.
Key Skills and Attributes
NOTE: It’s not expected that any single candidate would have experience across all these areas. Our client is looking for someone who is strong in a few areas, and has interest and desire to learn in others.