Site Reliability Engineer
Location
Nottingham | United Kingdom
Job description
About Us
Over the last 10 years, Commify has grown and transformed under private equity ownership and now operates across 9 countries: the UK, France, Spain, Italy, Romania, Germany, the Netherlands, Australia and the USA.
Our mission is clear: to make business communication brilliant through best-in-class products that span SMS, WhatsApp, Email, VoIP and other communications technologies. We serve over 50,000 businesses globally and send over 5 billion communications a year.
We have ambitious growth plans to expand our already impressive product portfolio. As a values-led business, we recognise that our ‘People’ are our biggest asset. Do you want to be part of our success story?
Role Summary
We are looking for a Site Reliability Engineer to join our Technology Department as a key member of our Operations team. You will be passionate about keeping our products and platforms running at peak performance. You will understand how our software runs on top of the infrastructure (both physical and Cloud) and how the individual components interact to provide the brilliant messaging solutions our customers expect. As a Site Reliability Engineer you will play a key role in:
- Ensuring high levels of system performance through monitoring, analysis and performance tuning
- Implementing scalability and fault tolerance
- Improving processes through automation or other efficiencies
- Troubleshooting Application and Middleware issues
- Working with Engineering teams to ensure successful running of their software in a high throughput production environment
- Building deployment pipelines that ensure high quality code deployments
What essentials are we looking for?
- Experience of working with Microsoft Azure
- Strong working knowledge of Terraform, App Services and Kubernetes
- Fluent in written and spoken English
- A passion for reliability
- Experience of creating and modifying Terraform deployments
- Previous experience of working in an Operations role (ideally a Site Reliability role)
- Ability to work collaboratively across multiple teams, to take ownership of, prioritise and be accountable for your work
- Excellent communication skills
- Experience with monitoring solutions (Datadog, Azure Application Insights or Log Analytics)
- Experience with scripting/programming languages for automating solutions, e.g. PowerShell (preferred), Bash, C#, Ruby, Python
- Experience supporting web-based applications
It’s desirable for you to have:
- Experience with Azure DevOps pipelines
- Experience of working with Microsoft Server Operating Systems
- Experience of defining service level objectives/operational requirements for a Cloud-based solution
- Understanding and working knowledge of Microsoft Azure Cloud offerings, especially in the Platform as a Service category (Web Apps, Storage, Functions)
- A good understanding or working knowledge of the following tools: Terraform, Ansible, VSTS, ARM, Puppet, Chef, Jenkins, ELK, Grafana
- A good understanding or working knowledge of DNS, Load Balancer configuration, Active Directory and Cloud-based network infrastructure
- Experience of working in an agile environment and experience with agile methodologies such as TDD, Scrum, Kanban
- Understanding and experience of implementing a monitoring and alerting system for a micro-service architecture
- Applied understanding of cloud security best practice
Benefits
- Competitive Salary (£70,000 to £75,000 depending on experience)
- Company Bonus Scheme based on performance
- Private Healthcare & Dental
- 27 days annual leave plus Bank Holidays
- Birthday off work
- 5% Pension
- Death in service (4x salary)
- £350 Christmas voucher
- Monthly/quarterly socials
- Training & Development opportunities
- Hybrid working