Senior Site Reliability Engineer
4 Days Old
Fully remote working for candidates based in the UK. Salary: £85k to £90k + benefits.
We are looking for a Senior Site Reliability Engineer / DevOps Engineer who has come from a software development background and still has strong skills in C#, Java or a similar OO development language, combined with strong knowledge of DevOps tools such as Kubernetes and/or Docker and of the Azure or AWS cloud platforms. The successful candidate will join the client's growing global Cloud Infrastructure team supporting their SaaS products.
Our client, a global digital SaaS software company, has a fantastic fully remote opportunity for an experienced Senior Site Reliability Engineer / DevOps Engineer to join its UK Cloud Infrastructure team.
Senior Site Reliability Engineers at this company are responsible for keeping the SaaS products running properly. Using concepts of software and systems engineering, they work to improve the reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation.
The SRE team works on a fully remote basis, in conjunction with the company's US and Australian teams. The company is a market leader in student community management software, and its unique SaaS platform is an essential part of life for millions of university students across the globe.
In this role, you will apply your Software Engineering experience to enhance system performance and reliability, as well as building internal systems and capabilities that eliminate manual work through automation. You'll be joining our Platforms teams with globally-dispersed Site Reliability and Platform Engineers in a "follow the sun" model to operate our products on a multi-region cloud platform.
Role Responsibilities:
- Provide technical leadership and mentoring within the team through knowledge sharing sessions, pair programming, code reviews and solution design
- Identify and implement technical solutions to improve platform reliability, including the creation of mitigation strategies and operational playbooks.
- Implement and maintain monitoring/alerting/logging systems to identify and respond to incidents
- Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth
- Conduct performance tests to identify and remediate bottlenecks
- Develop and maintain platform solutions, automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code.
- Monitor, review and tune databases to ensure high availability and performance
- Collaborate with product engineering teams to design/build fit-for-purpose and observable software
Required Skills and Experience:
- Proven experience in an SRE / DevOps / Platform Engineering role, having previously worked in a Software Engineering role using .NET and C#, Java or a similar OO development language.
- Proficiency in C# or Java or another OO development language alongside knowledge of scripting languages like Bash, Python or PowerShell
- Production experience operating containerization technologies - ideally with Kubernetes and/or Docker.
- Proficiency with one or more public cloud providers such as Azure, AWS or GCP
- Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation.
- Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar.
- Proven track record of maintaining highly-available and performant production environments.
- Ability to identify and implement effective mitigation strategies and operational playbooks.
Useful / Bonus Skills to have:
- Experience in CI/CD tooling: Azure DevOps/GitHub Actions, Octopus Deploy
- Relevant certifications in cloud platforms (e.g., Microsoft Certified: Azure Solutions Architect) and DevOps practices (e.g., Certified Kubernetes Administrator) are a plus
- Experience in database management/performance tuning, particularly MSSQL.
Employee benefits:
- Opportunity to be a part of a 30+ year well-established, high-performance SaaS company.
- Excellent company pension scheme and life insurance.
- Excellent holiday allowance.
- A supportive team environment with emphasis on learning and development opportunities
- Working with a team of caring, high-performing, and passionate people who have fun supporting our vision, innovation, and continuous improvement.
This Senior Site Reliability Engineer role is with a market-leading global software company and is part of a large programme of change and improvement in its cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, this would be a tremendous career opportunity to consider.
Please apply with your CV to find out more.
- Location: London, England, United Kingdom
- Salary: £125,000 - £150,000
- Category: Engineering