Principal Site Reliability Engineer
New Today
Overview
Our client is looking for a Principal Site Reliability Engineer to join their team on an initial three month contract with good scope for extension. The role is Inside IR35 and requires an active SC clearance. The candidate should be able to travel to site in Wokingham twice a week and work remotely otherwise.
Role Description
Collaborate with Agile teams to automate deployment, monitoring, and infrastructure management.
Ensure platform and business application reliability and performance against strict SLAs and KPIs.
Implement and maintain cloud-native observability stacks (Prometheus, Grafana, Loki, Tempo).
Develop and maintain Infrastructure as Code (IaC) using tools like Kustomize or Helm.
Manage CI/CD pipelines using Tekton and ArgoCD.
Support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ).
Conduct security reviews and implement controls aligned with national infrastructure standards.
Mentor junior engineers and promote SRE best practices.
Collaborate with vendors and IT teams for incident resolution and platform improvements.
Required Skills
Strong communication skills (written and verbal).
Experience in remote team collaboration.
Deep expertise in OpenShift/Kubernetes and RedHat Linux.
Proficiency in scripting (Bash, Python) and templating (Helm, Kustomize).
Experience with CI/CD automation and IaC strategies.
Security-first mindset with experience in regulated environments.
Experience with VMware vSphere virtualization?
Clearance and Eligibility
Note: The role requires an active SC clearance. Candidates who have held high level security clearance in the past are welcome to apply. Successful applicants will be required to be security cleared prior to appointment, which can take up to a minimum of 10 weeks.
#J-18808-Ljbffr
- Location:
- England, United Kingdom
- Job Type:
- FullTime