Site Reliability Engineer (SRE)
Role Type: Full Time Remote (right to work in UK required)
Office Hours: Flexible, generally aligned with standard UK office hours ±1 hour
Annual Salary: £55k - £70k

Opportunity

Socket is a next-generation software platform powering the client engagement operations of accounting and bookkeeping firms across the UK, Australia and New Zealand.

We’re a fast-growing SaaS company helping accounting and bookkeeping firms strengthen client relationships through smart engagement tools. Our mission is to make every client interaction effortless and meaningful – we do that through clean design, automation, and thoughtful engineering.

Role Overview

We’re looking for our first Site Reliability Engineer to join the team. You’ll play a foundational role in designing and scaling the infrastructure that powers our platform – ensuring it’s secure, performant, and resilient as we grow.

Responsibilities

- Design, build, and maintain reliable infrastructure for our SaaS platform (currently Laravel, Postgres, React + TypeScript).
- Implement robust monitoring, alerting, and observability tools to ensure system health and uptime.
- Manage cloud environments (AWS, Laravel Cloud) often using Infrastructure as Code (Terraform).
- Improve and own CI/CD pipelines to make deployments faster and safer.
- Automate repetitive tasks – provisioning, scaling, testing, backups, etc.
- Establish SLOs, SLIs, and incident management processes appropriate for a growing SaaS product.
- Partner with developers to embed reliability practices in every layer of the stack.
- Help define our approach to security, cost optimisation, and compliance.
- Contribute to software development projects when time permits.

Qualifications

We know there’s no perfect candidate – enthusiasm, curiosity, and a willingness to learn count just as much.

- 3+ years of experience in a DevOps, SRE, or infrastructure engineering role.
- Strong understanding of cloud platforms (AWS preferred).
- Proficiency with Docker and container orchestration.
- Experience managing CI/CD pipelines (AWS, GitHub Actions, etc.).
- Familiarity with monitoring and alerting tools (e.g. Prometheus, Grafana, Datadog).
- Solid scripting or programming ability (e.g. Bash, Python, Go, PHP).
- Excellent documentation and communication skills in async environments.
- A pragmatic mindset – automate what matters, don’t over‑engineer.

Bonus Skills & Attributes

- Exposure to the accounting and bookkeeping industry.
- Experience supporting SaaS products or multi‑tenant systems.
- Familiarity with Laravel, Postgres, or TypeScript environments.
- Interest in site security, compliance, or incident response practices.
- Passion for creating calm, resilient systems that let teams sleep better at night.

Benefits

- Work in a small, high‑trust team where your work has a visible impact.
- Flexible, fully remote setup.
- Autonomy to shape our infrastructure and reliability culture from scratch.
- Be part of a growing SaaS company shaping how accountants engage their clients.

Send us your CV, a few lines about what excites you about this role, and a Loom video walking through something you’ve automated or built – we look forward to meeting you!
- Location: United Kingdom
- Job Type: Full Time