Manager, Operational Resilience Engineering
New Today
Overview
At Fanatics Betting & Gaming (FBG), a core division of Fanatics’ mission to build the ultimate end-to-end digital sports platform, we’re shaping the future of sports betting. As part of our team, you’ll help create cutting-edge experiences that match the passion of fans worldwide. We’re looking for talented individuals to join us in driving innovation across our Sportsbook and Casino products, helping to strengthen the reliability, efficiency, and resilience of our platforms. In this role, you’ll contribute to reducing the frequency and impact of Priority 1 incidents with measurable improvements in Mean Time to Mitigation (MTTM), ensuring smooth and transparent incident processes that keep engineering teams focused while stakeholders remain informed. You’ll also help embed a culture of continuous improvement, turning post-incident reviews into actionable follow-ups that prevent repeat issues, while supporting proactive operations through runbooks, automation, and advanced monitoring to reduce day-to-day overhead. By tracking and reporting key operational health metrics, you’ll play a key role in building data-driven resilience across the business.
Responsibilities
- Build, lead, and mentor the Operational Resilience Engineering team.
- Own the end-to-end incident management lifecycle, including response, escalation, postmortems, and continuous improvement.
- Define and standardize incident playbooks, escalation practices, and operational metrics (MTTR, error budgets, reliability KPIs).
- Champion a blameless culture that drives learning and operational maturity.
- Partner with engineering, product, and business leaders to align resilience priorities with risk, compliance, and customer impact.
- Deliver executive-level communications during major incidents with clarity and business context.
- Stay hands-on in critical incidents and guide improvements in monitoring, alerting, automation, and observability.
- Leverage and evolve key tools (AWS, Datadog, PagerDuty, Terraform, GitHub) to improve operational resilience.
Required Qualifications
- 8+ years in platform operations, site reliability, incident management, or operational resilience.
- 3+ years in leadership roles, including team management and cross-functional incident response.
- Proven track record leading high-severity incidents and communicating effectively under pressure.
- Strong technical background in cloud platforms (AWS preferred), infrastructure-as-code (Terraform), and CI/CD/containerized workloads.
- Deep familiarity with observability and incident tooling (Datadog, PagerDuty, FireHydrant, or similar).
- Strong written and verbal communication skills, with ability to distill technical detail into executive-ready updates.
- Demonstrated success driving metrics-based improvements in reliability and availability.
- Skilled in stakeholder management and influencing across engineering, product, and business leadership.
Other Qualifications / Nice To Have
- Experience in high-availability, high-transaction industries (sports, gaming, entertainment, or similar).
- Familiarity with cloud-native development and modern infrastructure practices.
- Knowledge of ITIL or incident management best practices (certification not required).
- Background with regulatory or compliance-driven environments.
Ready to build the future of sports betting? If you possess some of these skills but not all of them, we still encourage you to apply!
Please note that visa sponsorship is not available for this position. We are open to fully remote candidates based in the United Kingdom or Ireland, but we strongly encourage those who can join us on campus two days per week.
Additional Details
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Engineering and Information Technology
- Industries: Technology, Information and Internet
Referrals increase your chances of interviewing at Fanatics by 2x. Get notified about new Engineering Manager jobs in United Kingdom.
- Location:
- United Kingdom
- Salary:
- £100,000 - £125,000
- Job Type:
- FullTime
- Category:
- Engineering