Senior Engineer, Database Infrastructure - Slack

New Today

Overview

Our Team Slack's Datastores team builds and operates the database platform powering Slack. We write software to manage thousands of stateful hosts, providing several petabytes of online database capacity. We are building one of the fastest-growing database platforms in the world. Our MySQL databases run in Vitess. You can read more about our migration to Vitess at Scaling Datastores at Slack with Vitess.

Background

Slack enables people all over the world to communicate and collaborate together. Teams of all scales — from the world’s largest public companies to the smallest of startups — use Slack to get work done, so we take performance and reliability very seriously. A taste of our scale:

  • 5 billion+ messages are sent per week, half of those outside the United States
  • Every day we see over 10M+ daily active users, 30+ billion web requests, and 200+ billion database queries.

For millions of people, Slack is the primary communication tool they use at work all day long. They expect it to be exceptionally reliable and fast, all the time.

Core Infrastructure at Slack

We operate at tremendous scale with systems that process millions of events per second. Teams in our group maintain and build the lower levels of our stack, including:

  • Edge services
  • Data Stores and Caches
  • Real-time messaging
  • Asynchronous background job processing

We know we’ve done our job correctly when none of our users think about us. We don\'t typically ship new user-facing features, but rather ensure our systems are incredibly performant, highly available, reliable, and scalable. Slack\'s API and web backend is built on PHP/Hack, our backend services are written in Java and Go, and we use Vitess as our storage engine. Our architecture is constantly evolving to handle millions more users. You can read about how we scaled our datastores with Vitess, how we respond to incidents, and much more on our blog.

What you would do over the course of a typical week

  • Operate and enhance our large, highly-available database infrastructure, utilizing technologies such as MySQL and Vitess.
  • Develop tools to enable self-service and self-managing capabilities of our database infrastructure so that other teams can operate full-stack while rapidly building new features for our customers.
  • Collaborate with engineering teams on their database storage needs, and advise them throughout the development lifecycle.
  • Write code to capture database performance, and create tools and dashboards to provide actionable insight into that data.
  • Participate in our on-call rotation and collaborate with our operations team to triage and resolve production issues.

You may be a fit for this role if you

  • Have been working in data storage, core infrastructure, or distributed system-owning teams with increasing responsibilities for 5+ years.
  • Have professional experience using Go, PHP/Hacklang, Python, Ruby, or Java.
  • Write code that can be easily understood by others with an eye towards clarity and maintainability. Collaborate with other teams to integrate new features of your platform or adopt self-service features.
  • Operated at least one distributed system, at scale and in a team environment. Some examples include: a relational database like MySQL/Postgres, or systems like Kafka, Cassandra, or ElasticSearch.
  • Deployed server software on Linux, and then operated it at scale. You’ve debugged its problems, and analyzed and optimized its performance.
  • Have experience operating cloud infrastructure, especially AWS.
  • Are familiar with deployment automation/configuration management tools like Chef, Terraform, Ansible, or Puppet.
  • Are a very strong communicator. You’re excited to explain complex technical concepts and share your knowledge with different audiences.

Core Infrastructure is a diverse and inclusive team that treats their colleagues exceptionally well. We are happy to help you learn what you need to know; we encourage and support each other\’s growth and thus it’s not expected that you would have expertise across all of these areas. Come join us!

#J-18808-Ljbffr
Location:
England, United Kingdom
Salary:
£80,000 - £100,000
Job Type:
FullTime
Category:
IT & Technology