Senior Infrastructure Engineer (Linux & Networking) - Remote
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Infrastructure Engineer (Linux & Networking) in EMEA.
As a Senior Infrastructure Engineer, you will design, deploy, and operate the systems that power high-performance GPU cloud environments. You will leverage your expertise in Linux, networking, and automation to ensure infrastructure is secure, scalable, and high-performing. This hands-on role involves managing Linux systems, configuring network infrastructures, automating deployments, and supporting containerized and virtualized workloads. You will play a key role in monitoring system health, implementing security measures, and responding to incidents. This position offers the opportunity to contribute to cutting-edge technologies, work with global teams, and have a tangible impact on cloud operations and performance.
Accountabilities:
- Provisioning and managing Ubuntu-based Linux systems supporting GPU servers and backend services.
- Designing and maintaining high-speed, low-latency network infrastructure, including firewalls, BGP, VLANs, VXLANs, and VPNs.
- Troubleshooting network-related incidents and ensuring minimal disruption to workloads or customers.
- Automating infrastructure deployments using tools such as Ansible, Terraform, and CI/CD workflows.
- Supporting containerized workloads via Kubernetes or custom orchestration systems, and managing both bare-metal and virtualized GPU platforms.
- Deploying and maintaining monitoring stacks (e.g., Prometheus, Grafana, ELK) to track system health, capacity, and performance.
- Implementing hardening practices, access controls, and audit trails to ensure security and compliance.
- Producing and maintaining accurate documentation for systems, networking, and processes.
Requirements:
- 3-5 years of experience in Linux systems administration or infrastructure engineering.
- Strong networking knowledge, including routing, switching, TCP/IP, DNS, DHCP, VLANs, BGP, and VPN.
- Proficiency in scripting languages such as Bash and Python, and experience with automation tools like Ansible and Terraform.
- Hands-on experience with virtualization, containerization, and infrastructure troubleshooting.
- Familiarity with monitoring and logging systems in production environments.
- Knowledge of InfiniBand (IB), BlueField NICs, UFM, Open vSwitch, or similar software-defined networking technologies.
- Strong analytical, problem-solving, and documentation skills.
- Experience working effectively in team environments and collaborating across global teams.
Preferred / Nice to Have:
- Prior experience in GPU cloud or HPC environments.
- Familiarity with NVIDIA GPU technologies and tooling (e.g., GPU Operator, CUDA Toolkit, DCGM).
- Experience with software-defined networking (SDN, OVS/OVN) and overlay networks (VXLAN, Calico).
- Exposure to networking hardware from Arista, Cisco, MikroTik, or NVIDIA/Mellanox.
- Familiarity with OpenStack private clouds and CMDB tools such as NetBox.
- Knowledge of server provisioning via PXE/iPXE and out-of-band management tools (IPMI, Redfish).
Benefits:
- Competitive salary and performance-based incentives.
- Fully remote work with flexibility across EMEA time zones.
- Full-time permanent contract with international exposure.
- Opportunity to work with a diverse team of talented professionals.
- A collaborative environment that encourages growth, learning, and innovation.
- Exposure to cutting-edge GPU cloud technologies and the ability to make a significant impact.
- Opportunities to participate in international events and initiatives.
Jobgether is an equal opportunities employer and welcomes applications from all qualified candidates. We are committed to creating a diverse and inclusive workplace.
- Location: United Kingdom
- Salary: £80,000 - £100,000
- Job Type: Full-time
- Category: IT & Technology