Platform Engineer - AI
Make an impact with NTT DATA
Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive.
Your day at NTT DATA
As a Platform Engineer at NTT DATA, you will lead the design of complex managed service solutions for our largest enterprise clients. Your role involves driving the strategic vision and direction for these solutions, combining technological expertise and business acumen to create IT strategies and roadmaps aligned with our clients' business objectives, KPIs, and SLAs.
Key Responsibilities:
Platform Development & Architecture
Design and build internal developer platforms (IDPs) that provide self-service infrastructure provisioning, deployment pipelines, and operational tooling through intuitive interfaces and APIs
Develop comprehensive platform architecture spanning on-premises, cloud, and hybrid environments with focus on scalability and reliability
Create developer-friendly abstractions for complex infrastructure concepts, including deployment workflows, environment management, and service discovery mechanisms
Operating Systems & Infrastructure Management
Design, implement, and maintain enterprise-grade Linux and Windows server infrastructures, including system installation, configuration, patching, and optimization
Perform advanced system administration tasks including user management, security hardening, performance tuning, and troubleshooting across diverse OS environments
Implement automated OS provisioning and configuration management using infrastructure-as-code principles
Virtualization Technologies
Design, deploy, and manage virtualized infrastructure using VMware vSphere/ESXi, Microsoft Hyper-V, and KVM hypervisors
Conduct capacity planning and performance analysis of virtual infrastructures to optimize resource utilization
Implement backup and disaster recovery solutions for virtual machines including technologies like Veeam and SRM
Integrate virtualization platforms with storage area network (SAN) and network-attached storage (NAS) solutions
Containerization & Orchestration
Design, implement, and maintain Kubernetes clusters across various environments (on-premises, cloud, hybrid) with focus on scalability and high availability
Optimize container orchestration platforms for performance, cost-efficiency, and resource management including advanced scheduling algorithms
Develop and maintain container deployment strategies, including blue-green deployments, canary releases, and rolling updates
Implement service mesh technologies and networking solutions for secure, scalable service-to-service communication
Cluster & Resource Management
Implement advanced scheduling algorithms and resource allocation strategies for distributed workloads across multi-cluster and multi-tenant environments
Design and optimize job scheduling systems with features including backfill algorithms, fair share scheduling, and advance reservations
Manage cluster resource allocation including CPU, memory, storage, and specialized hardware (GPUs) with focus on maximizing utilization and minimizing latency
Implement automated scaling policies and resource optimization techniques for dynamic workload management
CI/CD Pipeline Engineering
Build and maintain sophisticated continuous integration and deployment pipelines incorporating automated testing, security scanning, and progressive deployment strategies
Integrate CI/CD systems with Kubernetes and container orchestration platforms for streamlined application delivery
Implement GitOps workflows and Infrastructure-as-Code practices using tools like Terraform, Pulumi, and Ansible
Monitoring & Observability
Design and implement comprehensive monitoring, logging, and alerting systems providing visibility into platform health and application performance
Deploy observability solutions using tools like Prometheus, Grafana, Jaeger, and distributed tracing systems
Implement automated anomaly detection and performance optimization based on metrics, logs, and traces
Required Qualifications
Education & Experience
Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent practical experience
5+ years of experience in platform engineering, DevOps, site reliability engineering, or similar infrastructure-focused roles
Technical Skills
Expert-level knowledge of Linux system administration (Red Hat Enterprise Linux, CentOS, Ubuntu, Debian) including kernel tuning, process management, and security hardening
Proficiency in Windows Server administration including Active Directory, Group Policy, and PowerShell scripting
Virtualization Technologies
Strong experience with VMware vSphere/ESXi, Microsoft Hyper-V, and open-source hypervisors like KVM
Knowledge of virtualization management tools including vCenter Server and System Center Virtual Machine Manager
Containerization & Orchestration
Expert-level Kubernetes administration including cluster setup, networking, storage, and security
Proficiency with Docker containerization and container image management, preferably including the Rafay and Rancher platforms
Experience with container orchestration patterns and service mesh technologies
Cloud Platforms
Hands-on experience with major cloud platforms (AWS, Azure, Google Cloud Platform) including compute, networking, and storage services
Knowledge of cloud-native technologies and hybrid cloud architecture
Programming & Scripting
Proficiency in scripting languages including Python, Bash, Go, and PowerShell for automation and infrastructure management
Experience with Infrastructure-as-Code tools like Terraform, Pulumi, CloudFormation, or Ansible
Monitoring & Observability
Experience with monitoring solutions including Prometheus, Grafana, Datadog, ELK Stack, and distributed tracing tools
Knowledge of observability best practices including correlation of metrics, logs, and traces
Cluster & Resource Management
Experience with job schedulers and resource management systems like Slurm, PBS, or Kubernetes scheduling frameworks
Understanding of distributed systems architecture and resource optimization techniques
Soft Skills
Strong analytical and problem-solving abilities with experience in complex system troubleshooting
Excellent communication skills and ability to work effectively with cross-functional virtual teams
Product mindset with focus on developer experience and platform usability
Proven ability to work effectively in remote and virtual teams across Europe and internationally
Preferred Qualifications
Kubernetes certifications (CKA, CKAD, CKS) or equivalent cloud platform certifications
Experience with service mesh technologies (Istio, Linkerd) and API gateway solutions
Knowledge of security frameworks and compliance standards (SOC 2, ISO 27001, HIPAA)
Experience with GitOps practices and advanced CI/CD patterns
Background in high-performance computing or large-scale distributed systems
Workplace type: Remote Working
About NTT DATA
NTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. We invest over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are also one of the leading providers of digital and AI infrastructure in the world. NTT DATA is part of NTT Group and headquartered in Tokyo.
Equal Opportunity Employer
NTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We are committed to providing an environment free of unfair discrimination and harassment. We do not discriminate based on age, race, colour, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category. Join our growing global team and accelerate your career with us. Apply today.
Third parties fraudulently posing as NTT DATA recruiters
NTT DATA recruiters will never ask job seekers or candidates for payment or banking information during the recruitment process, for any reason. Please remain vigilant of third parties who may attempt to impersonate NTT DATA recruiters, whether in writing or by phone, in order to deceptively obtain personal data or money from you. All email communications from an NTT DATA recruiter will come from an @nttdata.com email address. If you suspect any fraudulent activity, please contact us (global.careers@nttdata.com).
- Location: London, England, United Kingdom
- Salary: £125,000 - £150,000
- Job Type: Full-time
- Category: IT & Technology