Job Description
About Us:
At Dabster, we specialize in connecting top talent with leading global companies. We are currently hiring for a Senior Data Infrastructure Site Reliability Engineer (SRE) to join our client's cutting-edge team in London, UK. Our mission is to be the foremost recruitment specialist in securing exceptional talent for diverse industries including technology, engineering, and digital innovation.

Who Will You Work With:
You'll be working with one of the most innovative tech companies in the world, a leader known for creating transformative user experiences. The team you'll join is part of the Data Virtualization Infrastructure (DVI) group, responsible for managing Apple's largest multi-cloud storage abstraction and caching platform. Their work supports mission-critical machine learning workloads that power user-facing features across the Apple ecosystem.

About the Role:
As a Senior Data Infrastructure SRE, you will help ensure the reliability, scalability, and performance of a complex infrastructure stack spanning both first-party and third-party cloud environments. Your focus will be on automation, operational efficiency, and cloud storage systems, with deep involvement in Kubernetes-based deployments. This is a 6-month B2B contract role based in London, UK, with a strong possibility of extension.

Key Responsibilities:
Operate, monitor, and scale infrastructure supporting mission-critical workloads.
Work across multi-cloud environments to manage storage abstraction and caching platforms.
Own and execute projects from design through to delivery.
Build tools to automate manual operations and improve reliability.
Participate in a rotating on-call schedule, including occasional weekends.
Collaborate closely with globally distributed teams (Cupertino, Bangalore, and London).

Preferred Skills & Experience:
5 years of experience building and scaling large applications in public, private, or hybrid cloud environments.
Deep hands-on expertise with Kubernetes, particularly with GKE or EKS.
Programming experience in Python, Go, or Rust.
Experience with object storage technologies such as Amazon S3 or Google Cloud Storage (GCS).
Strong background in Linux internals, networking protocols, and distributed systems.
Skilled in deploying large-scale applications and using tools like Helm, Flux, or Spinnaker.
Proven ability to troubleshoot complex networking issues and drive automation across systems.

What We Offer:
Impactful Work: Support the development of infrastructure for next-gen ML workloads and user-facing technologies.
Global Collaboration: Work with talented teams across the US, India, and the UK.
Hybrid Cloud Innovation: Contribute to managing and evolving infrastructure at a truly global scale.
Contract Type: 6-month B2B engagement with potential for extension.

Interview Process:
Expect two to three interview rounds, including technical deep-dives and team fit discussions.

How to Apply:
If you're a skilled SRE with deep experience in infrastructure, automation, and cloud-native systems, we'd love to hear from you. Submit your resume via LinkedIn or send it directly to (email address removed). Elevate your career by working on globally distributed, mission-critical infrastructure at one of the most admired tech companies in the world.

Location: London