Featured

Venquis

Senior Site Reliability Engineer

This role is for a Senior Site Reliability Engineer, focused on e-commerce platforms, requiring expertise in observability, cloud scalability, and performance tuning. It offers a permanent contract, competitive salary, and is remote/hybrid in the UK. Key skills include distributed systems, cloud services, and automation coding.
🌎 Country
United Kingdom
🏝️ Location
Hybrid
📄 Contract
Full-time
🪜 Seniority
Mid-Senior level
💰 Range
Unknown
💱 Currency
£ GBP
💸 Pay
Unknown
🗓️ Discovered
August 28, 2025
📍 Location detailed
London, England, United Kingdom
recTXRnkomgVNSEr9
🧠 Skills
#JavaScript
Role description
Location: Remote / Hybrid - UK Sector: E-Commerce & Retail Platforms A global retail brand is scaling its e-commerce and digital customer platforms, handling millions of daily transactions and peak seasonal traffic. To support this growth, they are hiring a Site Reliability Engineer with deep expertise in observability, cloud scalability, and performance tuning. What you'll do: • Build and maintain highly scalable cloud infrastructure for large-scale e-commerce platforms. • Develop monitoring and observability frameworks to ensure fast response to performance bottlenecks. • Optimise CDN, caching, and APIs for high-traffic shopping events (e.g., Black Friday). • Drive automation and CI/CD pipelines to accelerate feature delivery without compromising stability. • Partner with software engineering teams to ensure always-on shopping experiences. What we're looking for: • Proven track record in high-scale distributed systems (retail, e-commerce, digital platforms). • Expertise in observability stacks (Grafana, Prometheus, Datadog, NewRelic, Elastic). • Strong cloud skills (AWS/GCP/Azure) including Kubernetes and serverless. • Solid coding skills for automation (Python, Go, JavaScript, Bash). • Experience optimising performance in high-traffic digital platforms. This is your chance to build reliability at retail scale, where seconds of downtime mean millions in lost revenue. Location: Remote / Hybrid - UK Sector: E-Commerce & Retail Platforms A global retail brand is scaling its e-commerce and digital customer platforms, handling millions of daily transactions and peak seasonal traffic. To support this growth, they are hiring a Site Reliability Engineer with deep expertise in observability, cloud scalability, and performance tuning. What you'll do: • Build and maintain highly scalable cloud infrastructure for large-scale e-commerce platforms. • Develop monitoring and observability frameworks to ensure fast response to performance bottlenecks. • Optimise CDN, caching, and APIs for high-traffic shopping events (e.g., Black Friday). • Drive automation and CI/CD pipelines to accelerate feature delivery without compromising stability. • Partner with software engineering teams to ensure always-on shopping experiences. What we're looking for: • Proven track record in high-scale distributed systems (retail, e-commerce, digital platforms). • Expertise in observability stacks (Grafana, Prometheus, Datadog, NewRelic, Elastic). • Strong cloud skills (AWS/GCP/Azure) including Kubernetes and serverless. • Solid coding skills for automation (Python, Go, JavaScript, Bash). • Experience optimising performance in high-traffic digital platforms. This is your chance to build reliability at retail scale, where seconds of downtime mean millions in lost revenue. Highly competitive salary plus bonus % paid yearly. Venquis is acting as an Employment Agency in relation to this vacancy.