Senior Site Reliability Engineer
| Posting date: | 20 January 2026 |
|---|---|
| Hours: | Full time |
| Closing date: | 19 February 2026 |
| Location: | London, EC2M 4AA |
| Company: | NatWest Group |
| Job type: | Permanent |
| Job reference: | R-00269103 |
Summary
Join us as a Senior Site Reliability Engineer
In this key role, you’ll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
You'll work from home some of the time, but you'll also spend a significant amount of time working from an office or hub
What you'll do
As a Senior Site Reliability Engineer, you’ll work closely with our feature team and other colleagues to meet defined service level objectives and continually improve systems and environments. You’ll define error budgets that support finding the right balance between risk and reliability.
You’ll also provide structure and help to our release process, suggesting and making improvements where possible. You’ll help scale systems sustainably through mechanisms like automation, evolving them by pushing for changes that improve reliability and velocity. We’ll also look to you to coach and provide guidance to colleagues and the wider team, leading where required.
In addition to this, you’ll:
Proactively contribute new ideas and innovations to meet short term and longer-term goals
Continually balance and manage any potential risks
Be accountable for the day-to-day health of both production and non-production environments and respond to any incidents as required
Provide technical expertise and input to establish the risk tolerance of products and services
Communicate incident status updates clearly and frequently to other teams, customers and stakeholders
The skills you'll need
We’re looking for an experienced Senior SRE with a proactive approach to spotting problems, areas for improvements and performance bottlenecks. You'll need experience working with cloud-native microservices, including containerisation, management of Kubernetes workloads, and API management.
We’re also looking for:
Hands-on experience of Azure, Infrastructure as Code, and technologies such as PowerShell, JSON, Azure Bicep, ARM and Azure DevOps
Experience with Full Stack Observability using tools such as Grafana Stack, Log Analytics, AppInsights
Excellent knowledge of DevOps processes and principles
Knowledge of IT Service Management and automation of IT fulfilment processes through Orchestration and ServiceNow
Strong communication skills with the ability to proactively engage with a wide range of stakeholders