Job Description:
Site Reliability Engineer (SRE)
đ London (Hybrid â 2â8 days per month in office)
đ° ÂŁ50,000 per annum
đ Clear progression to Mid-Level SRE within 18 months
Weâre working with a growing, engineering-led organisation looking to hire a Site Reliability Engineer who enjoys solving real platform problems through automationânot just firefighting tickets.
This is an ideal opportunity for someone with 2â3 years of DevOps, Platform Engineering or SRE experience who wants to take ownership of CI/CD, infrastructure-as-code, and platform tooling while continuing to build production-grade coding skills.
The OpportunityThis role blends hands-on engineering with platform ownership. Youâll spend your time split between:
- Supporting developers with broken builds and deployments (40%)
- Designing and building automation, CI/CD pipelines, and Terraform infrastructure (60%)
Youâll act as the automation backbone of the platformâreducing manual effort, improving reliability, and enabling engineering teams to move faster.
Key Responsibilities
Developer Support & Troubleshooting (40%)- Debug failing builds, deployments, and CI/CD pipelines
- Provide Tier 2/3 support via Slack, tickets, and pairing sessions
- Take ownership of incidents, ensuring reliable and timely resolution
Platform Engineering & Automation (60%)- Design, build, and optimise CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI)
- Develop and maintain Terraform modules for infrastructure-as-code
- Build automation tools (CLI tools, scripts, GitHub Apps, self-service tooling)
- Own observability: dashboards, alerts, monitoring, and runbooks
- Continuously improve platform processes and reduce operational toil
What Weâre Looking For
Essential Skills & Experience- 2â3 years in DevOps, SRE, or Platform Engineering
- Strong Linux troubleshooting and systems knowledge
- Proven experience with Terraform (module design, not just usage)
- CI/CD experience (GitHub Actions, GitLab CI, Jenkins)
- Ability to write production-quality code in Python or Bash
- Solid networking fundamentals (DNS, load balancers, CDNs)
- Experience with observability tools (NewRelic, Datadog, Prometheus, Grafana)
- Comfortable participating in on-call rotations
- Experience using AI tools (e.g. ChatGPT, Copilot, Cursor) to enhance productivity
Desirable- Go, Ansible, or configuration management experience
- Experience working with multiple CDNs (CloudFront, Fastly, Cloudflare)
About You
- Youâre a proactive problem-solver who automates rather than repeats
- You communicate clearly with both technical and non-technical stakeholders
- You stay calm under pressure and take ownership during incidents
- You care about clean, maintainable, production-quality code
- You actively use AI tools to improve how you build and debug systems
Whatâs On Offer
- ÂŁ50,000 salary
- Genuine ownership of CI/CD and platform automation
- Direct collaboration with the Head of Technology
- Clear progression to mid-level SRE within 18 months
- Learning budget and dedicated development time
Why Apply?This is not a ticket-driven support role. Youâll be a key technical contributor shaping how the platform operatesâworking alongside engineers who code and influencing real infrastructure and tooling decisions.