Site Reliability Engineer

Kroo Bank Ltd, London, Full-time

We’re on a mission to build the world’s greatest social bank. We believe that banking needs to change for the better. When money is used correctly, it can transform our daily lives and positively impact the planet.

We’re a varied team of experienced tech, customer experience, marketing, legal and banking professionals and we’re looking for highly motivated and inquisitive people who want to understand customer behaviour and how to deliver the best customer experience.

How you will contribute and key responsibilities:

As a Site Reliability Engineer, you'll take ownership of the implementation, monitoring, maintenance and improvement of our core services. Other responsibilities include:

  • Advocate for reliability across the engineering team and within the wider business.
  • Create, set and monitor SLOs for Kroo’s core services and manage SLO breaches with product managers and engineers.
  • Develop and maintain system-wide application monitoring, including synthetic transactions, to detect and identify potential issues.
  • Monitor third-party provider performance against agreed SLAs.
  • Conduct reliability and resilience tests following chaos engineering practices.
  • Work with Kroo’s codebase and infrastructure to improve reliability and resolve resilience issues.
  • Build and maintain documentation detailing the configurations of deployed solutions and playbooks for incidents and outages.
  • Identify, recommend, and implement changes that will result in positive improvement to performance, resilience and reliability of the platform.
  • Assist with the development and testing of the technical Disaster Recovery (DR) Plan and conduct regular DR tests of core systems.
  • Assist the product engineering team with major releases and go-lives, whilst managing risk and eliminating downtime.
  • Be a part of the on-call schedule for major incident management.

Requirements

To be successful in this role, you must have proven experience within the technology sector, including the following skills and knowledge:

  • Experience working with common public cloud providers such as AWS, Azure or GCP
  • Knowledge of and familiarity with IaC tools such as CloudFormation and Terraform
  • Knowledge of at least one programming language, preferably TypeScript or Clojure
  • Strong understanding of the Software Development Life Cycle (SDLC) with close collaboration with development teams
  • In-depth knowledge of implementing and utilising monitoring and APM tools (Datadog preferred)
  • Knowledge of common SRE practices and understanding of setting and measuring SLOs and error budgets
  • Experience operating high-performance, high-security and high-availability applications and infrastructure, while minimising downtime and risk
  • Familiarity with microservice architecture and related design patterns
  • Experience managing and resolving technical incidents and service outages
  • Knowledge and awareness of common IT security practices
  • Knowledge of and experience implementing common disaster recovery strategies
  • Excellent oral and written communication skills, and strong presentation skills

Experience or knowledge in the following areas is highly desirable but not essential:

  • Experience building, deploying, monitoring and operating containerised workloads (Docker, ECS, EC2 / Fargate) in AWS. Relevant certifications (AWS Solutions Architect, etc.) preferred
  • Working knowledge of agile DevSecOps environments, and CI/CD (Git, Concourse, Terraform for Infrastructure-as-Code)
  • Experience working with suppliers to manage performance against agreed SLAs
  • Experience working in a highly regulated industry such as finance involving compliance reporting to regulatory authorities

Benefits

What we offer:

At our cutting-edge fintech company, we know that attracting and retaining the best talent means offering top-notch benefits that help our employees thrive both in and outside of work. Check out what we currently offer:

  • Generous holiday time: 25 days annual leave, 8 bank holidays, 1 Kroo bank holiday (June 24th), and 1 day off during the week of your birthday.
  • Personal days: We know that life can be unpredictable, so we offer 3 personal days to use as needed.
  • Employer-sponsored volunteer program: We're passionate about giving back to our community, and we support our employees in doing the same with up to 4 hours per month of employer-sponsored volunteer time.
  • Mental health support: We care about the mental health of our team members and offer access to Spill, our mental health support partner.
  • Workplace pension: We want you to feel secure about your future, so we offer a workplace pension with a 5% employee contribution and a 3% employer top-up.
  • Top-notch equipment: We provide the equipment you need for smooth hybrid work, including a MacBook laptop. We also contribute towards your home office setup if required.
  • Modern office: When you're in the office, you'll enjoy access to our modern, bustling workspace in Central London.
  • Cycle to Work scheme: We encourage sustainable transportation with our Cycle to Work scheme.
  • Electric Car scheme: We're committed to reducing our carbon footprint, and our Electric Car scheme makes it easy for our employees to do the same.
  • Enhanced parental leave: We know that family comes first, and we offer an enhanced parental leave policy to support our employees in starting and growing their families.
  • Room for growth: As a fast-paced, high-growth start-up, we're dedicated to providing our employees with room to grow and excel.
  • Healthcare: You get full healthcare for you and your nuclear family via Vitality.

Hybrid Working:

At Kroo Bank, we have a hybrid policy that gives both individuals and teams a lot of freedom when it comes to using the office space to boost productivity. Our London office is a great resource when used effectively. So, employees who can occasionally come to the office are a good fit for how we work right now.

Keep in mind that this job involves working from Monday to Friday, with a mix of remote and office work, so you won't need to be on-site all the time.

Diversity and Inclusion:

We wholeheartedly uphold our commitment to fostering a diverse and inclusive workplace. Every employee is highly regarded, respected, and supported without any form of judgement or prejudice. We consider Diversity, Equality, and Inclusion as fundamental pillars guiding our path in all aspects of our bank.

We also ensure that reasonable adjustments are made available to all candidates throughout the recruitment process.

To all Recruitment Agencies:

At Kroo Bank, agency resumes are strictly prohibited. Do not submit agency resumes or forward them to our job advertisements or Kroo Bank employees. Be aware that Kroo Bank will not assume any responsibility for fees incurred due to unsolicited resumes.

To ensure a fair and efficient application process, all candidates are requested to submit their applications directly through the advertised platform. We kindly ask that you refrain from contacting the company or its employees via email, LinkedIn, or any other communication channel for inquiries or updates.

Please note that any attempts to contact us through these channels will not receive a response. Thank you for your understanding and cooperation.
