Site Reliability Engineer | 2025-034

apartmentStarRez placeLondon calendar_month 

About Star Rez

Star Rez, Inc. is the leading student housing and property management platform in the world. Our cloud software solutions serve 1,300 institutions, in 25 countries, with over 3 million beds. With a customer satisfaction score of 99%, many of the most prestigious Universities, Colleges and Property Managers across the globe rely on Star Rez to transform their student residential experience.

Along with the recent combination of Adirondack Solutions and RMS, this growing scale enables even greater opportunities to expand community value through our product capabilities and services. We provide opportunities for students and residents to Thrive!

The Role

Site Reliability Engineers at Star Rez are responsible for ensuring the smooth operation of Star Rez products and platforms. By applying software and systems engineering principles, they enhance system reliability while minimising manual intervention.

SREs are expected to be experienced in software engineering principles, operational discipline, and automation.

As a Site Reliability Engineer, you’ll be joining our Platforms teams with SRE and Platform Engineers based out of three regions in a “follow the sun” model to operate a multi-product/multi-region cloud platform.

Role Specifics
  • Work Location: Remote - United Kingdom
  • Travel:
  • Reporting Structure: Reports to Lead Site Reliability Engineer
What You Will Own
  • Identify and implement solutions to improve platform reliability, including the creation of mitigation strategies and operational playbooks.
  • Implement and maintain monitoring/alerting/logging systems to identify and respond to incidents
  • Participate in Root Cause Analyses (RCAs) and blameless post-mortems
  • Participate in on-call rotations to ensure system reliability and rapid incident response.
  • Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth
  • Conduct performance tests to identify and remediate bottlenecks
  • Develop and maintain platform solutions, automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code.
  • Monitor, review and tune databases to ensure high availability and performance
  • Collaborate with product engineering teams to design/build fit-for-purpose and observable software
  • Contribute within the team to implement defined Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreements (SLAs) as required
Required Qualifications
  • Bachelor's degree in Computer Science, Information Technology, or similar
  • Proven experience (2-years+) in a Platform Engineering, Site Reliability Engineering or Software Engineering role.
  • Proficiency in a least one (or more) object-oriented programming language (C# preferable)
  • Familiarity with one or more public cloud providers such as Azure, AWS or GCP
  • Familiarity using Infrastructure as Code (Ia C) tools such as Terraform (preferred), Ansible, or Cloud Formation.
  • Proficiency in scripting and automation using languages like Bash, Power Shell or Python.
  • Experience with monitoring, observability and logging tools such as Data Dog, Prometheus, Grafana, or similar.
Preferred Qualifications
  • Production experience operating containerization technologies (Kubernetes).
  • Experience in CI/CD tooling: Azure Dev Ops/Git Hub Actions, Octopus Deploy
  • Relevant certifications in cloud platforms (e.g., Microsoft Certified: Azure Solutions Architect) and Dev Ops practices (e.g., Certified Kubernetes Administrator) are a plus
  • Experience in database management/performance tuning, particularly MSSQL.

Reasons to join our Team:

  • Opportunity to be a part of a well-established, high-performance company that has been in business for over 30+ years
  • Full benefits including health care, paid time off, life insurance, and 401k plan with company match for eligible team members.
  • A supportive team environment with emphasis on learning and development opportunities
  • Our Promise: You will learn, grow, and be appreciated for your impact and contributions.
  • Z-Factor: Our most celebrated value, you will work with a team of caring, high-performing, and passionate people who have fun supporting our vision, innovation, and continuous improvement.

We are proud of our diverse workforce and are dedicated to creating a safe and welcoming environment for all employees. People from various ethnicities, ages, genders, and abilities are encouraged to apply.

Notice to external Recruiters and Recruitment Agencies:
Star Rez will not accept unsolicited resumes from recruitment agencies, headhunters, or any other third parties for this role through this website or directly to any employee. Star Rez and any of our subsidiaries will not pay fees to any third-party agency or company.

In addition, we ask that you do not reach out to any employee with regards to this position, or any other positions, now, or in the future.

apartmentBarclays BankplaceLondon
Join us as a Container Strategy Engineer at Barclays, where youll spearhead the evolution of our digital landscape, driving innovation and excellence. Youll harness cutting-edge technology to revolutionise our digital offerings within Foundations...
electric_boltImmediate start

Site Reliability Engineer

apartmentTrade NationplaceLondon
As a Site Reliability Engineer (SRE) at Trade Nation, you will be part of a dynamic and collaborative team that ensures the reliability, availability, and performance of our web services and applications. You will work closely with developers...
apartmentNexus Jobs LimitedplaceLondon
Job Description Site Reliability Engineer with Python Our Client looking to bring on a site reliability engineer to help deploy, manage, troubleshoot, and enhance our complex cloud-based set of internal tools and externally managed...