Software Reliability Engineer, Senior

Expert Employment, Abingdon

Software Reliability Engineering combines software development and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. Software Reliability Engineers influence the whole lifecycle of services, from inception and design through deployment, operation and refinement.

Key Skills
Python 3.

Understanding of Docker and Kubernetes.

Strong in Software Engineering: development lifecycle, DevOps, code release management and development tools.
Ability to debug and optimize code and automate routine tasks.

Good to have: Cloud technology (GCP/AWS/Azure), Java.

Responsibilities

Maintain and improve services once they are live by measuring and monitoring availability, latency and overall system health.
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
Engage in incident response and blameless postmortems.
Maintain a broad knowledge of state-of-the-art computer technology, equipment and systems; participate in professional development activities as appropriate.

Support Software Development tooling such as Rundeck, PagerDuty, Stackdriver, PAM access (CyberArk), Operational Readiness (internal process), DR/incident drills, incident reports, cost dashboards, billing exports, certificates, etc.
