Back to feed

Site Reliability Engineer/ Chaos Engineer Remote to start

Remote Full-time Live

Site Reliability Engineer/ Chaos Engineer – Remote to Start Location Malvern, PA Position type 06 months contract Start date Immediately Rate DOE US Citizens and Green Card, GC-EAD and TN VISA accepted.

Job Description

Candidate will be part of the SRE team and lead technical role to determine Reliability Chaos Engineering needs of mission critical systems and business processes. Candidate will assess high level architecture and design issues relating to platform enterprise software interactions with other systems. Application development infrastructure database and middleware teams to ensure stability and reliability of the system. Chaos Engineering will proactive detect issues within the applications platform network and databases in a controlled way using Chaos tools like Chaos Monkey Gremlin Simian Army. Candidate should have familiarity with Internet protocols such as HTTP DNS TCP and UDP and Linux development environment and well versed with DevOps. Candidate will identify anti patterns optimization and support development of self-healing capabilities.

Responsibilities

Create operational tooling for monitoring self-healing infrastructures and chaos testing. Design and create controlled chaos in production systems. Create Python and Terraform scripts. Work across teams identify and fix issues that affect systems reliability and performance. Guide and design architectural decisions and direct solutions that will enhance our client’s product reliability. Dive into system and latent reliability issues service performance and capacity modeling of distributed systems at scale. Partner with development team to identify anti patterns and optimization strategies create fallback options and help develop self-healing capabilities across the enterprise in a sustainable manner. Apply tot his job Apply To this Job Apply To This Job

On the same wavelength

Site Reliability Engineer (SRE) in Austin, TX (Remote)

Remote Full-time

SRE “ Site Reliability Engineer”

Remote Full-time

Senior Site Reliability Engineer, APAC

Remote Full-time

Sr. Infrastructure Engineer - Kubernetes (Remote)

Remote Full-time

Lead Kubernetes Engineer; Fulltime- Remote

Remote Full-time

AWS Engineers - Kubernetes

Remote Full-time

[Remote] Sr. AWS DevOps Engineer- Kubernetes Expertise

Remote Full-time

Senior Platform Engineer – Kubernetes/OpenShift (Secret)

Remote Full-time

AWS Cloud & Kubernetes DevOps Engineer

Remote Full-time

Network Administrator/ Network Engineer - Remote + W2 Only

Remote Full-time

Data and Turf Director - North Carolina

Remote Full-time

Experienced Customer Service Representative – Delivering Exceptional Experiences for arenaflex Customers

Remote Full-time

Part-Time Customer Support Agent – Delivering Exceptional Experiences at arenaflex

Remote Full-time

Experienced Customer Logistics Representative – Logistics Operations and Customer Service

Remote Full-time

Part-Time Remote Data Entry Specialist – Flexible Hours & Remote Work Opportunity at arenaflex

Remote Full-time

Sr. Computational Statistician

Remote Full-time

INFLUENCER & COMMUNITY MARKETING COORDINATOR

Remote Full-time

Strategic Sales Executive, ClinicalKey AI

Remote Full-time

Experienced Customer Service Representative – Disney Enthusiast Wanted for Remote Role

Remote Full-time

Software Engineer, Data Infrastructure & Acquisition - Salvador, Brazil

Remote Full-time