Digital Site Reliability Engineer
USG Inc
82 Days Ago
Memphis-TN
5-10 Years
Required Skills: BASH shell scripting, Python, Docker
Job Description
Title: Digital Site Reliability Engineer
Location: Memphis, Tennessee
Duration: 24+ Months
 
POSITION GENERAL DUTIES AND TASKS:
We are looking for a highly skilled and experienced Reliability Engineer to join our team. The ideal candidate has a strong technology background, with specific expertise in Kubernetes, GitLab, Dynatrace, GraphQL, Node, and React, along with a good understanding of CI/CD pipelines. The candidate should be comfortable with ambiguity, eager to learn, and persistent, following the principle of “if at first I don’t succeed, try and try again.”
 
Responsibilities:
Collaborate with cross-functional teams to develop and maintain release architectures and monitoring frameworks.
Provide system design consulting and critical support to the development team before program launch.
Identify and resolve sophisticated performance and scaling issues, working closely with engineers to avoid bottlenecks and ensure traffic demands are met.
Mentor and guide team members, supporting their professional growth.
Identify and implement automation and monitoring tools to enhance the efficiency and effectiveness of SRE processes.
Take ownership of critical incidents, ensuring timely resolution and preventing future occurrences.
 
Mandatory Requirements:
Five (5) to Seven (7) years of professional experience in technology or a related field.
Two (2) years of experience with Kubernetes/EKS.
Two (2) years of experience with CI/CD pipelines.
Two (2) years of experience with a sophisticated observability platform, including RUM and APM.
 
Good to Have Requirements:
Ability to read and understand JavaScript (Node.js).
Experience utilizing Dynatrace APM and RUM (other APM or RUM tools may be applicable); Dynatrace Associate Certification is a plus.
Intermediate to advanced skills in Bash shell scripting, Python, and Docker.
Intermediate skills in creating, configuring, and troubleshooting on-prem GitLab CI pipelines.
 
Preferred Qualifications:
Ability to solve sophisticated performance and scaling issues, collaborating with engineers to avoid bottlenecks and meet traffic demands from both organic growth and marketing events.
Strong problem-solving skills with the ability to perform efficiently in a fast-paced environment.
Ability to communicate effectively with stakeholders, including management, providing updates, recommendations, and solutions for any SRE-related issues.
Excellent communication and collaboration skills.
Experience with Kubernetes/EKS and pod life cycle management, including readiness and liveness checks.
Hands-on experience in building and supporting CI/CD pipelines and production releases.
Working knowledge of complex CDN-cached website architectures.
