Required Skills: Prometheus, Cortex, Grafana, New Relic, Splunk
Job Description
Job Title: Site Reliability Engineer
Location: Lehi, UT
Responsibilities:
Help build and operate services like Adobe Sign.
Enforce security controls required by standards including PCI-DSS, HIPAA, and SOC 2.
Deliver infrastructure as code, automated wherever possible, for resources like DNS, log management, and code deployments.
Participate in the on-call pager rotation.
Participate in the incident management process.
Assist in the creation and refinement of operational documentation.
Manage our uptime and performance using service level indicators and objectives.
Our current stack: Apache, Tomcat, Memcached, Qpid, Kubernetes, and MySQL on Linux.
We run blue/green deploys via Jenkins CI/CD pipelines and use stack builder automation for infrastructure.
Required Skills:
Strong programming skills, particularly with Python.
Experience implementing Chef, Docker, Kubernetes, etc. in AWS and Azure.
Familiarity with Prometheus, Cortex, Grafana, New Relic, and Splunk.