Site Reliability Engineer

 

Description:

Responsibilities
Ensure the highest level of uptime and Quality of Service (QoS) to Adobe’s customers through operational excellence

Define service level objectives (SLOs) and service level indicators (SLIs) to represent and measure service quality

Embed with product teams (physically and/or virtually) to develop tight-knit collaboration/partnership

Identify areas to improve service resiliency through techniques such as chaos engineering, performance/load testing, etc

Support and maintain globally distributed multi-cloud (public and/or private) environments

Automate common, repeatable tasks at a large scale to reduce toil

Solve performance and stability issues using a wide variety of tools

Participate in an on-call rotation as required

Determine the root cause for all production-level incidents and write corresponding high-quality RCA reports

Promote the DevOps/SRE mentality

Qualifications

Deep understanding of both software engineering and technical operations

DevOps skills (scrum/Kanban/agile/ci-cd/12-factor)

Experience in modern cloud-based, SaaS delivery technologies: Kubernetes, AWS, Azure, Jenkins, Git, Atlassian Jira and Confluence, Linux, DNS, E-mail, containers, log analysis, monitoring, Java, Apache, Tomcat, Memcached, Qpid, and MySQL on Linux

Expertise with containerization orchestration engines (Kubernetes )

Programming skills, particularly with Python, Java, and Ruby

Excellent communication, interpersonal, and teamwork skills

Familiarity with Prometheus, Grafana, NewRelic , and Splunk

 

Organization Adobe
Industry Engineering Jobs
Occupational Category Site Reliability Engineer
Job Location California,USA
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Intermediate
Experience 2 Years
Posted at 2024-02-26 9:41 am
Expires on 2024-12-15