Site Reliability Engineer

Description:

Responsibilities
Ensure the highest level of uptime and Quality of Service (QoS) for Adobe’s customers through operational excellence

Define service level objectives (SLOs) and service level indicators (SLIs) to represent and measure service quality

Embed with product teams (physically and/or virtually) to develop tight-knit collaboration/partnership

Identify areas to improve service resiliency through techniques such as chaos engineering and performance/load testing

Support and maintain globally distributed multi-cloud (public and/or private) environments

Automate common, repeatable tasks at a large scale to reduce toil

Solve performance and stability issues using a wide variety of tools

Participate in an on-call rotation as required

Determine the root cause of all production-level incidents and write corresponding high-quality root cause analysis (RCA) reports

Promote the DevOps/SRE mentality

Qualifications

Deep understanding of both software engineering and technical operations

DevOps skills (Scrum, Kanban, Agile, CI/CD, 12-factor)

Experience with modern cloud-based SaaS delivery technologies: Kubernetes, AWS, Azure, Jenkins, Git, Atlassian Jira and Confluence, Linux, DNS, email, containers, log analysis, monitoring, Java, Apache, Tomcat, Memcached, Qpid, and MySQL

Expertise with container orchestration engines (Kubernetes)

Programming skills, particularly with Python, Java, and Ruby

Excellent communication, interpersonal, and teamwork skills

Familiarity with Prometheus, Grafana, New Relic, and Splunk

Organization	Adobe
Industry	Engineering Jobs
Occupational Category	Site Reliability Engineer
Job Location	California, USA
Shift Type	Morning
Job Type	Full Time
Gender	No Preference
Career Level	Intermediate
Experience	2 Years
Posted at	2024-02-26 9:41 am
Expires on	2025-05-05