Description:
Responsibilities
Ensure the highest level of uptime and Quality of Service (QoS) to Adobe’s customers through operational excellence
Define service level objectives (SLOs) and service level indicators (SLIs) to represent and measure service quality
Embed with product teams (physically and/or virtually) to develop tight-knit collaboration/partnership
Identify areas to improve service resiliency through techniques such as chaos engineering and performance/load testing
Support and maintain globally distributed multi-cloud (public and/or private) environments
Automate common, repeatable tasks at a large scale to reduce toil
Solve performance and stability issues using a wide variety of tools
Participate in an on-call rotation as required
Determine the root cause for all production-level incidents and write corresponding high-quality RCA reports
Promote the DevOps/SRE mentality
Qualifications
Deep understanding of both software engineering and technical operations
DevOps skills (Scrum, Kanban, Agile, CI/CD, 12-factor)
Experience with modern cloud-based SaaS delivery technologies: Kubernetes, AWS, Azure, Jenkins, Git, Atlassian Jira and Confluence, Linux, DNS, email, containers, log analysis, monitoring, Java, Apache, Tomcat, Memcached, Qpid, and MySQL
Expertise with container orchestration engines (Kubernetes)
Programming skills, particularly with Python, Java, and Ruby
Excellent communication, interpersonal, and teamwork skills
Familiarity with Prometheus, Grafana, New Relic, and Splunk
Organization | Adobe |
Industry | Engineering Jobs |
Occupational Category | Site Reliability Engineer |
Job Location | California, USA |
Shift Type | Morning |
Job Type | Full Time |
Gender | No Preference |
Career Level | Intermediate |
Experience | 2 Years |
Posted at | 2024-02-26 9:41 am |
Expires on | 2024-12-15 |