Site Reliability Engineer

Description:

Are you passionate about using technology to solve every problem? Do you believe in automation for consistent, scalable, and foolproof delivery of infrastructure and applications? Are you passionate about the concept of infrastructure as code and leveraging modern tools to define, build, and manage infrastructure and applications? We are seeking an Engineer with strong DevOps and Site Reliability Engineering skills to join our Site Reliability Engineering team!

This role is a great mix of technology and teamwork, with exposure to development and business teams. It is designed to provide technical support, analysis, and solutions for production operations.

Responsibilities

Work as a member of our global team managing our hybrid cloud environment.
Manage production hardware and software resources, including coordinating maintenance and outages with globally distributed engineering, operations, and business teams.
Monitor and manage alerts and events using the relevant monitoring and ticketing tools.
Coordinate with Security teams to tune and maintain security best practices (SecOps).
Identify opportunities to optimize performance and enhance security and efficiency, and provide suggestions to Development teams.
Develop the infrastructure as code (IaC) practice within IS by continually increasing automation and improving IaC processes.
Collaborate with development teams to design service architecture, documentation, playbooks, policies and operational procedures.
Assist in efficiently identifying the root cause of technical issues in production.
Support and execute Disaster Recovery testing exercises.
Carry final responsibility for time-critical escalations.

Desired Experience And Qualifications

2+ years’ experience in Application Support/System Administration or another similar technical field preferred.
At least 2 years’ experience administering production *nix and/or Windows environments
Preferred experience:
- Strong Python or Linux scripting
- Oracle and/or MySQL databases
- Docker and/or Kubernetes
- Splunk, Nagios, and/or other Application Performance Monitoring tools
- Apache Proxy/Tomcat and/or Web Services
Excellent problem-solving/troubleshooting skills, fast learner
Practical knowledge of Linux networking, routing, and firewalls
ITIL Foundations certification preferred
Experience working with distributed teams located in other time zones/countries
Belief that DevOps improves your life through configuration management tools such as Terraform, Puppet, Ansible, or AWX
Strong team player who can build relationships and collaborate effectively with peers
Strong organizational skills with high attention to detail
Excellent written and oral communication skills

Organization	ISS | Institutional Shareholder Services
Industry	Engineering Jobs
Occupational Category	Site Reliability Engineer
Job Location	Oklahoma City, USA
Shift Type	Morning
Job Type	Full Time
Gender	No Preference
Career Level	Intermediate
Experience	2 Years
Posted at	2023-09-17 3:58 pm
Expires on	2025-01-22