Site Reliability Engineer

 

Description:

 

Are you passionate about using technology to solve every problem? Do you believe in automation for consistent, scalable and fool-proof delivery of infrastructure and applications? Are you passionate about the concept of infrastructure as code and leveraging modern tools to define, build and manage infrastructure and applications? We are seeking an Engineer with strong DevOps and Site Reliability Engineering skills to join our Site Reliability Engineering team!

This role is a great mix of technology and teamwork with exposure to development and business teams designed to provide technical support, analysis, and solutions for production operations.

Responsibilities

  • Work as member of our global team managing our hybrid cloud environment.
  • Manage production hardware and software resources including coordinating maintenance and outages with globally distributed engineering, operations, and business teams.
  • Monitoring & management of alerts/events via relevant monitoring & ticketing tools.
  • Coordinate with Security teams for tuning and maintaining security best practices (SecOps).
  • Identify opportunities for optimizing performance, enhancing security/efficiency, and provide suggestions for Development teams.
  • Develop infrastructure as code practice within IS by constantly increase automation and improve IaC processes
  • Collaborate with development teams to design service architecture, documentation, playbooks, policies and operational procedures.
  • Assist in efficiently identifying root cause of technical issues in production.
  • Support and execute Disaster Recovery testing exercises
  • Carry final responsibility for time-critical escalations

Desired Experience And Qualifications

  • 2+ years’ experience in Application Support/System Administration or other similar technical field preferred.
  • Experience with Administrating production *nix and/or Windows environments (At least 2 years)
  • Preferred experience:
    • Strong Python or Linux scripting
    • Oracle and/or MySQL databases
    • Docker and/or Kubernetes experience preferred
    • Splunk, Nagios, and/or other Application Performance Monitoring tools
  • Apache Proxy/Tomcat and/or Web Services experience preferred
  • Excellent problem-solving/troubleshooting skills, fast learner
  • Practical knowledge of Linux networking, routing, and firewalls
  • ITIL Foundations certification preferred
  • Experience working with distributed teams located in other time-zones/countries
  • Believe that DevOps betters your life through configuration management tools such as Terraform, Puppet, Ansible, or AWX
  • Strong team player who can build and effectively collaborate amongst peers
  • Strong organizational skills with high attention to detail
  • Excellent written and oral communication skills

Organization ISS | Institutional Shareholder Services
Industry Engineering Jobs
Occupational Category Site Reliability Engineer
Job Location Oklahoma City,USA
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Intermediate
Experience 2 Years
Posted at 2023-09-17 3:58 pm
Expires on 2024-10-19