Site Reliability Engineer

 

Description:

  • Work with a large established cross-functional team that is focusing on product design and delivery.
  • Work on expanding the Open Banking platform capabilities into new markets, opportunities to offer new payment flows are being explored and offered.
  • Be involved in the entire Open Banking AWS DevOps world. This includes Configuration and infrastructure work and aligning with other team to ensure accuracy integrity in and Open Banking platform.
  • Lead the implementation of new systems, think strategically, and develop process, tools, and frameworks to help ensure effectiveness across teams.
  • Identify and remedy single point of failure and security risks.
  • Continuously improve self-service tools and processes to reduce cycle times for developers and automate repetitive and wasteful operations.
  • Manage databases, caching servers, message queues, centralized logging, etc. including AWS aurora, MySQL, MongoDB, Redis and RabbitMQ.
  • Maintains tools and components including Kubernetes and AWS. Interface with external logging, monitoring, and security vendors.
  • Build out Infrastructure code in Terraform, system images in Packer, and application images in Docker.
  • Automate and simplify all aspects of systems delivery and Identify root causes of incidents and implement learnings.
  • Work with development teams to design, engineer and implementation of complex infrastructure systems using modern Infrastructure as Code (IaC) frameworks.
     

Required Skills:
 

  • BS degree in Computer Science or equivalent experience.
  • 2+ years experience supporting Kubernetes Infrastructure. CKA or CKAD certifications a plus
  • Proven skills with Linux or UNIX systems and related protocols/software with 3+ years' experience.
  • Familiarity or working knowledge of public cloud patterns (AWS/EKS, Azure/AKS); container tools (Kubernetes, Docker); pipeline tools (Jenkins, Gitlab, Ansible, Chef, Puppet, Terraform); logging and monitoring (Prometheus, Kibana, Splunk, Dynatrace); scripting (Python, Bash).
  • Department-wide public speaking and other communications conveying application level directions

Organization Primary Talent Partners
Industry Engineering Jobs
Occupational Category Site Reliability Engineer
Job Location New York,USA
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Intermediate
Experience 2 Years
Posted at 2024-03-08 6:03 am
Expires on 2024-12-30