Site Reliability Engineer

 

Description:

Corporate Banking is the division consisting of many domain and sub-domains like Trade management, Investor space, Trust and Agency of securities, client positions management, custodian business, Cash/wire payments (both internal and external via Swift, crest, and Mint), Channel’s space, messaging billing and Trade Finance and Lending area. Corporate banking applications service the needs of business areas spread across all APAC, EMEA, and US regions, in addition to the production support Engineering team spread globally. You will be operating within the Production services team of cash management domain, which is a subdivision of Corporate Bank, as an SRE within the production support team. In this role, you will be accountable to drive a culture of proactive continual improvement into the Production environment. You’ll do this through application, user request support, troubleshooting, and resolving the errors in production environment, automation of manual work, as well as monitoring improvements and platform hygiene.

What We Offer You

  • A diverse and inclusive environment that embraces change, innovation, and collaboration

  • A hybrid working model, allowing for in-office / work from home flexibility, generous vacation, personal and volunteer days

  • Employee Resource Groups support an inclusive workplace for everyone and promote community engagement

  • Competitive compensation packages including health and wellbeing benefits, retirement savings plans, parental leave, and family building benefits

  • Educational resources, matching gift, and volunteer programs

 

What You’ll Do

  • Manage a set of Payment processing and Client Access channel applications

  • Understand Service-Level Agreements (SLAs), Service Level Objectives (SLOs) and Operational-Level Agreements (OLAs) to ensure service levels match client expectations and manage error budget

  • Identify risks and issues related to the area and define appropriate automated solutions, ensuring that the right problem-solving techniques and processes are applied by the team and vendor partners

  • Build and deploy automation around causation and correlation of events in our infrastructure

  • Provide strict production gatekeeper function by reviewing proposed solutions ensuring proper standards are adhered to

  • Update Runbooks and Known Error Database (KEDB) when required, and participate in all Business Continuity Planning (BCP) and component failure tests based on the run books

Skills You’ll Need

  • Experience with Service Operations (within a global operations context) and Information Technology (IT) in large corporate environments (specifically in controlled production environments in Financial Services Technology, FinTech, or a client-facing function)

  • Proficiency in operating infrastructure on any public cloud (e.g., Google Cloud Platform [GCP]) as an SRE/DevOps

  • Understanding of Structured Query Language (SQL) Server or Oracle databases, Linux OS, and RedHat OpenShift platform

  • Experience with monitoring tools like Geneos and Splunk

Organization Deutsche Bank
Industry IT / Telecom / Software Jobs
Occupational Category Site Reliability Engineer
Job Location New York,USA
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Intermediate
Experience 2 Years
Posted at 2023-07-20 9:05 am
Expires on 2024-10-19