Site Reliability Engineer

Description:

Corporate Banking is a division spanning many domains and sub-domains, including Trade management, Investor space, Trust and Agency of securities, client positions management, custodian business, Cash/wire payments (both internal and external via SWIFT, CREST, and Mint), Channels space, messaging billing, and the Trade Finance and Lending area. Corporate banking applications serve business areas across the APAC, EMEA, and US regions, and the production support engineering team is likewise spread globally. You will work as an SRE in the Production services team of the cash management domain, a subdivision of the Corporate Bank. In this role, you will be accountable for driving a culture of proactive continual improvement in the Production environment. You’ll do this through application and user request support, troubleshooting and resolving errors in the production environment, automating manual work, and improving monitoring and platform hygiene.

What We Offer You

A diverse and inclusive environment that embraces change, innovation, and collaboration
A hybrid working model that allows in-office and work-from-home flexibility, plus generous vacation, personal, and volunteer days
Employee Resource Groups support an inclusive workplace for everyone and promote community engagement
Competitive compensation packages including health and wellbeing benefits, retirement savings plans, parental leave, and family building benefits
Educational resources, matching gift, and volunteer programs

What You’ll Do

Manage a set of Payment processing and Client Access channel applications
Understand Service-Level Agreements (SLAs), Service-Level Objectives (SLOs), and Operational-Level Agreements (OLAs) to ensure service levels match client expectations, and manage the error budget
Identify risks and issues related to the area and define appropriate automated solutions, ensuring that the right problem-solving techniques and processes are applied by the team and vendor partners
Build and deploy automation around causation and correlation of events in our infrastructure
Provide a strict production gatekeeper function by reviewing proposed solutions to ensure proper standards are adhered to
Update runbooks and the Known Error Database (KEDB) when required, and participate in all Business Continuity Planning (BCP) and component failure tests based on the runbooks

Skills You’ll Need

Experience with Service Operations (within a global operations context) and Information Technology (IT) in large corporate environments (specifically in controlled production environments in Financial Services Technology, FinTech, or a client-facing function)
Proficiency in operating infrastructure on any public cloud (e.g., Google Cloud Platform [GCP]) as an SRE/DevOps engineer
Understanding of SQL Server or Oracle databases, Linux OS, and the Red Hat OpenShift platform
Experience with monitoring tools like Geneos and Splunk

Organization	Deutsche Bank
Industry	IT / Telecom / Software Jobs
Occupational Category	Site Reliability Engineer
Job Location	New York, USA
Shift Type	Morning
Job Type	Full Time
Gender	No Preference
Career Level	Intermediate
Experience	2 Years
Posted at	2023-07-20 9:05 am
Expires on	2025-02-05