VP / AVP, Site Reliability Engineer, Group Consumer Banking and Big Data Analytics Technology, Technology & Operations

DBS Bank
Full-time
On-site
Singapore, Singapore
IT

Business Function

Group Technology and Operations (T&O) enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group T&O, we manage the majority of the Bank's operational processes and aspire to delight our business partners through our multiple banking delivery channels.

Job Summary

Site Reliability Engineering (SRE) at DBS combines software and systems engineering to build, run, and maintain high-performance, distributed, fault-tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey.

As a Site Reliability Engineer, you will fill a mission-critical role, ensuring that our systems are healthy, monitored, automated, fault-tolerant and designed to scale.

You will work closely with engineering teams to continually improve our production services, facilitate fast delivery of new products, and reduce downtime.

Key Responsibilities:

  • Drive the Site Reliability Engineering agenda to improve the availability, reliability, and performance of services
  • Drive observability for our applications
  • Drive "optimise operate" initiatives, for example reducing operational toil
  • Work with application teams to set up SLIs, SLOs and error budgets for their applications
  • Work with the enterprise team to deploy SRE enablers and initiatives

Requirements:

  • Key Skills: Unix, Wintel, Apache, JBoss, IBM WebSphere, IBM IHS, MQ administration, OpenShift & AWS administration
  • Secondary Skills: DB administration, Network (DNS, Firewall, GTM/LTM, VLAN)
  • Expert-level knowledge of different operating systems (AIX, Linux, Wintel, Solaris) for BAU support, upgrades & maintenance
  • Knowledge of OS security & hardening
  • Knowledge of, or hands-on experience with, patch management
  • In-depth knowledge of LVM, SAN allocation, file system extension, and creation of new file systems in clustered and non-clustered environments
  • ESXi, vSphere systems administration and support including vMotion, HA, DRS, vCenter Operations Manager, vCenter Service Manager, vCenter Configuration Manager, Site Recovery Manager
  • Administering cloud-based and OpenShift-based infrastructure deployments. Administration tasks include provisioning and de-provisioning of resources
  • Support audits, infrastructure and network security scans, Disaster Recovery drills and security-related drills
  • Capacity review & performance management across all platform systems
  • Knowledge of middleware components such as JBoss, Apache, WebSphere Application Server & MQ
  • Knowledge of the SSL certificate procurement and renewal process
  • Knowledge of MariaDB, Oracle & DB2 databases: backups, DB restarts, access issues, and DB upgrade support
  • Very good understanding of SAN configuration (EMC/Hitachi LUNs) on UNIX (AIX/Solaris/Linux) servers
  • Manage Firewall, GTM & LTM configuration requests
  • Ability to develop simple to complex shell scripts to meet requirements and automate tasks
  • Effective in dealing with crisis calls / critical issues for business-critical services
  • Proven experience in providing technical guidance to teams in a productivity-driven environment
  • Experience in at least two areas of IT infrastructure support, i.e. Production Support, Application Support & Infrastructure Support
  • Willingness to explore, learn and deploy new technologies that help the company reduce cost or improve operational efficiency
  • Excellent troubleshooting and analytical skills
  • Strong communication and interpersonal skills
  • Able to work across cultures and to support 24x7 operations

We offer a competitive salary and benefits package and the professional advantages of a dynamic environment that supports your development and recognises your achievements.
