Site Reliability Engineer (Contract | Bank)

Location Singapore
Discipline Information & Communications Technology
Job Reference BBBH125666_1680159643
Salary S$5000 - S$7500 per month
Consultant Name Cabria Jireli Gem Mejia
Consultant Email gem.cabria@manpower.com.sg
Consultant Contact No. 62328815
EA License No. 02C3423
Consultant Registration No. R1434374


The Bank is looking for a Platform Site Reliability Engineer with experience working on enterprise level data engineering, analytics, and observability applications. The SRE would be responsible for ensuring high availability of the platform services and perform continuous improvements to increase the platform's efficiency and resiliency. The SRE will also perform automation development tasks to remove toil and increase the team's productivity.

Roles and Responsibilities

  • Implement and maintain highly resilient, highly available data engineering, monitoring and analytics application clusters. Perform production support for the platform.
  • Set up and operate the server infrastructure and software (Linux, Elasticsearch, Logstash, Grafana, Kibana, Kafka, Nginx) based on bank's security standards and industry's security standards
  • Perform continuous improvement for the platform covering areas such as: capacity planning, observability, monitoring, reliability, and resiliency.
  • Design and develop data engineering pipelines.
  • Automate repetitive tasks, optimize processes, and perform thorough testing to ensure quality.
  • Create and maintain software documentation for the platform.
  • Perform system maintenance, patching and upgrades.


Education and Relevant Experience

  • Degree in Computer Science or related field
  • Minimum 4 years of relevant working experience
  • Knowledgeable and have hand on experience in system administration or system software support
  • Knowledge and experience on operating and support one or more of these software - Linux, Elasticsearch, Logstash, Grafana, Kibana, Kafka, Nginx
  • Good knowledge and experience in Unix/Linux/Shell/Python scripting.
  • Excellent communication skills and ability to explain protocol and processes with team and management
  • A passion for learning and using new technologies in the open-source communities.
  • A passion for coding or scripting.
  • Additional knowledge and experience for 1 or more areas below would be good to have
    • SRE (Site Reliability Engineering) practices covering monitoring, observability, performance management, automation, and resiliency.
    • Object Oriented Programming, web application development, NodeJS, Spring boot and Kafka
    • Automation tools (e.g. Ansible, Chef, Puppet etc.) & DevOps pipelines
    • Experience in data ingestion (extraction, cleansing and parsing) and data analytics


Interested candidates may send in their resume and cover letter directly to gem.cabria@manpower.com.sg (R1434374), stating the position as the subject title in the email.

Jireli Gem Mejia Cabria EA License No.: 02C3423 Personnel Registration No.: R1434374

Please note that your response to this advertisement and communications with us pursuant to this advertisement will constitute informed consent to the collection, use and/or disclosure of personal data by ManpowerGroup Singapore for the purpose of carrying out its business, in compliance with the relevant provisions of the Personal Data Protection Act 2012. To learn more about ManpowerGroup's Global Privacy Policy, please visit https://www.manpower.com.sg/privacy-policy