Systems Reliability Engineer

Company:  mthree Recruiting Portal
Location: New York
Closing Date: 07/11/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description
SRE - Leading Investment Bank
Market leading investment bank requires a Systems Reliability Engineer join their Reliability & Production Engineering department. This role supports Institutional Securities and Wealth Management brokerage Operations platforms which include diverse technologies hosted by on premises and cloud platforms. The role is expected to perform day to day support for the business alongside reliability engineering tasks. The role has an emphasis on improving the reliability of our systems by working with the Software developers and Infrastructure engineering teams to develop automated reliability solutions.
Responsibilities:
  • You will spend time on production management, inclusive of: incident and problem management, capacity management, monitoring, event management, change management, and plant hygiene.
  • Troubleshooting issues across the entire technology stack: hardware, software, application, and network.
  • Participating in on-call rotation and periodic conference calls with other specialists from other time zones.
  • Proactively identifying and addressing system reliability risks.
  • Working closely with development teams to design, build, and maintain systems from a reliability, stability, and resiliency perspective.
  • Identifying and driving opportunities to improve automation for our platforms; scope and create automation for deployment, management, and visibility of our services.
  • Representing the RPE organization in design reviews and operational readiness exercises for new and existing products/services.
Experience :
  • Demonstrated ability to troubleshoot problems and debug to identify root cause on large-scale distributed applications across multiple layers, i.e. software, Infrastructure and database.
  • Hands on experience on enterprise tools such as Prometheus, Grafana, Splunk, Apica
  • Hands-on experience of UNIX / Linux system support and Cloud based services.
  • Experience with Ansible, GitHub or any automation/configuration/release management tools
  • Automation-related experience is particularly valued using scripting languages such as python, bash, perl, ruby. One higher level language is desired.
  • Creating stored procedures and optimising SQL in Sybase or DB2.
  • Experience of Azure Networks, ServiceBus, Azure Virtual Machines and AzureSQL will be an advantage.

SRE - Leading Investment Bank
Apply Now
Share this job
mthree Recruiting Portal
  • Similar Jobs

  • Site Reliability Engineer

    New York
    View Job
  • Site Reliability Engineer

    Secaucus
    View Job
  • Reliability & Equipment Engineer

    Brooklyn
    View Job
  • Site Reliability Engineer

    New York
    View Job
  • Electrical Reliability Engineer

    Jersey City
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙