Observability SRE

Company:  Nomura Holdings, Inc.
Location: New York
Closing Date: 05/11/2024
Salary: $120,000 - $145,000 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

Department: Group Platform Services & Engineering / Observability

Location: New York, NY

The pay range for this position at commencement of employment is expected to be between $120,000 and $145,000/year*

Company Overview

Nomura is a global financial services group with an integrated network spanning approximately 30 countries and regions. By connecting markets East & West, Nomura services the needs of individuals, institutions, corporates and governments through its three business divisions: Wealth Management, Investment Management, and Wholesale (Global Markets and Investment Banking). Founded in 1925, the firm is built on a tradition of disciplined entrepreneurship, serving clients with creative solutions and considered thought leadership. For further information about Nomura, visit

Job/Group Overview

The Observability SRE role sits within the Group Platform Services & Engineering division, which provides common services to Development, Infrastructure and Production Services across the Nomura Group. This is a technical position responsible for the support, operation, enhancement and integration of the Observability platform. The successful candidate will have a vital role in shaping future Observability strategy and direction within the Nomura Group.

This is a fantastic opportunity for somebody with 3+ years’ experience to work with state-of-the-art technologies to deliver industry-leading solutions in the Telemetry, Observability and Monitoring space (known internally as TOM). The successful candidate will join a team of enthusiastic, forward-thinking SREs and engineers who are working to radically transform how the Nomura Group manages the operation of its estate. This will evolve into a full AIOps capability with automated anomaly detection, automated impact and root cause analysis, and machine-learning-generated resolutions. This is a global team of 20 members bringing change across the organization.

The individual will be part of our US-based Observability SRE team and will be responsible for the support and operational engineering of the various tools and technologies that make up our Observability suite. The candidate will work closely with peers in other regions, as well as with other development teams, to advance the team’s strategic objectives.

The Observability platform consists of vendor, open-source and in-house tools. These include Grafana UI, Loki, Mimir, Tempo, Sloth, RightITNow, Everbridge and OpenTelemetry. Experience with Observability principles and the Grafana toolset is a must. Our solutions are deployed on a Linux backend, so intermediate knowledge of Linux is also a must. We also expect the candidate to have some development and engineering experience.

The candidate must be a quick learner, able to understand and support a complex production ecosystem. To that end, they should be familiar with best practices for managing releases, changes, incidents and requests. We expect the individual to draw on their skills and production support experience to solve problems while minimizing impact to production.

We are looking for an individual who can be innovative and find opportunities to improve, whether through process improvement or automation. They need to be self-motivated, able to motivate others, and a driver of change. They must also be a team player who fosters a healthy, constructive team environment in which everyone is supportive and respectful of others’ opinions.

Responsibilities

  • Supporting the Observability tools, including Grafana UI, Loki, Mimir and Tempo
  • Supporting a large user base as they transition to modern observability tools and adopt OpenTelemetry
  • Managing updates, releases and testing in both production and non-production environments
  • Driving adoption of best practices in Observability across the organization
  • Contributing to Observability standards and procedures
  • Reporting functionally to the head of the Observability SRE team

Requirements

  • Experience in supporting large enterprise systems (3+ years)
  • Passionate about providing high-quality deliverables and learning new technologies
  • Strong and confident communicator with good interpersonal skills
  • Experience of collaboration tools such as Confluence / JIRA / Microsoft Teams & Office 365
  • Able to take the initiative to investigate and follow-up with various stakeholders to resolve issues
  • Solid understanding of release, deployment, and change management processes
  • Self-motivated individual, quality and improvement focused
  • Self-starter and able to self-manage
  • Must be able to take initiative to keep own skills up to date and to maintain awareness of current technology developments
  • Good team player, able to work on a local, regional and global basis and as part of joint cross-location initiatives

Preferred

  • SRE experience
  • Experience with Observability tools such as OpenTelemetry, Grafana UI, Mimir, Loki, Tempo, Grafana Agent and Prometheus
  • Continuous Integration / Deployment via DevOps Tools such as GitLab, Jenkins, Ansible, Nexus etc.
  • Supporting a medium / large scale production environment
  • Knowledge of ITIL
  • Sound understanding of DB platforms (Sybase / MySQL / MSSQL), general RDBMS concepts and SQL
  • Knowledge of operating system fundamentals including monitoring of IO, Networks, CPU and Memory
  • Basic knowledge of / familiarity with other infrastructure technologies such as Middleware, Web servers, Load balancers, System Services etc.

*base pay offered may vary depending on multiple individualized factors, including market location, corporate and functional title and duties, job-related knowledge and advanced degrees, skills, and experience. The total compensation package for this position may also include other elements, including a sign-on bonus, restricted stock units, and discretionary awards in addition to a full range of medical, financial, and/or other benefits (including 401(k) eligibility and various paid time off benefits, such as vacation, sick time, and parental leave), dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.

If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.

Nomura is an Equal Opportunity Employer

Nearest Major Market: Manhattan
Nearest Secondary Market: New York City
