Manager - Automation and Observability Engineer

Company:  Clairvoyant LLC
Location: Chicago
Closing Date: 05/11/2024
Salary: £150 - £200 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

Manager - Automation and Observability Engineer

Chicago, IL, USA Req #27026

Tuesday, September 3, 2024

Company Overview and Culture

EXL (NASDAQ: EXLS) is a global analytics and digital solutions company that partners with clients to improve business outcomes and unlock growth. Bringing together deep domain expertise with robust data, powerful analytics, cloud, and AI, we create agile, scalable solutions and execute complex operations for the world’s leading corporations in industries including insurance, healthcare, banking and financial services, media, and retail, among others. Focused on creating value from data for driving faster decision-making and transforming operating models, EXL was founded on the core values of innovation, collaboration, excellence, integrity and respect. Headquartered in New York, our team is over 40,000 strong, with more than 50 offices spanning six continents.

Job Description:

Automation and Observability Engineer

The role will work with senior leaders to deliver on complex, enterprise-level initiatives that are a part of the bank's overall strategic direction. The Technical Leader will drive key Infrastructure Services strategies for self-service infrastructure, self-healing for infrastructure break fix, fully automated governance, and even drive towards a future state of immutable infrastructure. They will cross and merge the barriers that exist between software development, testing and operations teams and keep existing networks in mind as they design, plan and test. The role will help to design and develop infrastructure code to support continuous delivery and continuous integration processes.

Responsibilities:

  • Responsible for delivery and implementation of automation solution for the platform.
  • Collaborate with operations & engineering teams, application developers, management and infrastructure teams to assess near- and long-term automation solution.
  • Implement, maintain, and consult on the observability and monitoring framework that supports the needs of multiple internal stakeholders.
  • Work with application teams for Observability setup for their applications and Infrastructure that will include Dashboards, Visualisation, monitoring and provide consultancy on Self-Healing solutions for Applications.
  • Jenkins – Automation server, with plugins built for developing CI/CD pipelines.
  • Ansible – Configuration Management and Deployment.
  • Build a practice of performance and tracing using Observability tools like Splunk, AppDynamics and ThousandEyes.
  • Engineer solutions and establish standards for Splunk/ThousandEyes/AppDynamics functional components and specifically agent deployments, including optimizations and application tuning and instrumentation per requirements.
  • Seek opportunities through scripting automated deployments to reduce operational tasks. Seek opportunities for integration of Splunk with other monitoring tools.
  • Effectively communicate tool capabilities and processes to varying stakeholders.
  • Assist in scheduling and hosting regular tool training sessions to better enable tool adoption and best practices.
  • Provide input on improving the global operating model for monitoring and observability services.
  • Continue evolving monitoring tooling toward a standards-based self-service automated platform.
  • Adhere to HSBC policy, procedures and control requirements applicable to day-to-day working, exceptional and project activities, and raise any concerns about actual or potential issues promptly, in line with reporting and escalation procedures.
  • Apply policies, procedures, practices and standards to their allocated tasks, taking responsibility for their own actions, to ensure the achievement of high levels of quality, effective risk management and regulatory compliance.

Qualifications:

Key Requirements

  • Over 5 years’ experience working experience on GitHub, Jenkins, Ansible.
  • Over 10 years’ experience working experience on Linux Fundamentals and Scripting, PowerShell and Python.
  • Over 2 years’ experience working experience on Splunk/AppDynamics/Thousand Eyes.
  • Knowledge of REST API and ability to develop Monitoring Extensions is a strong plus.
  • 2 years of Application Development Experience (Java) at an enterprise level is a plus.
  • 2 years of Experience with a range of architecture tech stacks including Java app servers, Web Servers, Golang, Kubernetes, OpenShift, PCF, AWS, Google Cloud is desirable.
  • Experience of using Service Now, Confluence, Jira is preferred.
  • Knowledge of Event Management tools and of Operations Automation like AIOPs is desirable.
  • Previous experience of defining, creating, and supporting monitoring dashboards.
  • Experience with monitoring and observability solutions and methodologies including server and network performance, hardware, web synthetics, and application performance monitoring. This experience also needs to be for Unix and Oracle databases.
  • Experience implementing and adjusting tools and methodology for monitoring and observability products such as; Splunk, AppDynamics, ThousandEyes, Elasticsearch, Grafana, Prometheus.
  • Possess practical knowledge and appreciation of various aspects of distributed service design, including messaging protocols, caching strategies and autonomous software design practices.
  • Solid understanding of application performance metrics, KPIs, statistical calculations, machine learning, and correlation.
  • 2 years working with APM in deployment for mission critical applications.
  • Ability to work independently, multi-task, and take ownership of various parts of a project or initiative.
  • Possess strong interpersonal and communication skills to be able to deal with and form good relationships with the business and other technology groups through day-to-day support and project work.
  • Understanding of end-to-end business/functional processes with industry vertical/sub vertical to be able to translate business requirements into system requirements and perform impact analysis of changes in requirements.
  • Technical writing experience in relevant areas, including queries, reports, and presentations.
#J-18808-Ljbffr
Apply Now
Share this job
Clairvoyant LLC
  • Similar Jobs

  • Observability Engineer

    Chicago
    View Job
  • Observability Engineer

    Chicago
    View Job
  • Observability Engineer

    Chicago
    View Job
  • Observability Engineer

    Chicago
    View Job
  • Observability Engineer

    Chicago
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙