About Northern Trust:
Northern Trust, a Fortune 500 company, is a globally recognized, award-winning financial institution that has been in continuous operation since 1889.
Northern Trust is proud to provide innovative financial services and guidance to the world’s most successful individuals, families, and institutions by remaining true to our enduring principles of service, expertise, and integrity. With more than 130 years of financial experience and over 22,000 partners, we serve the world’s most sophisticated clients using leading technology and exceptional service.
Northern Trust is seeking an experienced Sr Principal Site Reliability Engineer with a strong focus on developing observability and automation. This role will play a pivotal part in ensuring the reliability and performance of the company’s systems and services. As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key observability services with a deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with cross-functional teams to improve the efficiency of our services. Your expertise in both software engineering and system operations will enable our partners to drive continuous improvements in our platform’s reliability. This role will focus on bringing complete observability across all technologies.
This role will be responsible for a number of key functions that both support and drive improvements to the reliability of Northern Trust’s IT Landscape.
What you will do:
• System Design and Architecture: Lead the design and architecture of critical, complex systems to ensure their reliability, scalability, and performance.
• Operational Excellence: Develop and maintain automation scripts and tools to streamline operations and reduce manual tasks. Oversee system performance transparency.
• Incident Response/Root Cause Analysis: Collaborate on root cause analysis and implement measures to prevent recurrence of issues.
• Monitoring and Observability: Design and implement comprehensive monitoring and observability solutions to proactively detect and address issues before they impact our business.
• Develop and maintain dashboards and alerts to provide real-time insights into system health.
• Reliability Improvements: Identify opportunities for improving system reliability through process enhancements and technical solutions.
• Documentation and Communication: Create and maintain detailed documentation of systems, processes, and procedures.
• Communicate effectively with stakeholders across different teams and levels within the organization.
• Project Management/Collaboration: Manage and prioritize multiple projects and initiatives related to reliability and performance improvements.
• Collaborate with product, development, and operations teams to align SRE efforts with overarching business goals.
You possess:
Qualifications:
• Bachelor's degree or equivalent experience
• 10+ years in systems engineering with a focus on reliability, systems operations, and software engineering
• 5+ years as a team lead or hands-on technical manager who can engage with projects and deliver them to completion
• Strong proficiency in programming languages such as Python, Go, Ruby, or Java
• Experience with both on-prem and cloud solutions
• Experience with containerization
• Demonstrated ability to design and implement systems that ensure observability with associated dashboards
• Deep understanding of distributed systems, networking, and modern software architectures
• Excellent problem-solving skills and ability to handle complex technical challenges
• Strong dedication to customer needs, with excellent communication skills, the ability to build lasting relationships, and the capability to articulate complex reliability strategies in a clear and impactful manner
• Prior experience delivering Infrastructure as Code via a CI/CD pipeline
• Proven experience in leading and mentoring technical teams
• Skilled in implementing automation for corrective action based on deployed observability solutions
• Practical experience operating in an Agile development environment
Working with Us:
As a Northern Trust partner, greater achievements await. You will be part of a flexible and collaborative work culture in an organization where financial strength and stability are assets that embolden us to explore new ideas.
Movement within the organization is encouraged, senior leaders are accessible, and you can take pride in working for a company committed to assisting the communities we serve! Join a workplace with a greater purpose.
We’d love to learn more about how your interests and experience could be a fit with one of the world’s most admired and sustainable companies! Build your career with us and apply today. #MadeForGreater
Reasonable accommodation
Northern Trust is committed to working with and providing reasonable accommodations to individuals with disabilities. If you need a reasonable accommodation for any part of the employment process, please email our HR Service Center at .
We hope you’re excited about the role and the opportunity to work with us. We value an inclusive workplace and understand flexibility means different things to different people.
Apply today and talk to us about your flexible working requirements and together we can achieve greater.