Technical Leader - Site Reliability Engineer (SRE) | FedRAMP

Company:  Cisco
Location: Raleigh
Closing Date: 19/10/2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

The successful applicant will be performing work in U.S. Government classified environments and therefore must be a U.S. Person (i.e., U.S. citizen, U.S. national, lawful permanent resident, asylee, or refugee). This position may also perform work that the U.S. government has specified can only be performed by a U.S. citizen on U.S. soil.


Who We Are

#WeAreCisco and we are so happy you're thinking of joining us. Follow us on social media @WeAreCisco to hear from employees about why we love where we work, or check Cisco out on Glassdoor for the latest reviews.

Think back on the latest significant internet outages and how they disrupted everyday life – even a few hours of downtime can halt commerce and cut people off from family and friends. If our sites go down that long, our customers are left defenseless. Who's there to make sure that doesn't happen? That's you.

As a Site Reliability Engineer Technical Leader, you will be the hero behind the scenes preventing similar catastrophes for Cisco's TD&R services.

  • Secure Cisco customers and keep them protected from threats.
  • Solve cloud infrastructure challenges through software engineering approaches using innovative technologies.
  • Architect and maintain secure, highly resilient, and highly available cloud infrastructures to host SPR services.
  • Design release pipelines that meet the desired velocity for feature releases.

You'll be part of a team that solves challenges using AWS technologies and engineers resilient, robust cloud infrastructure using innovative principles. As a result, you can recover quickly from failures, minimize customer impact, thrive on innovation, and make decisions backed by data.

What You'll Do

  • Provide strong technical leadership and mentorship to a team of SREs and DevOps engineers.
  • Set and maintain high standards for infrastructure reliability, security, and performance in compliance with FedRAMP requirements.
  • Define and implement best practices for SRE and DevOps processes, including CI/CD pipelines, infrastructure as code, monitoring, and alerting.
  • Develop and execute a roadmap for optimizing the digital infrastructure in alignment with FedRAMP compliance.
  • Ensure that all aspects of our infrastructure meet and maintain FedRAMP compliance standards.
  • Stay abreast of the latest FedRAMP guidelines and integrate them into our operational practices.
  • Promote the use of automation tools to streamline deployment, configuration, and management of cloud resources.
  • Implement infrastructure as code practices to enable versioning and reproducibility of our environments.
  • Lead incident response efforts and post-incident reviews to continuously improve system reliability and availability.
  • Implement and manage robust monitoring, alerting, and logging solutions to proactively identify and address potential issues.
  • Work closely with multi-functional teams, including development, security, and compliance, to ensure smooth deployment and operation of services.
  • Communicate effectively with stakeholders to provide updates on infrastructure status, performance, and compliance.
  • Demonstrate in-depth expertise in the administration of enterprise-grade AWS infrastructures, ensuring seamless operation, optimal performance, and robust security measures.

Who You Are

To be successful in this role, you'd be a role model who exemplifies our culture and is often appreciated for living our principles. You are a leader who thrives in a sophisticated world yet begs for simplicity. You shine when collaborating with application development teams on designing cloud infrastructure, and you find fulfillment in writing infrastructure-as-code. With your persistent curiosity about improving build and release frameworks, you're comfortable troubleshooting and resolving production incidents.

Basic Qualifications:

  • 5+ years of experience in an SRE and/or DevOps role with a focus on cloud-based environments.
  • 1+ years of experience architecting cloud solutions.
  • Ability to participate in a 24/7/365 on-call rotation.

Preferred Qualifications:

  • 8+ years of experience in an SRE and/or DevOps role.
  • Proven experience working with Infrastructure as Code (IaC) and tools like Terraform.
  • Knowledge of or experience working in security, compliance, or regulated environments.
  • Expertise in triaging, troubleshooting, and addressing production problems in every layer of the stack.
  • In-depth expertise in the administration of enterprise-grade AWS infrastructure.
  • Experience in capacity and business continuity planning.
  • Strong security, networking, and Linux systems administration skills.
  • Strong scripting skills in Golang, Python, and Bash.
  • Experience in monitoring and analyzing infrastructure using tools such as DataDog and CloudWatch.
  • Experience or familiarity with tools like Vault, Packer and Docker.
  • Experience with CI/CD tools like Jenkins.
  • Ability to prioritize tasks, work independently, and escalate exceptions promptly.

Our Team

Our versatile team provides a great deal of autonomy but expects clear accountability. We thrive on exploring new and innovative ideas but are rooted in making decisions based on quantitative data. We're also in a fortunate position where our People Leaders can act as Technical Leaders and vice versa.

We love a good career growth story! Team members previously in this position are now Technical Leaders or People Leaders responsible for the cloud infrastructure of entire product offerings within the TD&R portfolio.

"We pride ourselves on being the outstanding engineering organization with humility. We are agile and pragmatic. We enrich our people to strive for the best, enabling them to simplify complex problems." – Sandip K.

"We use groundbreaking technologies, so we always get a chance to learn and apply improvements to cloud infrastructures. I like our TD&R Ops team culture, flexibility and the direction we are heading." – Gayan J.

Why Cisco Secure

We're global, we're adaptable, we're diverse, and our security portfolio is as extensive as it is groundbreaking. Have you heard of Threat Detection & Response, Zero Trust by Duo, Common Services Engineering, or Cloud & Network Security? Those are only a few of our product teams! The only thing we're missing is YOU.

Join an enterprise security leader with a start-up culture, committed to driving innovation and giving you the opportunity to make an impact. We #InnovateToWin and we know we're better together. That's why we're dedicated to inclusivity, collaboration, and diversity in everything we do.

We're proud to be the Best Small and Mid-Size Enterprises Security Solution. Cisco Secure continues to grow and evolve year after year, with 100% of Fortune 100 companies using our products, and we're excited to see the new heights we'll reach with your passion for security, your customer focus, and your desire to change things up!

There are so many amazing reasons to join Cisco. Learn more here!
