Senior Manager, Systems Engineering with TS/SCI clearance (on site Northern Virginia)

Company:  Salesforce, Inc.
Location: Herndon
Closing Date: 10/11/2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

Senior Manager, Systems Engineering

*This is a customer-facing role and will require you to be on-site in Northern Virginia. This is NOT a remote position.*

Salesforce is seeking an Engineering Leader to join the Site Reliability organization! Working closely with counterparts in the Infrastructure and R&D organizations, this organization provides a distributed team of engineers monitoring cloud service availability and is ready to swiftly repair any service-impacting issues. Seven days a week, 24 hours a day, in a follow-the-sun model, the Site Reliability team keeps the Salesforce cloud and our customers protected.

As a member of the Site Reliability team, you will lead efforts for detecting and resolving incidents within minutes. This objective is met by monitoring the services, reacting to problems, and proactively addressing issues before they affect performance or availability.

The team is responsible for fire prevention through monitoring, automation, self-healing, and resiliency initiatives, destructive testing, and game day exercises. The incumbent in this role would demonstrate a strong focus on tactical operations, as well as large-scale production engineering and orchestration.

PLEASE NOTE: Qualification for this job is contingent upon acceptable results from a background investigation as well as your having and maintaining the specific level of U.S. government background investigation and clearance required for this role.

Role Description:

  • Keep the customer-facing services available at top performance by maintaining the constant health of the supporting systems.
  • Incident management - Act in key support roles during major incidents e.g. Sev0, Sev1. Also, participate in the technical review of the incident for problem management.
  • Problem Management - populate and participate in RCAs and hand them off to the Global Solutions team.
  • Ensuring that work carried out by the Site Reliability team is executed in such a way as to align with the company’s internal compliance policy and directives.
  • Being available to discuss and resolve technical issues and customer concerns with other technical staff as the need arises.
  • Work with and lead other members of the team in staying on top of key industry innovation and technology, and assist in team development growth.
  • Identifying work opportunities and preparing or assisting with the preparation of technical proposals as the need arises.
  • Ability to operate in a fast-paced environment and solve sophisticated issues while optimally prioritizing multiple priorities.
  • Work to automate detection and resolution of recurring issues in the production environment.

Minimum Requirements:

  • Active TS/SCI clearance with polygraph required.
  • A related 4-year technical degree required.
  • 4+ years proven experience managing a team of Engineers and Site Reliability Engineers.
  • Systems engineering experience in enterprise scale internet service engineering or support role.
  • Expertise in TCP/IP related technologies (networking protocols, network programming, etc.).
  • Expertise in CLI enterprise support of Unix variants (Linux/Solaris/BSD) as well as strong Linux/UNIX knowledge with significant exposure to Red Hat Enterprise Linux and Solaris.
  • Strong understanding of monitoring implementations and administration.
  • Strong interpersonal skills (Written and Oral).
  • Hands-on experience configuring and running AWS (Amazon Web Services), using the CLI/SDKs.
  • Experience running systems monitoring and alerts.
  • Past experience in Incident Management and good understanding of ITIL service operations.
  • Experience in working in a 24/7/365 ops center leading large teams.
  • Experience working within the Intelligence Community (IC).

Preferred Qualifications:

  • BS or higher degree preferred in Computer Science or Electrical Engineering plus relevant job-related experience.
  • Perl/Python/BASH scripting experience.
  • Prior Chef/Puppet or automated deployment experience.
  • Experience in supporting and maintaining monitoring and alert systems.
  • Experience supporting and solving problems with relational databases and distributed platforms.
  • Experience in supporting and maintaining Java applications.
  • Experience in Docker orchestration and management.
  • Experience with JVM optimization and Java server technologies like Tomcat or Jetty.

Qualification for this job is contingent upon acceptable results from a background investigation as well as your obtaining and maintaining the specific level of U.S. Government security clearance required for this role. U.S. citizenship (U.S. born or naturalized) required.

#J-18808-Ljbffr
Apply Now
Share this job
Salesforce, Inc.
  • Similar Jobs

  • Lead Systems Engineer - TS/SCI Clearance

    Chantilly
    View Job
  • Lead Systems Engineer - TS/SCI Clearance

    Chantilly
    View Job
  • Senior DevOps Engineer (TS/SCI) with Security Clearance

    Chantilly
    View Job
  • Software Engineer (TS/SCI) with Security Clearance

    Herndon
    View Job
  • Systems Engineer (SE) - TS/SCI with Poly Clearance Required - GNRC

    Chantilly
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙