The Cloud AI & Advanced Systems Engineering (CAASE) is responsible for expanding Microsoft’s Cloud Infrastructure to enable Microsoft’s mission to empower every person and every organization on the planet to achieve more. The CAASE team is instrumental in delivering world-class and innovative hardware at scale to ensure a high-quality experience for the millions of Microsoft Azure customers.
We are looking for a Principal Development Operations Engineer to join the team.
As a Principal Development Operations Engineer, you will collaborate with several functional teams and lead the deployment, integration, and operation of cutting-edge AI hardware into the Microsoft Azure data center and lab environments. This is an excellent opportunity to build the AI infrastructure of tomorrow!
Ready to join an exciting and dynamic team? We want to hear from you!
Responsibilities
- Monitor and manage compute, storage, network, and AI hardware within Microsoft Azure’s lab and data center environments.
- System administration on Windows-based servers including deploying software packages, installing virtual machines, running validation scripts, and hardware break/fix.
- Use tools such as Continuous Integration and Continuous Deployment (CI/CD), source control, Infrastructure as Code (IaaC), bug tracking, automated testing, and deployment tools to manage the hardware and software infrastructure.
- Develop automation to deploy, monitor, validate, and manage hardware health and maintain uptime/availability.
- Troubleshoot and debug issues. Analyze trends in failures and provide proactive remediations.
Qualifications
Required/Minimum Qualifications
- 10+ years of technical engineering experience
- OR Bachelor's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 8+ years of technical engineering experience
- OR Master's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 5+ years of technical engineering experience
- OR Doctorate degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 4+ years of technical engineering experience.
- 8+ years of relevant experience in scaling infrastructure, building automation, and managing hardware deployments in data center-like environments.
- Experience in Windows or Linux system administration, including configuration management systems, CI/CD tools, and Virtual Machine (VM)/container technologies.
- Object-oriented coding experience with an understanding of agile software development practices.
Other Requirements
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Additional Or Preferred Qualifications
- 15+ years of technical engineering experience
- OR Bachelor's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 10+ years of technical engineering experience.
- OR Master's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 8+ years of technical engineering experience.
- OR Doctorate degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 5+ years of technical engineering experience.
- 10+ years of experience in server, storage, and/or networking system administration in a Windows environment, including provisioning and managing servers, system patching, and deploying software updates.
- Experience building, validating, and deploying software in a hyperscale environment.
- Experience with infrastructure management tools such as configuration management, containers, VMs, CI/CD, IaaC, source control, and/or monitoring.
- Knowledge of operating systems (Windows and Linux) and kernels.
- Experience working in a hyperscale hardware development environment.
- Effective verbal and written communication skills.
- Experience with bug management and tracking tools.
- Experience with hardware, firmware, management firmware, and/or software (system & application stack) interfaces across all modules in a system.
Hardware Engineering IC5 - The typical base pay range for this role across the U.S. is USD $133,600 - $256,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $173,200 - $282,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
#J-18808-LjbffrSimilar Jobs
- View Job
Development Operations Engineer
Palo Alto - View Job
Software Development Engineer, Software Development Engineer, AI Operations
Santa Clara - View Job
Principal Engineer
Mountain View - View Job
Principal Aerothermal Engineer / Sr. Principal Aerothermal Engineer
Sunnyvale - View Job
Principal Software Engineer
Sunnyvale