System Machine Reliability Engineer - Engineering Infra

Company:  Shopee
Location: Emory
Closing Date: 15/10/2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

```html

System Machine Reliability Engineer - Engineering Infra
Department: Engineering and Technology
Level: Experienced (Individual Contributor)
Location: Singapore

The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve problems at hand; we build foundations for a long-lasting future. We don't limit ourselves on what we can or can't do; we take matters into our own hands even if it means drilling down to the bottom layer of the computing platform. Shopee's hyper-growing business scale has transformed most "innocent" problems into huge technical challenges, and there is no better place to experience it first-hand if you love technologies as much as we do.

About the Team:

The mission of the Shopee Tech Ops MRE (Machine Reliability Engineering) team is to keep the Shopee network and hardware layer running efficiently and sustainably 24x7, building and maintaining massive hardware clusters for SRE with attention to capacity, cost and hardware performance. The team provides sustainable hardware resources and stable network support services.

MRE needs to:

  1. Communicate with the data center team to design and optimise network architecture;
  2. Provide reasonable hardware configuration through hardware testing and selection according to business requirements;
  3. Customise a stable and efficient OS;
  4. Optimise traditional operation through engineering and service means;
  5. Build a complete hardware monitoring system to improve the efficiency of fault handling.

Job Description:

Responsible for:

  1. Maintaining the OS and servers.
  2. Running system services such as NTP, SMTP, Ansible and SaltStack.
  3. Maintaining the CI/CD pipeline.
  4. Providing efficient and effective OS/Server solutions according to business needs.

Requirements:

  1. Bachelor's or higher degree in Computer Science, Engineering, Information Systems or related fields.
  2. Proficient in the Linux operating system.
  3. Familiar with x86 hardware architecture, including CPU, GPU, SSD and PCIe.
  4. Skilled in a variety of system management tools, experienced in performance benchmarking, and familiar with TCP/IP and basic networking concepts.
  5. Large system management experience in an Internet company is preferred.

Skills below are optional but preferable:

  1. RHCE/RHCA certification.
  2. Experience with Ansible/SaltStack.
  3. Experience with SMTP/POP3/IMAP/NTP.
  4. Experience with development of CMDB.
```