Site Reliability Engineer

Company:  Themesoft Inc.
Location: Dallas
Closing Date: 08/11/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description

Role: SRE Architect

Location: Dallas, Tx


Description:

  • Foster a Site Reliability Engineering culture by sharing best practices, approaches, documentation, and code across engineering teams.
  • Automate manual tasks or system components where applicable, increasing operational efficiency.
  • Troubleshoot and resolve complex issues in cloud-based SaaS and on-premise environments, focusing on OS, networking, and databases.
  • Manage live production incidents, debug/troubleshoot application and infrastructure problems, and implement SRE best practices.
  • Monitor and enhance the performance and stability of applications, driving the implementation of solutions.
  • Conduct system analysis and configuration management, developing system software improvements for performance, availability, and reliability.
  • Design, develop, and maintain software and systems to increase product reliability, observability, and organizational efficiency.
  • Collaborate closely with software engineers and QA teams to ensure system responsiveness to performance, security, and availability requirements.
  • Document system knowledge, create runbooks, and ensure critical system information is accessible.
  • Manage and monitor deployments, orchestrations of servers, Docker containers, databases, and backend infrastructure.
  • Stay current with security trends and proactively identify, diagnose, and solve complex security issues.


Responsibilities:

  • 15+ years of experience in full-stack application support, DevOps, or SRE roles.
  • Proficiency in JavaScript, TypeScript, and web development technologies.
  • Expertise in scripting languages like PowerShell and/or Python.
  • Experience with Resilience Frameworks, Design Patterns, and Chaos Engineering.
  • Ability to explain complex technical scenarios clearly to all organizational levels.
  • Knowledge of DevOps methodologies, CI/CD concepts, and tools such as Jenkins, CodePipeline, Puppet, Ansible, etc., is a plus.
  • Experience with public cloud platforms (AWS, Azure, GCP) and delivering projects on these platforms is a plus.


Regards

Praveen Kumar

Talent Acquisition Group – Strategic Recruitment Manager

Apply Now
Share this job
Themesoft Inc.
  • Similar Jobs

  • Site Reliability Engineer

    Dallas
    View Job
  • Site Reliability Engineer

    Dallas
    View Job
  • Site Reliability Engineer GenAI

    Irving
    View Job
  • Site Reliability Engineer (GenAI)

    Irving
    View Job
  • Site Reliability Engineer - VP

    Irving
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙