We are interested in every qualified candidate who is eligible to work in the United States. However, we are not able to sponsor visas or take over sponsorship at this time.
#LI-Hybrid #BI-Hybrid
Reports to: Technology Manager II - Tech Ops
About the Role: As a Site Reliability engineer you will help maintain the reliability of our consumer business from a technology and operational standpoint, and will drive its rapid improvement and efficiency by implementing automated tools, evaluating processes, troubleshooting and resolving complex problems. You will collaborate with IT, Software Engineering, and product teams to resolve operational issues, as well as working on building creative solutions for our teams. You will be applying DevOps principles to manage the platform and applications, and help to drive functionality and adoption through continuous improvement, simplification, and automation. The Site Reliability engineer will join a team of fellow SRE and Observability engineers, working together to make Enova’s reliability best of breed.
Responsibilities:
- Troubleshoot incidents and service requests with teams or vendors, prioritizing by business impact, diagnosing issues, restoring service, and keeping stakeholders informed.
- Manage daily operational requests.
- Monitor service level objectives for availability, latency, scalability, and efficiency of services.
- Participate in the team’s on-call rotation.
- Promote reliability best practices and develop frameworks for stable, scalable products.
- Stay curious about new technologies and evolving best practices.
- Collaborate with Software Engineering and Product teams to advise on effective operational strategies.
Requirements:
- At least 2 years' experience in Site Reliability Engineering (SRE), DevOps, Systems Administration, or Infrastructure Support, working with IT infrastructure (Linux, networking, databases, web technologies).
- Proficiency in at least one programming language such as Ruby, Python, Java, or Go.
- Strong SQL skills, capable of writing efficient and accurate queries.
- Proven experience troubleshooting and resolving complex technical issues, including root cause analysis.
- Excellent problem-solving abilities, with strong written and verbal communication skills.
- Ability to read, understand, and collaborate on software application code.
- Experience with monitoring, automation, and continuous improvement practices to drive platform reliability and efficiency.