Job Description:
Title: Site Reliability Engineer/DevOps (GCP Certified)
Location: Timonium, MD (Hybrid, 3 days/week)
Duration: Long-term contract
Rate: $60+ on C2C OR $55+ on W2
Job Description: Site Reliability Engineer and Capacity Planning
We are looking for a talented Site Reliability Engineer (SRE) with a strong background in Google Cloud Platform (GCP), Red Hat OpenShift, and Linux administration. The ideal candidate will be responsible for ensuring the reliability, performance, and scalability of our on-premises and cloud-based systems, with a focus on reducing Google Cloud costs.
System Reliability: Ensure the reliability and uptime of critical services and infrastructure.
Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud services.
Linux Administration: Perform standard Linux system administration tasks, including monitoring, debugging, and optimizing system performance.
Automation: Develop and maintain automation scripts and tools to improve system efficiency and reduce manual intervention.
Monitoring and Incident Response: Implement monitoring solutions and respond to incidents to minimize downtime and ensure quick recovery.
Collaboration: Work closely with development and operations teams to improve system reliability and performance.
Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle future growth.
Documentation: Create and maintain comprehensive documentation for system configurations, processes, and procedures.
Qualifications:
Education: Bachelor’s degree in Computer Science, Engineering, or a related field.
Experience: 3+ years of experience in site reliability engineering or a similar role.
Skills:
Proficiency in Google Cloud services (Compute Engine, Kubernetes Engine, Cloud Storage, etc.).
Strong Linux administration skills, including experience with system monitoring, debugging, and performance tuning.
Experience with automation tools (Terraform, Ansible, Puppet).
Familiarity with CI/CD pipelines and tools (Azure Pipelines, Jenkins, GitLab CI, etc.).
Strong scripting skills (Python, Bash, etc.).
Knowledge of networking concepts and protocols.
Experience with monitoring tools (Prometheus, Grafana, etc.).
Skill:
Google Cloud Certification, Azure Pipelines, ArgoCD, Prometheus and Grafana dashboards, capacity planning.
Desired Skills:
OpenShift Certification, Linux Certification
Thanks & Regards
Abhishek Pathak
Technical Recruiter at "Stellar Consulting Solutions, LLC"
Email :