DIGITAL INFRASTRUCTURE SPECIALIST | (UCY-421)

Bebeeengineer


Job Title: Site Reliability Engineer (Middle) Position Overview: As a Site Reliability Engineer, you will play a critical role in ensuring the stability and efficiency of our software infrastructure. You will be responsible for identifying and resolving issues, as well as implementing proactive measures to prevent problems from arising. Key Responsibilities: - On-Call Support: Participate in a team that provides 24/7 on-call support for critical SaaS events; - Alert Management: Manage alerts daily, check systems, and escalate issues as needed; - Infrastructure Development: Proactively create appropriate monitors in the EKS/K8S ecosystem; - Deployment and Maintenance: Deploy to EKS/K8s cluster using Terraform and Helm, and maintain existing infrastructure running under Docker Swarm; - Automation and Integration: Automate manual tasks, implement/integrate new technologies in our Cloud Infrastructure, and collaborate with other teams and departments to provide the highest level of support and assistance; - Root Cause Analysis: Perform RCA and take necessary corrective actions to prevent the recurrence of issues; Requirements: - Professional Experience: 2+ years of professional experience; - Cloud Engineering Skills: Experience working with Datadog, AWS Cloud Engineer, EKS/Terraform/Helm, Docker and Docker Swarm; - Technical Knowledge: Good understanding of AWS IAM roles and policies, experience logging and monitoring AWS resources using CloudWatch logs, experience working in a Linux environment, proficient in Bash and/or Python scripting, and strong understanding of web technologies such as REST APIs; - Communication Skills: Excellent oral and written communication skills, customer-facing communication skills to effectively explain issues and RCAs to them, and upper-Intermediate English level. The Benefits of Joining Us: - Personal Growth: Accelerate your professional journey with mentorship, TechTalks, and personalized growth roadmaps; - Competitive Compensation: Competitive USD-based compensation and budgets for education, fitness, and team activities; - Flexibility: Flextime and options for working from home or going to the office – whatever makes you the happiest and most productive; - Collaborative Environment: Collaborate with other teams and departments to provide the highest level of support and assistance.

trabajosonline.net © 2017–2021
Más información