SITE RELIABILITY ENGINEER (MIDDLE) - JW272

Agileengine


AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across multiple industries. Our people-first culture has earned us multiple Best Place to Work awards. About the Role We are looking for a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems. Key Responsibilities - Manage alerts daily, check systems, and escalate issues as needed. - Be part of a team that provides 24×7 on-call support for critical SaaS events. - Document issues and remediation steps. - Proactively create appropriate monitors in the EKS/K8S ecosystem. - Deploy to EKS/K8s cluster using Terraform and Helm. - Learn and maintain existing infrastructure running under Docker Swarm. - Improve existing infrastructure health by implementing checks and scripts to correct known issues. Requirements - 2+ years of professional experience. - Experience working with Datadog. - Hands-on experience as an AWS Cloud Engineer. - Working knowledge of EKS/Terraform/Helm. - Working Experience with Docker and Docker Swarm. - Good understanding of AWS IAM roles and policies. - Experience logging and monitoring AWS resources using CloudWatch logs. - Experience working in a Linux environment. - Proficient in Bash and/or Python scripting. - A strong understanding of web technologies such as REST APIs. Preferred Qualifications - Experience working with monitoring solutions, such as Grafana and Prometheus. - Excellent oral and written communication skills. - Customer-facing communication skills to effectively explain issues and RCAs to customers. About Us AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups. We rank among the leaders in areas like application development and AI/ML. Our people-first culture has earned us multiple Best Place to Work awards. We offer competitive compensation, flexible work arrangements, and opportunities for growth and development. If you're looking for a challenging and rewarding role in a dynamic and innovative company, we encourage you to apply.

trabajosonline.net © 2017–2021
Más información