AgileEngine is a leading software development company that creates award-winning applications for Fortune 500 brands and startups across various industries. We are ranked among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.

About the Role

We are seeking a Senior DevOps Engineer to join our team. As a DevOps expert, you will be responsible for designing, deploying, and operating scalable and robust Kubernetes environments. You will work with our Data Engineering, SRE, Product, and Business teams to deliver resilient solutions and support key initiatives.

Key Responsibilities:
- Kubernetes Operations: Design, deploy, and operate scalable and robust Kubernetes environments (EKS or similar) supporting data and analytics workloads;
- Argo Workflows: Build, automate, and maintain complex data pipelines using Argo Workflows for orchestration, scheduling, and workflow automation;
- GitLab/Git Migration Projects: Lead or support migration of source code repositories and CI/CD pipelines to GitLab or other Git-based platforms.
Automate and optimize testing, deployment, and delivery using GitOps principles;
- Infrastructure as Code: Develop and manage infrastructure with Terraform and related tools, implementing infrastructure automation and repeatable deployments in AWS and Kubernetes;
- Data Platform Reliability: Support high-availability S3-based data lake environments and associated data tooling, ensuring robust monitoring, scalability, and security;
- Observability: Instrument and monitor Kubernetes clusters, Argo workflows, and data platforms, and create actionable alerts and dashboards to quickly surface and resolve operational issues;
- Incident & Problem Management: Participate in incident, problem, and change management processes, and proactively drive improvements in reliability KPIs (MTTD/MTTR/availability);
- Collaboration: Work cross-functionally with Data Engineering, SRE, Product, and Business teams to deliver resilient solutions and support key initiatives;
- Security & Networking: Apply best practices in networking (Layer 4-7), firewalls, VPNs, IAM, and data encryption across the cloud/data stack;
- Capacity & Performance: Engage in capacity planning, forecasting, and performance tuning for large-scale cloud and Kubernetes-based workloads.

Requirements:
- Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience;
- 5+ years of production experience operating and managing Kubernetes clusters (preferably in AWS, EKS, or similar environments);
- Strong hands-on experience with AWS cloud services;
- Deep hands-on experience with Argo Workflows, including developing, deploying, and troubleshooting complex pipelines;
- Experience with Git, GitLab, and CI/CD, including leading or supporting migration projects and the adoption of GitOps practices;
- Effective at developing infrastructure as code with Terraform and related automation tools;
- Practical experience in automating data workflows and orchestration in a cloud-native environment;
- Proficient in SQL and basic scripting (Python or similar);
- Sound understanding of networking (Layer 4-7), security, and IAM in cloud environments;
- Proficient in Linux-based systems administration (RedHat/CentOS/Ubuntu/Amazon Linux);
- Strong written and verbal communication skills;
- Ability to collaborate in cross-functional environments;
- Track record delivering reliable, secure, and scalable data platforms in rapidly changing environments;
- Experience working with S3-based data lakes or similar large, cloud-native data repositories;
- Upper-Intermediate English level.

Nice to Have:
- Exposure to regulated or healthcare environments;
- Familiarity with data modeling, analytics/BI platforms, or DBT;
- Experience leading software/tooling migrations (e.g., Bitbucket to GitLab), or managing large-scale CI/CD consolidations.

About AgileEngine:
AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries.