We are seeking an experienced, hands-on Lead Data Software Engineer to guide a team of talented data engineers and help shape the foundation of our data ecosystem. You will collaborate closely with the Head of Data/AI to modernize legacy assets, build scalable new data solutions, and enable the organization to use its data infrastructure effectively for advanced analytics and AI initiatives.

Responsibilities
- Lead and mentor a small team of data engineers, ensuring best practices and high-quality deliverables
- Oversee the cleaning, transformation, and optimization of legacy data assets to ensure data accuracy and usability
- Collaborate with stakeholders to understand their data needs and translate those requirements into scalable, efficient workflows
- Design and architect data pipelines with seamless ETL/ELT processes that meet performance needs
- Work extensively with cloud platforms, particularly Google Cloud Platform, using tools like BigQuery, Vertex AI, Dataform, and Dataplex
- Execute hands-on coding and data engineering work, contributing to pipeline development and tuning
- Build and maintain large-scale datasets in a cloud environment, ensuring reliability and optimization
- Develop advanced SQL queries for data manipulation, reporting, and insights
- Apply robust programming practices in Python to automate processes and drive efficiencies
- Coordinate with cross-functional teams, including data scientists and business leaders, to align on strategic data initiatives
- Document and thoroughly test all processes and deliverables to ensure scalability and maintainability
- Investigate and troubleshoot data pipeline performance issues, implementing improvements as needed

Requirements
- 5+ years of software development experience
- 1+ years in a leadership or mentoring capacity
- Proven expertise with Google Cloud Platform, including BigQuery, Vertex AI, and Dataform, or with comparable tools in AWS/Azure
- Advanced proficiency in SQL and data transformation techniques
- Strong programming skills in Python or other languages used for data engineering tasks
- Background in working with large-scale datasets in cloud environments, coupled with experience in building scalable and efficient ETL/ELT pipelines
- Excellent written and verbal communication skills in English (B2+ level), enabling effective collaboration within and between teams in a distributed environment

Nice to have
- Familiarity with distributed data systems and tools such as Dataplex or similar platforms across cloud providers
- A track record of successful projects modernizing legacy data systems or building data foundations from scratch
- Knowledge of machine learning tools and workflows, particularly in integrating data for AI applications

We offer/Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn