[MT-862] | JUNIOR DATA ENGINEER – AUTOMATION

Vlex España


About Vlex At Vlex, we're passionate about turning raw data into powerful business insights. Junior Data Engineer Role Summary The Junior Data Engineer is responsible for building, maintaining, and optimizing automated data pipelines that provide clean, structured, and actionable datasets to support marketing and commercial initiatives. Main Responsibilities - Automated Data Collection: - Design and implement automated scraping processes from public websites (e.g., bar associations, universities, law firms) using Python, Scrapy, or APIs to generate structured contact and company databases. - Execute scraping from web apps or event platforms to capture attendee lists and participant data from global conferences and webinars. - Develop and maintain custom scraping scripts, proactively monitoring for UI changes or failures and applying timely fixes. - Scraping Environment Management: - Set up and manage scraping environments, including the use of virtual machines (VMs), proxy management, and IP rotation, to optimize performance and avoid detection or throttling from target servers. - Data Cleaning & Structuring: - Transform raw scraped or imported data into usable formats by cleaning and normalizing datasets, formatting critical fields such as email, company, size, and industry, and generating import-ready files for tools like HubSpot or advertising platforms. - Clean and maintain existing commercial databases, ensuring data integrity, removing anomalies, and verifying completeness against internal standards. - Data Extraction & Technical Collaboration: - Run SQL queries to support structured data extraction, transformation, and analysis as needed for internal consumption. - Collaborate via GitHub to manage code versions, track changes, and maintain alignment with other contributors. Required Technical Skills - Python (automation and scraping-focused development) - Web scraping libraries (Scrapy, BeautifulSoup, Selenium) - API consumption and basic integration - Automation tools and scripting best practices - Environment setup: VMs, proxies, IP rotation strategies - Data structuring and cleaning (with commercial use in mind) - SQL (basic queries and data manipulation) - GitHub (version control and collaboration workflows) - Familiarity with marketing tools such as HubSpot - Ingles C1 About the Job This is a full-time position offered under a permanent contract, with hybrid or remote work options. Salary is negotiable based on experience and skills.

trabajosonline.net © 2017–2021
Más información