I'm currently seeking a data scientist for a project focused on matching two datasets.
Project Overview:
Datasets: Two datasets, one of which has 28 million rows.
Task: Design and implement an effective matching solution.
Scope: Data preprocessing, feature engineering, and composite key creation for matching.
Daily Updates: The matching must be rerun every day as the datasets are updated.
Platform: Data is stored on Google Data Studio.
Key Responsibilities:
· Ensure data consistency and cleanliness.
· Develop strategies for effective feature creation.
· Create a robust composite key for matching.
· Implement solutions to handle daily dataset updates.
· Develop strategies to optimize matching performance.
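To illustrate what composite key matching could look like, here is a minimal sketch in plain Python. The field names (name, city, postcode), the sample rows, and the helper functions normalize, composite_key, and match are all hypothetical and chosen only for illustration; the actual columns and matching rules would depend on the two datasets.

```python
import hashlib
import unicodedata


def normalize(value):
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    text = "" if value is None else str(value)
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = "".join(ch if ch.isalnum() else " " for ch in text.lower())
    return " ".join(text.split())


def composite_key(record, fields):
    """Hash the normalized values of the chosen fields into one key."""
    joined = "|".join(normalize(record.get(f)) for f in fields)
    return hashlib.sha1(joined.encode("utf-8")).hexdigest()


def match(left, right, fields):
    """Return (left_row, right_row) pairs whose composite keys are equal."""
    index = {}
    for row in right:
        index.setdefault(composite_key(row, fields), []).append(row)
    return [(l, r) for l in left for r in index.get(composite_key(l, fields), [])]


# Hypothetical field names and sample rows, for illustration only.
KEY_FIELDS = ["name", "city", "postcode"]
dataset_a = [{"name": "José  Álvarez", "city": "Madrid", "postcode": "28001"}]
dataset_b = [
    {"name": "jose alvarez", "city": "MADRID ", "postcode": "28001"},
    {"name": "Ana Ruiz", "city": "Sevilla", "postcode": "41001"},
]
pairs = match(dataset_a, dataset_b, KEY_FIELDS)
```

Indexing one dataset by key and probing it with the other avoids comparing every row against every other row. At 28 million rows the same idea would more likely be expressed as a join inside a data warehouse or dataframe engine, and for the daily run only new or changed rows would need their keys recomputed.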
This job is already closed and no longer accepting applicants, sorry.