Overview: Data Science Engineer
We are looking for a savvy Data Science Engineer to join our team. First and foremost, the hire will help clear a large backlog of client data reporting requests by running standardized functions. While doing so, they will review the environment and, where possible, optimize our data and data pipeline architecture, as well as data flow and collection for cross-functional teams. The ideal candidate has experience with Databricks, PySpark, SQL, and Python; is familiar with APIs (querying them, not building them); can produce high-quality documentation of the processes they learn; has worked with Git (version control); and has a strong familiarity with ETL. The Data Science Engineer will support our software developers, database architects, data analysts, and data scientists on data initiatives and will ensure that an optimal data delivery architecture is applied consistently across ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems, and products. The right candidate will be excited by the prospect of optimizing or even re-designing our company’s data architecture to support our next generation of products and data initiatives.
Responsibilities
• Assist with a large backlog of client data reporting requests.
• Document the processes and functions used.
• Assemble large, complex data sets that meet functional / non-functional business requirements.
• Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
• Use existing infrastructure for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and Databricks ‘big data’ technologies.
• Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.
• Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.
• Keep our data separated and secure across national boundaries through multiple data centers and AWS regions.
• Create data tools that help analytics and data science team members build and optimize our product into an innovative industry leader.
• Work with data and analytics experts to strive for greater functionality in our data systems.
Required Skills
• Databricks, PySpark, SQL, Python, Git, API queries, high-quality documentation, and strong familiarity with ETL.
This job is already closed and no longer accepting applicants, sorry.