Lead the technical design and implementation of the data components of ML/AI-centric solutions.
Work with our Data Scientists to design datasets that are useful for creating statistical and machine-learning models.
Design, develop, and maintain feature stores as well as the accompanying data pipelines that will be used in training, inferencing, and model monitoring workflows:
Work on the following technical areas
Implementing data quality and integrity checks and ensure the quality and availability of data sources in accordance with their SLAs.
Aligning with Data Engineering and Data Governance team to achieve maturity in the data.
Be the expert on various business or data domains to support the data science initiatives
Creation and maintenance of software packages for use by our Data Scientists to help improve their model development workflow.
Building CI/CD pipelines to improve time to deployment of data pipelines and proactively catch issues before they hit production
Building Data Quality checks and monitoring into the system
Ensure that the team adheres to best practices for code and architecture of data pipelines. Do code and architecture reviews to ensure adherence to best practices
REQUIRED QUALIFICATIONS
With at least a bachelor's degree in any quantitative discipline (i.e. Computer Science, Math, Physics, etc)
Having at least 4 years of experience transforming business logic into data models
Having at least 4 years of experience in creating building and maintaining ETL pipelines