Job Description
ROLE & RESPONSIBILITIES:
- Design, build and maintain scalable data pipelines and ETL workflows to ingest and transform data for analytics and reporting.
- Implement and optimize data storage solutions including data lakes and data warehouses on cloud platforms.
- Develop PySpark and Python applications for large-scale data processing and transformations.
- Ensure data quality, consistency and integrity through testing, validation and the use of data quality tools.
- Collaborate with stakeholders to translate business requirements into technical specifications and data models.
- Propose and review system and solution designs and evaluate technical alternatives.
- Maintain and operate cloud infrastructure and CI/CD pipelines for data platform components.
- Create and maintain technical documentation, runbooks and artefacts for developed solutions.
- Support production troubleshooting, monitoring and inc...