Job Description
- At least 5 years of experience in data analysis
- At least a year of experience in data quality focused role
- Experience with big data tools: Hadoop, Spark, Kafka, etc.
- Experience with relational SQL and NoSQL databases, including
Postgres and Cassandra.
- Experience with data pipeline and workflow management tools:
Azkaban, Luigi, Airflow, etc.
- Experience with AWS cloud services: EC2, EMR, RDS, Redshift
- Experience with stream-processing systems: Storm, Spark Streaming, etc.
- Experience with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
- In-depth knowledge of statistical methods and tests
- Extensive experience with statistical packages, such as MS Excel, SAS, and SPSS
Responsibilities:
Data Quality Management:
- Develop data quality reports on large datasets to determine data quality based on the set rules an...