Job Description
Required Skills & Experience
- Data Engineering:
Strong hands‑on experience with the Hadoop ecosystem (Hive, Impala, Spark, Kafka, Iceberg, Ranger, Atlas, Nifi, Flink etc.,), and data pipeline orchestration. - Full‑Stack Development:
Proficiency in Java, Python, shell scripting, RESTful API development, and web frameworks such as Flask and React. - Machine Learning & AI:
Experience working with ML platforms such as CML, Spark MLlib, and Python ML libraries (scikit‑learn, XGBoost), including model deployment. - Security & Governance:
Solid understanding of enterprise data security frameworks, including LDAP, Kerberos, RBAC, data masking, and fine‑grained access control. - Performance Optimization:
Demonstrated ability to tune and optimize data queries and applications in large‑scale Hadoop environments. - Tools & Platforms:
Experience with Cloudera Data Platform (CDP), Cloudera Data Services (CDS, CML, Informatica, ...