Data Engineer - Hadoop OzoneCH
Mphasis
📍 Berkeley Heights, NJ, United States
Job Description
We are seeking a highly skilled Big Data Engineer with strong experience in Apache Spark, the Hadoop ecosystem, and Apache Ozone. The ideal candidate will design, develop, and optimize large-scale data processing systems, ensuring high performance, scalability, and reliability for enterprise-level applications.
Key Responsibilities:
• Design and implement distributed data processing solutions using Apache Spark, Hadoop, and Flink.
• Develop and maintain Spark applications for data transformation, aggregation, and ETL processes using Scala, Java, or Python.
• Utilize Apache Ozone for storing large-scale datasets, ensuring efficient data access and management in a distributed environment.
• Manage and optimize HDFS, Apache Ozone, and Kafka for scalable and fault-tolerant storage.
• Develop ETL pipelines for batch and real-time data ingestion and transformation.
• Implement and ensure data validation, data security, integrity, and compliance across ...