Data Engineer - Hadoop OzoneCH

Mphasis

📍 Berkeley Heights, NJ, United States

Full-time | IT Services and IT Consulting | Posted June 14, 2026

Job Description

We are seeking a highly skilled Big Data Engineer with strong experience in Apache Spark, the Hadoop ecosystem, and Apache Ozone. The ideal candidate will design, develop, and optimize large-scale data processing systems, ensuring high performance, scalability, and reliability for enterprise-level applications.

Key Responsibilities:

• Design and implement distributed data processing solutions using Apache Spark, Hadoop, and Flink.

• Develop and maintain Spark applications for data transformation, aggregation, and ETL processes using Scala, Java, or Python.

• Utilize Apache Ozone for storing large-scale datasets, ensuring efficient data access and management in a distributed environment.

• Manage and optimize HDFS, Apache Ozone, and Kafka for scalable, fault-tolerant storage.

• Develop ETL pipelines for batch and real-time data ingestion and transformation.

• Implement and ensure data validation, data security, integrity, and compliance across ...