Data Engineering is the practice that enables data-driven decision-making by collecting, transforming and publishing data. Data Engineers design, build, operationalize, secure, and monitor data processing systems. They focus on security and compliance, scalability and efficiency, reliability and fidelity, flexibility and portability.
This training program has been developed for talented Juniors with basic knowledge of Java or Python eager to build a successful career in the Data Engineering sphere.
During our course, you will learn various characteristics of big data and its sources. We will introduce you to architectural requirements, principles of big data infrastructures and the intersection of cloud computing with big data. We will also provide an overview of the most popular big data technologies, including core Hadoop, NoSQL databases, Apache Spark, Apache Kafka, etc.
The program consists of two stages:
Upon program completion, if all materials have been studied, assigned tasks accomplished and the final assessment successfully passed, you will be invited to a technical interview in the production. The best students will have an opportunity to receive a job offer and join the EPAM team.
To enhance your knowledge of Python Core and Java basics, you can explore the following useful resources:
These articles will help you improve your English skills:
1 STAGE. SELF-PACED β 3 months
This stage takes ~3 months and requires ~12 hours of workload in a week.
1 module
Basic technical self-paced courses:
You need to score at least 70%
2 module
Data Engineering Intro self-paced courses:
You need to score at least 70%
2 STAGE. DATA ENGINEERING LAB β 3 months
If you show good results during the technical interview, we will invite you to continue your studies and practice the acquired skills at the intensive stage in EPAM Data Lab (it usually lasts for 3 months). There you will be offered the mentoring program with one-to-one weekly communication with production experts:
Have any questions? Contact us