Data engineering from design to non-trivial processing.
TRAINING FORMAT
ONLINE
WHO IT'S FOR
JUNIOR/MIDDLE
LEARN HOW TO PROPERLY PREPARE DATA OF ANY SIZE AND COMPLEXITY
Training samples for machine learning and beautiful charts for reports don't appear by themselves: data needs to be collected, stored, validated and combined from different sources, while reacting quickly to changes in its structure.
STANDARD PATH:
YOU START WORKING WITH THE DATA
→
YOU'RE TRYING TO MAKE IT SYSTEMATIC AND SCALABLE
→
YOU REALISE YOU DON'T HAVE THE KNOWLEDGE TO COVER THE ENTIRE DWH ARCHITECTURE
×
To work effectively with data, one tool is not enough - you need to consider all the interrelationships of a large warehouse, understand the customer's needs, and treat the data as an end product.
A strong data engineer, through breadth of knowledge and understanding of DWH architecture, is able to select the right tools for any task and deliver results to data consumers.
YOUR CV IN 5 MONTHS
Roy Mudd
Data Engineer
- I work with relational databases, including MPP systems, and understand the specifics of distributed systems built on Greenplum
- I know how to build and automate ETL/ELT pipelines with Apache Airflow
- I have experience with big data in Hadoop and Spark and can write complex SQL queries in Apache Hive
- I understand data warehouse (DWH) architecture, multidimensional modelling, anchor modelling and Data Vault techniques
- I have hands-on experience running Spark in Kubernetes and understand the basic approaches to building data warehouses in the cloud
- I understand how BI tools such as Tableau work and how to prepare data for them
- I apply ML models to big data, know how to prepare data for training, and understand approaches to dataset versioning with Data Version Control
- I know the basic approaches to data management based on DMBOK
DESIRED SALARY FROM
$150,000 per annum
HOW TRAINING TAKES PLACE
COURSE DETAILS
The lecturers will talk about the course and its content. You will learn what the value of each module is and how the knowledge gained will help you in your future work.
TRAINING FORMAT
- Training takes place in an intensive format of 3 lessons per week
- Homework assignments are done on real infrastructure
- All lectures and supplementary materials are available on the education platform and remain with you after the course is over
- Our students spend an average of 10 hours per week on their studies
WORK WITH DATA IN ANY SYSTEM
- Learn data warehouse architecture and approaches to data warehouse design
- Compare Hadoop-based Big Data solutions and relational MPP DBMSs in practice
- Learn to work with clouds and automate ETL processes with Airflow
UTILISE OUR INFRASTRUCTURE
- Work with all the tools you need on a dedicated server
- Improve your skills with Hadoop, Greenplum, PostgreSQL, Airflow, Spark, Hive and Kubernetes
ASK ANY SUPPORT QUESTIONS
- Discuss challenges and projects with market experts
- Your mentors will be data engineers from leading companies
WHO THIS COURSE IS FOR:
BI DEVELOPER
You are involved in developing business intelligence systems and want to master the architecture of modern data warehouses and learn how to design them.
DATA ENGINEER
Already working with data warehouses, but want to systematise your knowledge and dive deeper into the technologies involved.
DATA ANALYST
Constantly interacting with databases, but want to better understand ETL processes and take analytics to the next level.
BACKEND DEVELOPER
Have backend development experience and want to apply it to big data storage and processing challenges.
RECOMMENDED LEVEL:
PYTHON
> Knowledge of language syntax
> Understanding of basic data structures (list, dictionary, tuple)
> Mastery of OOP basics (class, object)
INFRASTRUCTURE
> Ability to work with the command line
> Knowledge of basic Linux commands
> Experience with Git
SQL
> Knowledge of basic syntax (SELECT, WHERE, GROUP BY, HAVING)
> Ability to create subqueries and make all kinds of JOINs
> Skill in working with window functions
COURSE PROGRAMME ://
We will start our dive into data engineering by getting acquainted with relational and MPP databases. We will look at their architecture, discuss popular solutions, and find out when MPP databases are better than traditional ones. We will learn how to work with PostgreSQL and with MPP databases, using Greenplum as an example.
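For a taste of what this looks like in practice, here is a minimal sketch of creating a Greenplum table with an explicit distribution key from Python via psycopg2; the connection details, table and column names are made up for illustration.

```python
# A minimal sketch: creating a Greenplum fact table with an explicit
# distribution key. Connection parameters and names are illustrative.
import psycopg2

conn = psycopg2.connect(host="gp-master.example.com", port=5432,
                        dbname="dwh", user="etl_user", password="...")
with conn, conn.cursor() as cur:
    cur.execute("""
        CREATE TABLE IF NOT EXISTS sales_fact (
            sale_id     bigint,
            customer_id bigint,
            sale_dt     date,
            amount      numeric(12, 2)
        )
        DISTRIBUTED BY (customer_id)  -- rows are spread across segments by this key
    """)
```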
ETL is a key process in data warehouse management. We will look at its principles and the main stages of building it, get acquainted with the popular Airflow tool, examine its main components in detail and learn how to automate ETL pipelines with it.
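As an illustration of the kind of pipeline automation covered here, below is a minimal Airflow DAG sketch (Airflow 2.x style; the task names and logic are illustrative): one extract task followed by one load task, scheduled daily.

```python
# A minimal Airflow DAG sketch: extract, then load, once a day.
from datetime import datetime
from airflow import DAG
from airflow.operators.python import PythonOperator

def extract():
    print("pulling data from the source system")

def load():
    print("loading data into the warehouse")

with DAG(
    dag_id="example_etl",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)
    extract_task >> load_task  # load runs only after extract succeeds
```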
We will get acquainted with the mechanisms of distributed big data storage based on Hadoop and analyse the main patterns for implementing distributed processing. We will consider fault tolerance and recovery from failures, and talk about streaming data processing and about methods and tools for monitoring and profiling Spark jobs.
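As a small, hedged example of the distributed processing pattern described above (the paths and column names are made up), the sketch below reads Parquet files from HDFS with Spark and computes a daily aggregate.

```python
# A minimal PySpark sketch: distributed read from HDFS and a grouped aggregation.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("events_daily_agg").getOrCreate()

events = spark.read.parquet("hdfs:///data/raw/events/")  # distributed read
daily = (events
         .groupBy("event_date", "event_type")
         .agg(F.count("*").alias("events"),
              F.countDistinct("user_id").alias("users")))
daily.write.mode("overwrite").parquet("hdfs:///data/marts/events_daily/")
```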
A data warehouse is a centralised store of data from different sources. We will get acquainted with its top-level logical architecture, consider its main components and try out different approaches to designing the detailed DWH layer in practice.
We will consider cloud solutions and tools for building a DWH and a Data Lake, get acquainted with Kubernetes and learn how to use it for working with data. We will work with the cloud in practice and walk through installing and configuring JupyterHub and Spark in Kubernetes.
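To make the Spark-in-Kubernetes part more concrete, here is a minimal sketch of starting a Spark session from a notebook running inside the cluster; the API server address, namespace, container image and resource settings are all illustrative assumptions.

```python
# A minimal sketch: a Spark session whose executors run as Kubernetes pods.
from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .master("k8s://https://kubernetes.default.svc:443")
         .appName("jupyter-spark")
         .config("spark.kubernetes.namespace", "data")
         .config("spark.kubernetes.container.image", "example-registry/spark-py:3.5.0")
         .config("spark.executor.instances", "3")
         .config("spark.executor.memory", "4g")
         .getOrCreate())

spark.range(1_000_000).count()  # work is distributed across the executor pods
```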
We will consider the basic principles of working with data from the point of view of visualisation and learn to look at data through the eyes of its consumers. We will get acquainted with Tableau, a flexible and powerful BI tool, learn how it interacts with databases and use it to build an interactive dashboard for monitoring a DWH platform.
We will get acquainted with the theory of distributed machine learning. We will learn how to work with the popular Spark ML module and consider approaches to training and applying models on big data.
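For illustration, a minimal Spark ML sketch is shown below; the feature and label column names, and the prepared DataFrame `df`, are assumptions. It assembles features and trains a distributed logistic regression inside a Pipeline.

```python
# A minimal Spark ML sketch: feature assembly + logistic regression in a Pipeline.
from pyspark.ml import Pipeline
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.classification import LogisticRegression

assembler = VectorAssembler(inputCols=["age", "income", "num_orders"],
                            outputCol="features")
lr = LogisticRegression(featuresCol="features", labelCol="label")
pipeline = Pipeline(stages=[assembler, lr])

# df: a prepared Spark DataFrame with the feature columns and a binary "label"
train_df, test_df = df.randomSplit([0.8, 0.2], seed=42)
model = pipeline.fit(train_df)          # training is distributed across executors
predictions = model.transform(test_df)  # scoring is also a regular Spark job
```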
In their work, engineers often have to prepare data for training ML models. We will consider tools for building ML pipelines, versioning datasets, and tracking and cataloguing models.
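As a small example of dataset versioning with Data Version Control, the sketch below reads a DVC-tracked file pinned to a specific revision via the dvc.api Python interface; the repository URL, file path and tag are made up.

```python
# A minimal sketch: reading a DVC-versioned dataset pinned to an exact revision.
import pandas as pd
import dvc.api

with dvc.api.open(
    "data/train.csv",
    repo="https://github.com/example/ml-project",  # illustrative repository
    rev="v1.2.0",  # git tag or commit that fixes the dataset version
) as f:
    train = pd.read_csv(f)
```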
In practice, you often have to deal with diverse data and a huge number of integrations and processes that perform various transformations on it. We will get acquainted with popular approaches to data management and discuss tools for data quality control and data provenance tracking.
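Dedicated data quality tools wrap this idea in a declarative form, but a minimal hand-rolled sketch of the underlying checks on a Spark DataFrame might look like this (the column names and rules are illustrative).

```python
# A minimal sketch of hand-rolled data quality checks on a Spark DataFrame.
from pyspark.sql import functions as F

def check_quality(df):
    total = df.count()
    checks = {
        "no_null_keys": df.filter(F.col("customer_id").isNull()).count() == 0,
        "unique_keys": df.select("customer_id").distinct().count() == total,
        "positive_amounts": df.filter(F.col("amount") < 0).count() == 0,
    }
    failed = [name for name, ok in checks.items() if not ok]
    if failed:
        raise ValueError(f"Data quality checks failed: {failed}")
    return True
```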
ALUMNI FEEDBACK/
I was satisfied with the course: I learnt new technologies (in an applied, rather than overview format) and closed gaps in my fundamental understanding. And most importantly, I got the idea of deploying my data solution in the cloud. As a result, I took a server on DigitalOcean and made my workspace there: I deployed clusters, Jupyter, Superset for visualisation, Airflow for automation, as well as Spark and ClickHouse, following all the recommendations from the lessons. I was very pleased with it.
Now I'm rebuilding my pet project and transferring it to this server - with process building as we discussed in the course. Of course, I don't have BigData, everything is much more prosaic and smaller, but now I have real experience ;).
Kevin
I worked with machine learning and analytics, doing scoring and recommendation models. In my previous job, I managed a team of data engineers. And I wanted to tighten up my competences. Now I've changed jobs because of the move. The company is smaller, so somewhere I do analytics, somewhere I act as an engineer, and somewhere I do development.
At first I took courses on Stepik, and from there I learnt about the Hard ML course. I return to my own Hard ML notes regularly to better solve work tasks. I had no doubts when buying the data engineering course, although I had high expectations after the Hard ML course. Results: overall everything I wanted to learn, I learnt. The theoretical videos were interesting and informative. I liked the block on cloud storages, I had an opportunity to deploy something of my own right away. Sometimes I revisit the block on ETL - the knowledge from there helps me to solve work tasks. A bit lacking in practice. I would like more assignments to write code. In terms of format - it is good that all lectures are recorded in advance. I think it's right - the lecturers don't get tired or exhausted. It's nice that a community has formed around the courses, and both students and professors help in chat rooms.
Nicole
TUITION FEE
Start mastering the data engineering profession, get access to remote server work and support from our instructors.
> Relational and MPP DBMS
> ETL process automation
> Big Data
> DWH design
> Cloud storage
> Data visualisation
> Big ML
> Model management
> Data management
> Support from teachers
> Working on a remote server
To pay for the course, you need to register on our education platform with your first name, last name and email.
If you already have an account, you can use it.
ASK A QUESTION
We will contact you and answer any questions you may have about the course.
FAQ
Yes, we carry out educational activities on the basis of a state licence.
To study comfortably on the course, you need to be able to write code in Python and compose SQL queries against databases. You will not need any specialised knowledge of data engineering.
You can watch the lectures from any device, but you will need a computer or laptop to write code. There are no hardware requirements - we will provide all the necessary infrastructure on a remote server. At the start of training, you don't need to install any special software - you will only need a browser and standard applications for communication: Telegram, Zoom, and Slack.
On average, our students study 10 hours a week. This is enough time to be able to watch lectures and complete homework on time.
We have organised the training in such a way that you can combine it with your work, study and personal life. You can study at any time and at a pace that suits you - all lectures are pre-recorded and broken down into short 15-30 minute videos, and there are soft two-week deadlines for homework.
The training lasts for 5 months. There will be three lessons each week, released gradually. The lessons consist of video lectures, notes and practical assignments with a two-week deadline; you keep access to the assignments after the deadline has passed. If you encounter difficulties during the course, you can seek help from mentors.
It is quite normal to get "stuck" on a task during training. In this case, we have a support team that will help you to solve a difficult task.
If things do not go according to plan and you feel that you are falling behind on the programme, please let the course supervisors know. Together we will find ways to make your learning experience more convenient.
All MPP DBMSs are based on the basic principles of distributing data across nodes and generating a parallel query plan from a sequential one. Once you have learnt these principles using Greenplum as an example, you can confidently use any other databases, including HP Vertica and Teradata. ClickHouse is a specialised database with a number of limitations: for example, it is difficult to join two derived tables that do not fit in memory. Greenplum has no such disadvantages, so we chose it.
Any MPP RDBMS has the same basic principles as Greenplum. If your company uses any MPP RDBMS (e.g. Vertica or Teradata), you will be able to apply all the knowledge gained during the course without any restrictions. If your company does not use an MPP RDBMS, then after the training you will either be able to propose its implementation or realise that there is no need for it.
Yes. In this module, we'll tell you how Tableau works internally, explain how query results are cached, and teach you how to configure connectors to different sources. We'll also talk about extracts and the different architectures of Tableau's work with databases, discuss how to merge data on the tool side (and whether it's worth it), and figure out when it's better to use long sources in Tableau and when it's better to use wide sources.
Creating a data mart that combines multiple sources is quite a complex process. We can't give you a universal guide, but we will explain all the steps in detail: designing the mart, working with the Hadoop stack, and interacting with analytical DBMSs and code-driven ETL platforms. Once you understand these steps, you will be able to solve the task at hand.
ANY QUESTIONS?
Fill out the form, we will contact you, answer all your questions and tell you more about the course.