Beschreibung
remote München: Data Engineer w/m/d POS64969 SQL Python R Azure Databricks Data LakeEinsatzort: remote München
Position: Data Engineer w/m/d
Start: asap
Dauer: Ende 2021+
Ort: aktuell remote, ggf München
Umfang: 320 Stunden+
Our client has a lot of company data from all kinds of providers.
Those company entities do not match a priori. Therefore, a mapping table is needed with
matching ID#s to uniquely identify companies within all datasets.
We call this a "golden mapping table". Pulling data together from different sources within a data lake
environment.
Matching, deduplication and logic tasks to create a "golden table" out of the different tables.
This newly created file describes a standard company data set with mapping ID#s for different data providers.
Used
procedures should be well maintained and documented.
Skills:
- SQL
- Excellent communication skills
- R or Python
- Data Lake Environment
- Azure Databricks
PL/SQL
Python