Data Engineer - required ASAP

Nordrhein-Westfalen

‐ Vor Ort

Dieses Projekt ist archiviert und leider nicht (mehr) aktiv.
Sie finden vakante Projekte hier in unserer Projektbörse.

Schlagworte

Monitoring Apache Python Java Stack Hadoop Deployment Dokumentation aws Rohrleitung

Beschreibung

Skillset:
- Very good understanding of the Apache Hadoop stack: HDFS, Oozie, Hue, Spark, Hive
- Experience with Cloudera Impala
- Good skills in Python or any other non-Java language supported by Amazon AWS Lambda
- Knowledge of AWS Lambda preferred
- Understanding of Apache Tez and experience with Datameer is an optional plus
- Knowledge of ELK (Elastic stack) is a plus
Responsibilities:
- Assessment of data sources: are existing ones sufficient to satisfy a particular demand, or are new sources required?
- Technical integration discussions with the owners of new data sources
- Building of data pipelines (cleansing, joining, transformation, aggregation)
- Building of data pipeline monitoring (did all jobs run successfully? If not, trigger an alarm)
- Documentation of pipelines
- Investigations on data mismatch
- Creation, deployment, evolution and monitoring of ingestion scripts (AWS lambda)

Start: ab sofort
Dauer: 6 months
Von: Optimus Search
Eingestellt: 23.06.2018
Projekt-ID:: 1587032
Vertragsart: Freiberuflich

Um sich auf dieses Projekt zu bewerben müssen Sie sich einloggen.

Data Engineer - required ASAP

Schlagworte

Beschreibung

Projekt melden

Projekt empfehlen

Bewerbungslimit erreicht

Willkommen bei freelancermap!