Text Analytics Engineer - Darmstadt (GOE-105880)

Darmstadt  ‐ Vor Ort
Dieses Projekt ist archiviert und leider nicht (mehr) aktiv.
Sie finden vakante Projekte hier in unserer Projektbörse.

Beschreibung

Task:
Text Analytics Engineer (with strong focus on data engineering and software engineering) is expected to build data pipelines for data ingestions from source systems, data transformation, data cleansing, and ingestion to target systems with heavy focus on text data such as clinical trials, publications, patents, social media, and internal legacy documents (PDFs, WORD files) stored in RDBMS systems. He/She needs to ensure that data pipelines adhere to internal engineering standards, they are robust and ready for operational use in terms of good documentation and solid testing.

Requirements (Must have):
- Atleast 5 years of hand-on experience in building data pipelines and ETL/ELT running within "Hadoop" environment
- Strong programming skills in Scala and/or Java is a must.
- Solid working knowledge of components of Hadoop ecosystem such as HDFS, Hive, Spark, Sqoop, Oozie, Kafka, Atlas.
- Has developed pipelines to support text mining, text analytics, or natural language processing projects
- Good understanding of document indexing systems such as Elasticsearch, Solr or Lucene
- Backgroud of software engineering, data engineering, software development is an assett
- Experience in processing text data and documents (especially PDF and WORD formats) is an asset
- Working experience with common data exchange formats such as XML and JSON
- Technical project leadership, requirements engineering, conceptual design of solutions are assets
- Integration with analytics and reporting applications such as Qlik, Spotfire, Kibana, Tableau is an asset (not mandatory)
- English

Nice to have:
- Python skills are welcome
- Experience in working in pharmaceutical industry (GxP/GAMP environment)
- German

Environment/Miscellaneous:
- 20% Remote

Beginn: 15.10.2018
Dauer: 31.03.2019
Branche: Chemie/Pharma
Start
10.2018
Dauer
6 Monate
Von
Allgeier Experts Consulting GmbH
Eingestellt
03.10.2018
Ansprechpartner:
Tobias Trockel
Projekt-ID:
1643068
Vertragsart
Freiberuflich
Um sich auf dieses Projekt zu bewerben müssen Sie sich einloggen.
Registrieren