Profile picture of Stephan Sahm, Senior Data Science & Engineering Consultant, Developer, Architect, and Project Lead from Munich

Stephan Sahm

available

Last update: 16.04.2024

Senior Data Science & Engineering Consultant, Developer, Architect, and Project Lead

Company: Jolin.io
Degrees: M.Sc. Applied Stochastics, M.Sc. Cognitive Science, B.Sc. Mathematics/Computer Science, B.Sc. Cognitive Science
Languages: German (native) | English (good) | Dutch (basic) | Spanish (basic)

Attachments

CV-Stephan-Sahm-2024_160424.pdf
CV-Stephan-Sahm-2024_160424.docx

Skills

10+ years of experience in Data Science and Data Engineering.
5+ years of experience in Data Consultancy.

Selected Programming Languages:
Julia, Python, R, SQL, Scala, Matlab, Java, C++, Haskell, ROS, JavaScript, HTML, CSS (Web Stack)

Selected Industries:
Telecommunication, Automotive, Retail, Bonus Program, Media, Manufacturer

Selected Data Science Fields:
Statistics, Machine Learning, Deep Learning, Computer Vision, Natural Language Processing (NLP), Analytics, Anomaly Detection, Time Series Prediction, Recommendation, Object Recognition, ETL, Data Pipelines, Data Lake, Visualization, Dashboards

Selected Big Data:
Julia, Dask, PySpark, Spark, Hadoop MapReduce, Data Lake Setup, Yarn, HDFS, Hive, HBase

Selected Database:
PostgreSQL, MongoDB, MySQL, Oracle, Microsoft, Hive, HBase

Cloud:
AWS, Azure, Infrastructure-as-code, terraform, cloudformation, sceptre

AWS:
S3, SNS, Kubernetes, AWS VPC, ETL, CRM, API, pandas, AWS SQS, PostgreSQL, MongoDB, AWS DocumentDB, AWS API Gateway, AWS Cognito, AWS Lambda, infrastructure-as-code cloudformation, AWS Transit Gateway, AWS Networking, EC2, AWS Session Manager, AWS CloudWatch

Azure:
Azure Machine Learning, Azure App Service, Azure AD, infrastructure-as-code terraform

Methodology: 
Scrum, Waterfall

Project History

09/2020 - 03/2021
Lead Developer & Architect
Automotive (automotive and vehicle manufacturing, >10,000 employees)

Supporting Use Case Development on a Data Lake

Guidance was provided for architectural decisions, adapting access policies, and debugging routing issues. A specific GDPR treatment for ingestion processes was implemented and rolled out. In production.

Duration: 6 months
Team setting: Team Lead, Team of 2, remote
Technologies: Infrastructure-as-code, cloudformation, sceptre, python, boto3, PySpark, scala, Spark, AWS Glue, AWS Secrets

01/2019 - 03/2021
Senior Data Science Consultant & Technical Lead
Machine Learning Reply (Internet and information technology, 10-50 employees)

Everything around data science consultancy:
- recruiting new colleagues
- pitching new projects
- request for proposals
- requirements engineering
- team setup
- team lead
- conceptualization of data science or data engineering solution
- development of data science or data engineering solutions
- giving workshops and trainings
- auditing customers' solutions
- architecting data lakes and cloud data infrastructure
- ...

06/2000 - 03/2021
Lead Developer & Architect
Automotive (automotive and vehicle manufacturing, >10,000 employees)

20 ETL Pipelines on AWS

Replacing a CRM system required the development of about 20 ETL pipelines to replace existing systems with new data flows, including one REST API. In production.

Team setting: Team Lead, Team of 3, remote
Technologies: AWS Glue, PySpark, python, boto3, pandas, AWS SNS, AWS SQS, PostgreSQL, MongoDB, AWS DocumentDB, AWS API Gateway, AWS Cog

01/2020 - 05/2020
Lead Developer & Architect
Automotive (automotive and vehicle manufacturing, >10,000 employees)

Building a Multi-Tenant Data Lake on AWS

Implemented from scratch a data lake platform on AWS that is deployed in several countries, using infrastructure-as-code as the key technology. A key focus was GDPR conformity. In production.

Team setting: Team Lead, Team of 2, remote with a few on-site workshops
Technologies: Infrastructure-as-code, cloudformation, sceptre, python, boto3, PySpark, scala, Spark, AWS Glue, AWS Secrets, AWS IAM, S3, SNS, Lambda, Kubernetes, AWS VPC, AWS Transit Gateway, AWS Networking, AWS EC2, AWS Session Manager, AWS CloudWatch, AWS Sagemaker

04/2019 - 12/2019
Core Developer
Telecommunication (telecommunications, >10,000 employees)

Unification of Existing Time Series Analytics

Several custom anomaly detection solutions on time series were refactored and unified into a generic framework that can be easily deployed to new use cases and new infrastructures (tested on AWS). In production.

Team setting: Team of 15, on-site, Scrum
Technologies: Python, PySpark, (PL)SQL, Hive, HBase, Oracle, Tableau

07/2018 - 12/2019
Senior Data Science & Engineering Consultant
Data Reply (Internet and information technology, 50-250 employees)

Everything around data science consultancy:
- recruiting new colleagues
- pitching new projects
- request for proposals
- requirements engineering
- conceptualization of data science or data engineering solution
- development of data science or data engineering solutions
- giving workshops and trainings
- auditing customers' solutions
- ...

08/2018 - 03/2019
Data Science Developer
Bonus Program Company (marketing, PR and design, 500-1000 employees)

Recommender System

Designed, implemented, and deployed a Big Data recommendation system serving millions of daily customers. In production.

Team setting: Team of 1, on-site, weekly reviews
Technologies: On-premise, R, Scala, SBT, Spark, Yarn, HDFS

10/2018 - 11/2018
Quality Assurance & Adviser
Manufacturer (consumer goods and retail, >10,000 employees)

Review: Custom Data Science Framework

Infrastructure review and code review of a framework built by one of our customers.

Team setting: Team of 1, mixed remote & on-site
Technologies: R, AWS

09/2018 - 09/2018
Teacher
(Consumer goods and retail, >10,000 employees)

Workshop: Developing with Apache Spark

Four one-day workshops at customer sites: two introductory and two advanced. Contents: performance optimization, monitoring, interfacing Scala-R-Python, best practices.

Setting: Group of 15 people, sole presenter
Technologies: R, Python, Spark

06/2017 - 08/2018
Data Science Developer
Bonus Program Company (marketing, PR and design, 5,000-10,000 employees)

Fraud Detection

Design, development, implementation, evaluation, and deployment of an anomaly detection system to detect previously unknown types of fraud.

Team Setting: Team of 1, on-site, review once every three months
Technologies: R, Scala, Spark, Yarn

11/2016 - 07/2018
Data Science Consultant
Data Reply (Internet and information technology, 10-50 employees)

Everything around data science consultancy:
- recruiting new colleagues
- pitching new projects
- request for proposals
- requirements engineering
- conceptualization of data science or data engineering solution
- development of data science or data engineering solutions
- giving workshops and trainings
- auditing customers' solutions
- ...

11/2016 - 04/2017
Data Science Developer
Telecommunication (telecommunications, >10,000 employees)

Call Center and Web Content Optimization Using Speech Analytics

A three-dimensional content detection system was set up for written conversations. Given only plain text, it identified customer-specific product entities, services, and problems.

Team Setting: Team of 3, on-site, reviews every week
Technologies: Python, NLP, spaCy

01/2016 - 09/2016
Python Developer
Trufflebit (Internet and information technology, <10 employees)

Data Parsing

Built a parser to extract time series data from customer-specific text data formats.

Team Setting: Team of 1, remote, steady exchange with CEO
Technologies: Python, PyParsing, Cython

09/2015 - 01/2016
Web Developer
Trufflebit (Internet and information technology, <10 employees)

Web Visualization

Built a Django-based web dashboard with Bokeh-based interactive data analysis visualizations.

Team Setting: Team of 1, remote, steady exchange with CEO
Technologies: Python, Django, Bokeh

04/2013 - 03/2014
Computer vision & Object recognition
University of Osnabrück (industry and mechanical engineering, 10-50 employees)

Building an Autonomous Robot

Programmed a robot with wheels and arms to pick up a muffin from the receptionist on the first floor, take the elevator, and bring it to the robotics lab.

Team Setting: Team of 14, on-site, Scrum
Technologies: ROS, Gazebo, Scrum, Python, C++, OpenCV

Willingness to Travel

Available worldwide
Travelling within Germany, Austria and Switzerland.
Remote work is preferred.
» International availability only remote «

Other Information

Professional liability insurance
Hiscox SA
Branch office for Germany
Chief representative for Germany: Robert Dietrich
Arnulfstr. 31
80636 München
Territorial scope: Germany

YouTube Video

YouTube profile: Jolin.io