REMOTE Data Engineer and MLOps/Cloud Engineer

Berlin, Berlin  ‐ Vor Ort

Schlagworte

Datenbanken Cloud-Engineering Information Engineering SQL Automatisierung Big Data ETL Apache Hadoop Python Kinesiologie Nosql Objektorientierte Software-Entwicklung Bediensytem Prometheus Softwareentwicklung Versionierung Daten- / Datensatzprotokollierung Data Science Docker Swarm Large Language Models Grafana Apache Spark Amazon Rds Data Lake Kubernetes Apache Kafka Datenmanagement Docker Elk Stack Microservices

Beschreibung

We are currently looking for one Senior Data Engineer and one Senior MLOps/Cloud Engineer to support an LLM/NPL Project. Here are the must- haves for both of them.

Project Details:
• Starting period: 01.01.2024
• Ending period: 01.04.2024 (extension)
• Location: Only Remote
• Capacity: 5 days/week
• Language: English
• Nearshore : Possible

Data Engineer Must-have experiences with:
• Data management and integration, including Dat Mesh, Data Lakes, and integration with external services
• Core cloud concepts, with a special focus on databases (e.g., AWS RDS /Kinesis /Glue /EC2 /EKS /ECS
• Optimization of NoSQL and SQL databases in a cloud environment
• Software engineering, especially in object-oriented programming (OOP)
• SQL and database query optimization techniques
• Implementing ETL and data ingestion pipelines for both initial and update loads, including batch processing of data (structured and unstructured sources)
• Performing database benchmarking for latency and performance optimization
• Good programming practices, particularly in implementing data cleansers and parsers
• Understands the differences between various database solutions and can recommend appropriate ones, including document databases and vector databases
• Big Data technologies(Hadoop, Spark, or Apache Kafka)
• Message queues and asynchronous processing

Cloud Engineer/MLOps Developer must-have experiences with:
• One or more cloud providers and their services
• Monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)
• Infrastructure-as-code solutions/templates
• Microservices, Docker, and Kubernetes
• The data science stack, including Kubeflow, MLFlow, data versioning tools, GPU acceleration techniques, and multiple types of databases
• Optimizing cloud resources effectively
• Implementing and maintains CI/CD pipelines for ML models
• Python or Bash for automation tasks
• Container orchestration platforms like Docker Swarm or Amazon ECS/EKS
• Best security practices in cloud environments

Are you interested or do you have someone in your network, who might be, we will be happy to work together!
Start
01.2024
Auslastung
100% (5 Tage pro Woche)
Dauer
3 Monate
(Verlängerung möglich)
Von
Digital Associates GmbH
Eingestellt
20.12.2023
Ansprechpartner:
Marcus Kallies
Projekt-ID:
2694988
Branche
Medizin und Pharma
Vertragsart
Freiberuflich
Um sich auf dieses Projekt zu bewerben müssen Sie sich einloggen.
Registrieren