Beschreibung
Key Responsibilities- Deploying LLM models: Design, develop and deploy the LLM-based system, which can provide content-related insights and generative AI applications, ensuring scalability, efficiency, and accuracy
- Developing and Deploying a semantic search service
- Generative AI Development: Contribute to the data processing, prompt engineering, and fine-tuning to fit the LLM in different use scenarios
- Evaluating possible architecture solutions by taking into account cost, business requirements, emerging technologies, and technology requirements, like latency, throughput, and scale
- Maintain clean, scalable code, ensuring reproducibility and easy integration of models into production environments
- Collaborate with multidisciplinary teams: understand business requirements and translate them into LLM solutions
Requirements
- Experience in deploying large-scale language models like GPT, BERT, or similar architectures
- Good understanding of LLM API/LLM open source models/prompt engineering/fine-tuning
- Good understanding of semantic search and text summarisation solutions
- Experience in LLM for long-tailed language application - an advantage
- Experience with cloud frameworks like AWS and Azure
Darwin Recruitment is acting as an Employment Business in relation to this vacancy.