Service Reliability Engineer

Bayern  ‐ Vor Ort
Dieses Projekt ist archiviert und leider nicht (mehr) aktiv.
Sie finden vakante Projekte hier in unserer Projektbörse.

Beschreibung

Summary Overview of Responsibilities:

  • Participate in the successful delivery of the end to end services with agreed SLAs to our customers by leveraging

  • Improving, designing and implementing services that automate application provisioning and manage the underlying infrastructure as a service (all layers, from compute to storage, including network).

Goal:

  • Accelerate Application teams' ability to reliably and consistently deliver applications by developing standardized automation to control

  • Build, artifact and deploy managed services

  • Integrated into loosely coupled toolchains

  • Form a common continuous deployment pipeline for application development teams

Other responsibilities include:

  • capacity planning

  • change management

  • problem management

  • incident management

  • release management

  • performance improvement

  • automation/tool development

Good communication and teamwork is extremely important.

Major Responsibilities:

Support an ultra-highly available cloud-based applicative platform for our customers.

Support application deployments, building new systems and upgrading and patching existing ones.

Develop automation to quickly and rapidly deploy instances from blue-printed applications or golden images.

Develop and use monitoring tools to find problems, resolve and/or escalate to development and ensure that we exceed our SLAs.

Build and manage development and testing environments, assisting developers in debugging application issues using tools.

Participate in the building of tools and processes to support the infrastructure.

Leverage Scripting to build required automation and tools on an adhoc basis.

Operate the platform within our security and privacy guidelines.

Learn on the job and explore new technologies with little supervision.

Ability to use a wide variety of open source technologies and tools.

Experience with systems and IT operations.

Comfort with frequent, incremental code testing and deployment.

A strong focus on business outcomes.

Strong sense of collaboration, open communication and reaching across functional borders.

Provide hands-on engineering, administration and technical support.

Troubleshoot issues across the entire stack - hardware, software, application, and network.

Document current and future configuration processes and policies.

Proactive thought leadership for creative and efficient technology solutions.

Drive continuous improvement to the service delivered to customer (agility, stability ...)

Process reengineering and optimization

Drive the enforcement and definition of operational requirements/non-functional requirements in collaboration with application owners and Middleware organizations.

24x7 pager rotation of the team

Know How/Skills

Software Engineering methodologies and development cycle (Open Source development), including:

 Version Control system (GIT and SubVersion) and Continuous Integration and testing methods (Jenkins)

Oriented Architecture design patterns

knowledge in Networking is needed, including:

Communication Protocols (TCP/IP, DNS, SSH, HTTP/S)

Load balancing techniques, traffic routing, and caching for distributed applications, scalability

Identifying, troubleshooting, and resolving system level issues on large, busy networks

Deployment and infrastructure configuration management tools (such as Maven, Capistrano, Puppet, NPM, etc)

Linux operating system administration (RHEL or SLES)

Linux Containers deployment technologies (Docker or LXC)

C, C++, or Java, and Shell, Perl, GO or Python

monitoring tools and concepts (Kibana, ElasticSearch)

cloud systems and related ecosystem (CouldStack, OpenStack, AWS API, etc...) ?Virtualization Technology (such as EC2, Xen, KVM, OpenStack)

Very good knowledge in relational DB (Oracle, MySQL, MariaDB) and noSQL technology (Cassandra, S3, HBase, Hadoop, MongoDB, CouchBase)

Good understanding of security information and event management technologies

Please forward an up to date CV for a prompt Response.

Start
ASAP/August 2017
Dauer
limited for 6 months
Von
Personal Phoenix GmbH
Eingestellt
15.07.2017
Projekt-ID:
1381416
Vertragsart
Freiberuflich
Um sich auf dieses Projekt zu bewerben müssen Sie sich einloggen.
Registrieren