Job Description
Evolvice is a German nearshore service provider with branches in Egypt and Ukraine. Founded in 2012, Evolvice has a strong technical background and business domain knowledge, combining software engineering and Agile methodology, leading its’ clients’ path to digital transformation. Headquartered in the heart of the automobile industry, Stuttgart (Germany), our expertise stretches from automotive, healthcare, travel, financial, governmental, and insurance to manufacturing industries.
Our team is over 50 people including web (C#/.NET, Java, JS) & mobile (iOS/Android/Ionic) developers together with business analysts, project managers, QA, and support staff. Our corporate culture is characterized by agile processes, autonomous teams without hierarchies, as well as openness and transparency – both internally and with our clients. Currently, we are searching for Senior Site Reliability Engineer to join the big team of professional in Cairo. We are looking for an active, responsive, and devoted person.
Responsibility
– Responsible for how code is deployed, configured, and monitored, as well as the availability, latency, change management, emergency response, and management capacity of services in production;
– Helps teams to determine what new features can be incorporated and when by using service-level agreements (SLAs) to define the required reliability of the system through service-level indicators (SLI) and service-level objectives (SLO);
– Using automation to fix issues, bugs and bloatware;
– Develop solution to implement the SLO/SLI requirements, including visualization of the monitoring dashboard;
– Collaborate with other development, security, and compliance teams to execute on product deliverables;
– Focus on observability with an eye to quicker resolution of production issues;
– Strong verbal and written communication skills, with the ability to work effectively across internal and external organizations.
Requirements
– 4+Years of experience with cloud-based technologies and tools in configuration management, deployment, monitoring and operations;
– Excellent experience in DevOps/SRE;
– Define consistent monitoring, metrics and alerting across different micro-services (Docker-Kubernetes, Serverless);
– Strong experience working with scripting languages like python and bash;
– Candidate must have hands on experience with logging and monitoring solutions such as Prometheus, Grafana, Splunk/Elastic, Fluent Bit, Logstash, Kibana, Grafana, Application and Infrastructure monitoring tools, and Public Cloud monitoring tools such as CloudWatch, VPC flow logs;
– Experience with configuration management and Infrastructure as a code tools like Terraform, Ansible, CloudFormation, Salt;
– AWS Cloud is preferred;
– At least Upper-Intermediate level (all the internal communication is in English);
– Strong architectural understanding for large scale distributed microservice and serverless based systems is a plus.
We Offer
– Financial stability.
– Interesting and challenging projects within professional self-managed teams.
– Friendly team and a comfortable working environment.
– Flexible schedule (8-10AM start) with a possibility to work assigned hours and/or adjust work schedule as requested by manager.
– 21 working day paid annual vacation.
– Health insurance.
– Social insurance -the highest level.
– Paid sick leave.
– Performance review after half of the year.
Why You Should Work With Us
We work as a self-driven team without complex management structures. Our teams make independent decisions without recommendations from the client. We nurture an open, transparent environment where we all enjoy our work.