Site Reliability Engineer

Andersen

Site Reliability Engineer

Описание вакансии

Andersen is hiring a Site Reliability Engineer to drive reliability and performance for large-scale digital insurance platforms, enhancing integrations, optimizing cloud systems, and ensuring stable, high-quality service delivery.

The customer is a well-established global organization providing financial protection and risk-management services across various markets. With a diverse portfolio and teams operating in multiple regions, the company supports businesses and individuals through reliable, scalable solutions.

The project focuses on enhancing large-scale digital platforms, improving cloud performance, optimizing integrations, and modernizing systems to support efficient service delivery and ongoing expansion. ​​​​​​​

Responsibilities:

  • Ensuring high availability, performance, scalability, and overall reliability of application infrastructure through proactive monitoring, automation, and continuous improvement.
  • Developing and implementing performance optimization strategies, including code optimization, memory management, load testing, and capacity planning.
  • Implementing and maintaining end-to-end observability, including real-time telemetry, CUJ-level metrics, dashboards, alerts, and actionable reporting.
  • Monitoring Critical User Journeys (CUJs) with product and business teams to improve end-to-end user experience and service reliability.
  • Managing SLIs, SLOs, SLAs, and error budgets across critical services while ensuring uptime and availability targets are consistently met.
  • Implementing next-generation architectural patterns and SRE recommendations to enhance fault tolerance, resilience, and disaster recovery capabilities.
  • Identifying and mitigating reliability risks, proactively addressing issues that may impact availability and minimizing service disruptions.
  • Automating key operational tasks such as deployments, scaling, failover, and remediation, and reducing manual toil through tools and process improvements.
  • Leading incident response efforts, participating in on-call rotations, and driving automated remediation for common failure scenarios.
  • Performing root-cause analysis, conducting blameless post-mortems, and implementing corrective actions to prevent recurring incidents.
  • Creating and maintaining comprehensive runbooks, operational documentation, and guidelines for incident response and system reliability.
  • Collaborating with global and regional digital teams on reliability best practices, mentoring junior SREs, and contributing to the hiring and onboarding of new SRE candidates.

Must-haves:

  • Experience in application support and reliability engineering environments for 6+ years.
  • Strong technical background with proficiency in software development principles, application production support, SDLC best practices, and Agile methodology.
  • Hands-on SRE experience with a strong understanding of SLOs, SLIs, error budgets, incident management, and conducting blameless post-mortems.
  • Solid understanding of application architectures with the ability to analyze systems and identify areas for improvement.
  • Experience working with monitoring, logging, and observability tools to track and optimize application performance.
  • Proficiency in scripting and automation tools (e.g., Python, Bash, Terraform) to reduce toil and improve operational efficiency.
  • Strong incident response and troubleshooting skills with the ability to perform effective root cause analysis.
  • Excellent collaboration and communication skills for working with cross-functional teams and clearly explaining technical concepts.
  • Ability to coach and mentor team members in SRE practices and foster a culture of reliability.
  • Proactive mindset with a focus on continuous improvement to enhance application reliability and performance.
  • Level of English – from Intermediate+ and above.

Reasons why this job would be interesting to you:

  • Experience in teamwork with leaders in FinTech, Healthcare, Retail, Telecom, and others. Andersen cooperates with such businesses as Samsung, Siemens, Johnson & Johnson, BNP Paribas, Ryanair, Mercedes, TUI, Verivox, Allianz, T-Systems, etc..
  • The opportunity to change the project and/or develop expertise in an interesting business domain.
  • Job conditions – you can work both fully remotely and from the office or can choose a hybrid variant.
  • Guarantee of professional, financial, and career growth! The company has introduced systems of mentoring and adaptation for each new employee.
  • The opportunity to earn up to an additional 1,000 USD per month, depending on the level of expertise, which will be included in the annual bonus, by participating in the company's activities.
  • Access to the corporate training portal, where the entire knowledge base of the company is collected and which is constantly updated.
  • Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies).
  • Certification compensation (AWS, PMP, etc).
  • Referral program.
  • English courses.
  • Private health insurance and compensation for sports activities.

Your personal data is protected in accordance with GDPR regulations.

Join us!

Навыки
  • SRE
  • Python
  • Bash
  • Terraform
  • SDLC
  • SLOs
Посмотреть контакты работодателя

Похожие вакансии

Andersen
Полный день
  • Алматы

  • Не указана

Рекомендуем
Andersen
Полный день
  • Алматы

  • Не указана

Рекомендуем
inDrive
Полный день
  • Алматы

  • Не указана

Рекомендуем
Intella
Полный день
  • Алматы

  • до 11000 USD

Sandvik Mining and Construction Kazakhstan LTD

Senior Service Engineer (Rock Processing)

Sandvik Mining and Construction Kazakhstan LTD

Полный день
  • Алматы

  • до 11000 USD

Beeline, ТМ
Полный день
  • Алматы

  • до 11000 USD

Plexy Platform Kazakhstan
Полный день
  • Алматы

  • до 11000 USD

Марс

Data Engineer

Марс

Полный день
  • Алматы

  • до 11000 USD

Quality Engineer (Quality Management Systems)

Филип Моррис Казахстан

Полный день
  • Алматы

  • до 11000 USD

PIXEL NETWORKS KZ
Полный день
  • Алматы

  • до 900000 KZT

Eurasiana
Удаленная работа
  • Алматы

  • до 900000 KZT

Cyber Temple
Полный день
  • Алматы

  • до 900000 KZT

Orbicom, ТОО
Полный день
  • Алматы

  • до 900000 KZT

Epam Kazakhstan (Эпам Казахстан),ТОО

Lead AI Engineer

Epam Kazakhstan (Эпам Казахстан),ТОО

Полный день
  • Алматы

  • до 900000 KZT

Полный день
  • Алматы

  • до 900000 KZT

Удаленная работа
  • Алматы

  • до 900000 KZT

Logycom
Полный день
  • Алматы

  • до 900000 KZT

Plexy Platform Kazakhstan
Полный день
  • Алматы

  • до 900000 KZT

Arbuz Group (Арбуз Груп)

Senior DevOps-инженер

Arbuz Group (Арбуз Груп)

Полный день
  • Алматы

  • до 900000 KZT

Мобайл Телеком-Сервис (Объединенная Компания Tele2/Altel)

Старший инженер в отдел контроля качества сервисов и локальных решений

Мобайл Телеком-Сервис (Объединенная Компания Tele2/Altel)

Полный день
  • Алматы

  • до 900000 KZT

Хотите оставить вакансию?

Заполните форму и найдите сотрудника всего за несколько минут.
Оставить вакансию