Wazifni announces the following vacancy as part of its recruitment services on behalf of a leading global company in the field of medical software solutions:
Senior DevOps Engineer takes absolute ownership of the Kubernetes infrastructure to stabilize, optimize, and scale production clusters for the Health Claim Management Solution. This role drives the migration to a GitOps-driven pipeline, ensuring zero-downtime deployments and deep observability for Java Spring Boot microservices and AI workloads under regional data sovereignty mandates.
- Key Tasks & Responsibilities:
Kubernetes Architecture Management: Design, upgrade, and troubleshoot production-grade Kubernetes clusters, managing CNI networking (Calico/Cilium) and ingress controllers.
GitOps Pipeline Migration: Migrate application deployment infrastructure to a GitOps model using ArgoCD or Flux to align cluster states with Git repositories.
CI Pipeline Optimization: Build and optimize continuous integration pipelines (GitHub Actions/GitLab CI) using caching for Java (Maven) and Python (Poetry) microservices.
Infrastructure as Code Enforcement: Mandate and manage all environment provisioning exclusively through Terraform or OpenTofu, eliminating manual console configurations.
Dependent Resource Automation: Automate high-availability deployments for cloud and messaging resources, including PostgreSQL RDS, Apache Kafka, Redpanda, and Redis.
Observability & SRE Stack Maintenance: Enhance platform monitoring via Prometheus and Grafana while configuring ELK or Loki for unified logging.
Distributed Tracing Configuration: Implement OpenTelemetry and Jaeger to provide distributed tracing for diagnosing latency across microservices.
DevSecOps Integration: Secure the container ecosystem by embedding image scanning (Trivy/Clair), automated secrets management (HashiCorp Vault), and rigid network policies.
Data Sovereignty Compliance: Enforce infrastructure compliance with KSA (NAPHIES) and UAE data laws, ensuring healthcare data remains strictly within regional boundaries.
Production Incident Resolution: Diagnose and remediate critical runtime failures such as CrashLoopBackOff states, OOMKilled errors, and distributed system networking issues.
- Work Conditions:
Regular Employment
Indoor
Damascus Governorate - دمشق، المزة فيلات شرقية
- Job Requirements
Education Degree: Bachelor
Education Specification: Bachelor Degree in Information Technology