Talent.com
No longer accepting applications
Senior Site Reliability Engineer (SRE)

Salla, WorkFromHome, Riyadh Region, Saudi Arabia
10 days ago
Job description

We are looking for a Senior Site Reliability Engineer (SRE) to help design, scale, and secure our rapidly growing platform infrastructure. You will work across all critical systems — from customer-facing applications and APIs to internal platforms and data services — ensuring availability, performance, and cost efficiency at scale. You'll be hands‑on with Kubernetes, observability, GitOps, automation, and cloud infrastructure, while partnering closely with application, platform, and data teams to deliver a highly reliable and self‑healing environment. This role is ideal for an engineer who thrives on complex distributed systems, loves to automate everything, and can balance speed, stability, and cost‑efficiency in production.

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field — or equivalent work experience.

Platform & Infrastructure Reliability

  • Design, deploy, monitor, and maintain production workloads across Kubernetes (EKS / AKS / GKE) clusters
  • Build self‑healing, auto‑scaling systems that minimize toil and manual intervention
  • Optimize networking, ingress / egress traffic control, and service mesh for secure & performant communication
  • Design and operate reliable database and storage platforms (SQL, NoSQL, and object stores) in Kubernetes environments
  • Own backup, disaster recovery, replication, and failover strategies to meet RPO / RTO targets for critical data services
  • Optimize storage performance and cost through multi‑tier strategies, hot / cold data separation, and S3 / offloading lifecycle policies
  • Troubleshoot and recover Kubernetes Persistent Volumes confidently during incidents (StorageClasses, CSI drivers, PVC issues)
  • Secure and scale object storage platforms (e.g., MinIO / S3‑compatible) and integrate with workloads for high‑throughput data pipelines
  • Work with block storage (EBS / io2 / gp3) and shared file systems (EFS, NFS) to balance performance, resiliency, and cost

Automation & Delivery

  • Champion GitOps and CI / CD best practices (ArgoCD, Flux, GitHub Actions)
  • Build automation for infrastructure provisioning and upgrades using Terraform, Helm, and Kubernetes Operators
  • Reduce release risk through progressive delivery strategies (blue / green, canary, spot instance rolling updates)

Observability & Incident Response

  • Own the monitoring and alerting stack (Prometheus, Grafana, Loki, VictoriaMetrics, OpenSearch)
  • Lead incident management and postmortems to prevent recurrence
  • Provide real-time visibility into system health, performance, and cost metrics

Security & Compliance

  • Implement least‑privilege IAM policies, secure service‑to‑service communication, and network ACLs / firewalls
  • Enforce Kubernetes RBAC, secret management, and secure image supply chain
  • Participate in audit readiness and compliance efforts

Performance & Cost Optimization

  • Analyze and tune system performance under scale (CPU / memory / IO)
  • Partner with product and platform teams to right‑size clusters, databases, and storage tiers
  • Introduce cost visibility dashboards for engineering leadership.

Preferred Qualifications

  • Experience managing mission‑critical systems at scale (high traffic, multi‑region)
  • Proven cost optimization in cloud / K8s environments
  • Familiarity with service mesh (Istio, Linkerd) or advanced networking / egress control
  • Experience with data platform components (Airflow, Debezium, ClickHouse, etc.) is a plus but not required
  • Strong communication skills and a team player, able to collaborate across engineering, DevOps, security, and product teams

Requirements

  • 8+ years in SRE / DevOps / Infrastructure Engineering roles
  • Deep Kubernetes expertise (multi‑cluster, Helm chart development, advanced networking)
  • Strong experience with GitOps workflows using ArgoCD / Flux
  • Expertise with AWS (preferred) or Azure / GCP, plus Infrastructure‑as‑Code (Terraform, Pulumi, CloudFormation)
  • Advanced knowledge of SQL & NoSQL databases (MySQL / Aurora, PostgreSQL, MongoDB, Redis)
  • Scripting / automation skills in Python, Bash, or Go
  • Solid background in monitoring / observability (Prometheus, Grafana, Loki, ELK / OpenSearch, VictoriaMetrics)
  • Experience with CI / CD at scale and managing production incidents
  • Experience with streaming / messaging (Kafka, RabbitMQ, or similar)

Benefits

  • Comprehensive Training & Development programs
  • Performance‑based Bonus incentives
  • Flexible Work From Home options