Talent.com
عرض العمل هذا غير متوفر في بلدك.
Resident DevOps & Monitoring Engineer - Octopus by RTG

Resident DevOps & Monitoring Engineer - Octopus by RTG

Robusta StudioRiyadh, Riyadh Region, Saudi Arabia
منذ أكثر من 30 يومًا
الوصف الوظيفي

Resident DevOps & Monitoring Engineer - Octopus by RTG

Resident DevOps & Monitoring Engineer - Octopus by RTG

Octopus by RTG is enabling a key partner organization to build their digital hub in Egypt looking for the right pioneers to work on exciting AI Projects.

Octopus is proud to be part of the Robusta Technology Group (RTG), a leading tech consultancy group. With a decade of experience and a successful track record of delivering over 300 projects across Europe, the Middle East, and North America, RTG has established itself as a preferred employer in the Egyptian market. Octopus and Robusta are building a bridge between Europe and Africa, creating tailored hub solutions to connect companies with top talent across the globe.

Octopus is specialized in rapidly assembling remote global tech teams that are fully aligned with the culture and practices of a particular brand. By providing tailored hubs to suit its clients needs, Octopus gives companies all the advantages of remote work and offshoring without all the negatives.

We are looking for a Resident DevOps & Monitoring Engineer to manage the day-to-day deployment, monitoring, and coordination of a vendor solution hosted within a client's secure on-premises environment. This role requires strong cross-functional collaboration skills, a proactive approach to incident handling, and solid experience with DevOps in on-prem setups.

Key Responsibilities :

  • Deployment & Configuration :
  • Package Docker images, maintain Kubernetes manifests and Helm charts
  • Align and manage versions of Postgres, MongoDB, and DB2 connectors
  • Monitoring & Observability :
  • Set up monitoring systems across applications and infrastructure
  • Capture metrics, logs, and traces; configure sensible alert thresholds
  • Internal Service Coordination :
  • Submit, track, and follow up on service requests across IT, Data, Security, Ops, and QA
  • Ensure timely completion and resolution across departments
  • Incident Response & RCA :
  • Detect and resolve production outages or performance issues
  • Lead coordination efforts with on-site teams and drive root cause analysis
  • Stakeholder Management :
  • Act as the communication bridge between client Ops, Security, QA, and the vendor's engineering / product teams
  • Process Optimization & Documentation :
  • Develop deployment guides, runbooks, checklists, and automation scripts
  • Drive process improvement through standardization and documentation
  • On-Call Support :
  • Be available for critical incidents and lead resolution coordination when needed

Requirements

  • Containers & Orchestration :
  • Proficient with Docker, Kubernetes, Podman
  • Strong understanding of networking fundamentals
  • Operating Systems :
  • Experience with RHEL and Windows Server environments
  • Databases :
  • Familiarity with Postgres, MongoDB, DB2, and SQL querying
  • Scripting & Tooling :
  • Python, Git, Pandas
  • Monitoring Expertise :
  • Hands-on experience with implementing observability across metrics, logs, traces, and alerting systems
  • AI / ML Observability (Nice to Have) :
  • Understanding of monitoring ML models (accuracy, drift, hallucinations) and ensuring data pipeline integrity
  • Security & Compliance :
  • Knowledge of secure zones, change management protocols, and vulnerability remediation practices
  • Soft Skills :
  • Strong written and verbal communication
  • Excellent stakeholder management
  • Clear technical documentation and meeting facilitation skills
  • Seniority level

    Seniority level

    Mid-Senior level

    Employment type

    Employment type

    Full-time

    Job function

    Job function

    Other

    Industries

    IT Services and IT Consulting

    Referrals increase your chances of interviewing at Robusta Studio by 2x

    Get notified about new Monitoring Engineer jobs in Riyadh, Riyadh, Saudi Arabia .

    Riyadh, Riyadh, Saudi Arabia 17 hours ago

    Engineer Service Operations Specialist (Infrastructure & Networking exp)

    Electrical Infrastructure Engineer QA / QS Inspection Services

    AI Computing Infrastructure Engineer – GPU & High-Performance Computing

    Application Development Manager (BDM) for water analytical online instrumentation

    Systems Engineer - Secure Access Networking

    VMware Horizon View Core Infrastructure Support Engineer

    Application Development Manager (BDM) for water analytical online instrumentation

    We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

    #J-18808-Ljbffr

    إنشاء تنبيه وظيفي لهذا البحث

    Engineer Engineer • Riyadh, Riyadh Region, Saudi Arabia