Join to apply for the
Site Reliability Engineer (SRE)
role at
Dicetek LLC . The
Site Reliability Engineer (SRE)
is responsible for implementing and maintaining highly reliable and scalable applications and services. The primary goal of an SRE is to ensure smooth operation and performance of applications, minimizing downtime and maximizing user experience. Requirements include : 4+ years of relevant experience in platform monitoring, application performance monitoring, problem solving, incident response, troubleshooting, and post-incident analysis. Key responsibilities : Automation and Tooling :
Develop and maintain automation tools, scripts, and frameworks to streamline system deployment, configuration management, monitoring, and incident response. Automate repetitive tasks to minimize manual intervention. Monitoring and Alerting :
Implement monitoring solutions to proactively detect issues. Set up dashboards and alerts for system health, performance, and availability. Incident Response and Troubleshooting :
Participate in incident management, conduct root cause analysis, and collaborate with teams to resolve issues efficiently. Performance Optimization :
Identify bottlenecks and work with development teams to improve application performance. Security and Compliance :
Collaborate with security teams to implement controls, ensure compliance, and perform security audits. Collaboration and Documentation :
Foster cross-team collaboration and document system designs, configurations, and procedures. Continuous Improvement :
Stay updated with industry trends and drive initiatives to enhance system reliability and scalability. Seniority level :
Not Applicable Employment type :
Contract Job function :
Engineering and Information Technology Industries :
IT Services and IT Consulting Note : This job posting appears active. Referrals can increase your chances of interviewing.
#J-18808-Ljbffr
Reliability Engineer • Riyadh, Saudi Arabia