Join to apply for the Site Reliability Engineer (SRE) role at Dicetek LLC .
The Site Reliability Engineer (SRE) is responsible for implementing and maintaining highly reliable and scalable applications and services. The primary goal of an SRE is to ensure smooth operation and performance of applications, minimizing downtime and maximizing user experience.
Requirements include :
- 4+ years of relevant experience in platform monitoring, application performance monitoring, problem solving, incident response, troubleshooting, and post-incident analysis.
Key responsibilities :
Automation and Tooling : Develop and maintain automation tools, scripts, and frameworks to streamline system deployment, configuration management, monitoring, and incident response. Automate repetitive tasks to minimize manual intervention.Monitoring and Alerting : Implement monitoring solutions to proactively detect issues. Set up dashboards and alerts for system health, performance, and availability.Incident Response and Troubleshooting : Participate in incident management, conduct root cause analysis, and collaborate with teams to resolve issues efficiently.Performance Optimization : Identify bottlenecks and work with development teams to improve application performance.Security and Compliance : Collaborate with security teams to implement controls, ensure compliance, and perform security audits.Collaboration and Documentation : Foster cross-team collaboration and document system designs, configurations, and procedures.Continuous Improvement : Stay updated with industry trends and drive initiatives to enhance system reliability and scalability.Seniority level : Not Applicable
Employment type : Contract
Job function : Engineering and Information Technology
Industries : IT Services and IT Consulting
Note : This job posting appears active. Referrals can increase your chances of interviewing.
#J-18808-Ljbffr