Kforce - Los Angeles, CA
posted 3 months ago
Kforce is immediately adding a full-time Datacenter Site Reliability Engineer in support of our industry-leading technology development client in Los Angeles, CA. Our client is seeking candidates who are driven to make positive changes to the way people live and work by creating breakthrough technology solutions.

The role centers on ensuring the reliability and performance of data center operations, including data monitoring and alerting, data quality assurance, and anomaly detection. The engineer will document team processes and policies, including methods of engagement and Service Level Objectives (SLOs). The engineer will also analyze, design, and implement system-level solutions that remove bottlenecks and improve edge service performance, and will implement monitoring and alerting systems that improve issue detection and response.

The engineer will work in a fast-paced environment and participate in technical operations and rotations in response to performance and reliability issues. Participation in on-call rotations is also required, during which the engineer will resolve or escalate incoming events. A strong understanding of maintaining and operating a Linux and Kubernetes environment is essential for success in this role.