Sapient Razorfish - Irving, TX
posted 22 days ago
The Site Reliability Engineer (SRE) will ensure the reliability, scalability, and availability of services across cloud and on-premises platforms, with a focus on OpenShift and Grafana. The role combines expertise in automation, observability, and infrastructure management to optimize resource allocation and maintain service uptime, particularly for AI/ML and GPU-based workloads.