Electronic Arts - Seattle, WA
posted 4 months ago
As a Site Reliability Engineer at Electronic Arts, Inc. in Seattle, WA, you will play a crucial role in enhancing the performance and reliability of our applications and infrastructure. Your primary responsibilities will include creating monitoring, alerting, and dashboarding solutions that provide improved visibility into application performance and business metrics. You will design and implement CI/CD pipelines to automate operational tasks and build new capabilities, ensuring that our deployment processes are efficient and reliable. In this role, you will deploy and manage Kubernetes clusters and resources across various cloud platforms, including Azure, AWS, and Google Cloud, in a production setting. You will work on Kubernetes cluster deployments within the Cloud fabric and collaborate with partners to onboard and support their infrastructure in Kubernetes across different cloud providers. Your expertise will be essential in automating, optimizing, and driving efficiency in our efforts, code, and processes. You will also be responsible for writing Chef cookbooks and recipes to automate the deployment process and integrate these cookbooks into GitLab for a continuous delivery framework. Collaboration is key, as you will work closely with Title teams to ensure launch readiness and provide support during new Title launches. Additionally, you will troubleshoot complex technical issues and drive innovations that enhance system availability, resilience, and performance. Working with a 24/7 support team and other service lines, you will address issues related to server infrastructure and identify improvements in the availability and performance of various environments supported by the systems engineering team. Telecommuting is permitted for this position.