As a Site Reliability Engineer at First Citizens Bank, you will play a crucial role in ensuring the performance, reliability, and availability of our critical applications. This remote position is open to candidates located in Arizona or North Carolina.

You will be part of a dedicated team responsible for the availability and performance of customer-facing systems, driving adherence to Service Level Objectives (SLOs) through effective monitoring, alerting, and scaling practices. Your software development expertise in an Enterprise Java environment, particularly with Spring Boot, and in Python for Continuous Integration and Continuous Deployment (CI/CD) pipelines will be essential in maintaining, supporting, and troubleshooting large-scale application and infrastructure deployments.

In this role, you will dig into issues and outages to establish root causes and communicate your findings clearly to business partners. A solid understanding of Site Reliability Engineering concepts and best practices is vital, as you will execute system deployments across various platforms, including AWS, private cloud, and OpenShift. You will also design, document, and implement automated procedures, using scripting tools such as Python or shell to automate system administration tasks.

The role requires a fundamental understanding of Internet networking protocols, including TCP/IP, TLS, DNS, HTTP, and SMTP. You will draw on extensive experience with monitoring and automation tools such as Ansible, GitLab, Splunk, Grafana, and Prometheus to ensure system health and performance.

As a culture champion for SRE best practices, you will communicate effectively with both technical and non-technical staff and promote a collaborative environment. Familiarity with system hardening and security best practices is also beneficial.