First Citizens Bank

posted 2 months ago

Full-time - Mid Level
Remote
Credit Intermediation and Related Activities

About the position

As a Site Reliability Engineer at First Citizens Bank, you will play a crucial role in ensuring the performance, reliability, and availability of critical applications. This remote position focuses on maintaining customer-facing systems and driving adherence to service level objectives (SLOs) through effective monitoring and automation. You will work within a dynamic environment that emphasizes innovation in banking technology, particularly in the areas of green tech and life sciences.

Responsibilities

  • Own the availability, performance, and reliability of customer-facing systems.
  • Drive adherence to SLOs through monitoring, alerting, and scaling.
  • Engage in software development in an Enterprise Java environment, utilizing Spring Boot and Python for CI/CD pipelines.
  • Maintain, support, and troubleshoot critical, large-scale application and infrastructure deployments.
  • Analyze and troubleshoot application, operating system, networking, configuration, and performance problems.
  • Understand and apply Site Reliability Engineering concepts and best practices.
  • Execute system deployments in AWS, private cloud, and OpenShift environments.
  • Design, document, and implement automated procedures.
  • Automate system administrative tasks using scripting tools, preferably Python or shell.
  • Utilize monitoring and automation tools such as Ansible, Gitlab, Splunk, Grafana, and Prometheus.
  • Champion SRE best practices and communicate effectively with both technical and non-technical staff.
  • Implement system hardening and security best practices.

Requirements

  • Bachelor's Degree and 2 years of experience in Application Engineering, or High School Diploma/GED and 6 years of experience in Application Engineering.
  • 4+ years of experience in Software Engineering background.
  • 2+ years of experience implementing or following SRE practices.
  • Experience working in a large financial institution or similar environment in scope and complexity.
  • Hands-on experience with deploying and maintaining systems in a containerized environment (public or private cloud).
  • Understanding of performance and availability requirements, with experience collaborating with Software Engineering teams to define deployment, configuration, and monitoring requirements.
  • Ability to create meaningful metrics and alerting for service health monitoring.
  • Experience in reducing manual effort through automation with scripting.
  • Skilled in configuration management and automation frameworks.
  • Proficiency in driving Root Cause Analyses to meaningful improvements.
  • Experience leading troubleshooting efforts with production and non-production systems.

Nice-to-haves

  • Experience with monitoring and automation tools such as Ansible, Gitlab, Splunk, Grafana, and Prometheus.
  • Familiarity with system hardening and security best practices.

Benefits

  • Comprehensive benefits program for full-time associates (20+ hours).
  • Customized offerings designed to support families.
  • Access to various health and wellness programs.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service