Federal Reserve Bank - San Francisco, CA

posted 4 days ago

Part-time,Full-time - Senior
Remote - San Francisco, CA
Monetary Authorities-Central Bank

About the position

As a Sr. /Lead Site Reliability Engineer at the Federal Reserve Bank of San Francisco, you will play a crucial role in managing the systems that support the Cash Application Delivery Services (ADS) applications suite, both on-premises and in the cloud. Your primary focus will be to ensure optimal operation of applications, facilitate quick troubleshooting, and maintain high availability of cloud-based assets. This position emphasizes collaboration with various teams, including development, QA, and DevOps, to enhance system performance and reliability while adhering to security standards.

Responsibilities

  • Establish and run playbooks to support the resolution of incidents in production environments.
  • Help design dashboards for effective monitoring of infrastructure resources in cloud environments.
  • Work with development teams to establish Service-Level Objectives and key Service-Level Indicators.
  • Conduct Production Readiness Reviews to ensure services meet operational readiness standards before going live.
  • Ensure infrastructure aligns with security standards, assist in audits, and implement recommended practices to protect data and systems.
  • Facilitate the design and implementation of Disaster Recovery plans, including backups, failover, and recovery mechanisms.
  • Drive improvement opportunities in infrastructure, tooling, and workflows using a continuous feedback loop between development and CloudOps.
  • Ensure uptime and reliability of cloud-based infrastructure and systems, monitoring performance and maintaining high availability.
  • Participate in incident response and troubleshooting by conducting root cause analysis and implementing solutions to prevent recurrence.
  • Establish thresholds for cloud-based services, set up and maintain monitoring systems, and configure alerts for system anomalies.

Requirements

  • Bachelor's degree in Computer Science, Information Systems, Computer Engineering, Systems Analysis, or a related field, or equivalent work experience.
  • 7+ years of industry experience in building and supporting enterprise-level systems as a platform engineer or equivalent in a production environment (for Lead SRE).
  • 5+ years of hands-on experience implementing, supporting, and using tools and services for software orchestration and environment monitoring (for Senior SRE).
  • Experience in Ansible, GitLab, Terraform, CloudWatch, Dynatrace, Grafana is required.
  • 2+ years of hands-on experience with AWS services, including AWS Lambda, AWS CloudWatch, and AWS X-Ray.

Benefits

  • Medical
  • Dental
  • Vision
  • Pre-tax Flexible Spending Account
  • Backup Child Care Program
  • Pre-Tax Day Care Flexible Spending Account
  • Paid Family Care Leave
  • Vacation Days
  • Sick Days
  • Paid Holidays
  • Pet Insurance
  • Matching 401(k)
  • Retirement/Pension
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service