Federal Reserve Bank - Richmond, VA

posted 2 months ago

Part-time, Full-time - Mid Level
Remote - Richmond, VA
Monetary Authorities-Central Bank

About the position

The Federal Reserve Bank of Boston is seeking a Senior Engineer for the SRE / Production Operations team for the FedNow program. This position plays a crucial role in operating the production environment for the FedNow initiative, which is a transformative service that enables financial institutions to provide real-time payment capabilities to their customers. The FedNow Service allows for 24x7x365 real-time gross settlement, ensuring that payments can be sent and received at any time, with immediate access to funds. As part of a strategic effort to evolve Federal Reserve Financial Services (FRFS) into a national, enterprise-focused organization, this role is integral to enhancing the customer experience and supporting the ongoing technical and delivery needs of the FedNow program.

In this role, you will be responsible for architecting, implementing, and leveraging monitoring solutions and tooling for capacity planning, utilization reporting, and scaling. The SRE / Production Operations team utilizes both open-source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions. You will engage in CI/CD and Infrastructure as Code (IaC) pipeline automation design and development, as well as ensure resiliency, disaster recovery (DR), and business continuity planning (BCP) are effectively managed and tested.

The position requires close collaboration with engineers and architects to maintain seamless automation across the FedNow platform. You will proactively identify gaps in system architecture and design experiments to address these issues. The ideal candidate will have a passion for building and maintaining reliable, scalable systems and automating cloud-based applications that are highly available and high performing.

Responsibilities

  • Operate the production environment for the FedNow program.
  • Architect, implement, and leverage solution monitoring and tooling for capacity planning and utilization reporting.
  • Design and develop CI/CD and IaC pipeline automation.
  • Manage resiliency, disaster recovery, and business continuity planning, including testing.
  • Interface with internal stakeholders and customers for planning, delivery, and service management.
  • Own ongoing ITIL processes and drive continuous improvement initiatives.
  • Work closely with Engineers and Architects to maintain seamless automation across the platform.
  • Proactively identify gaps in system architecture and design experiments to expose them.

Requirements

  • Strong communication and collaboration skills.
  • Technical/functional expertise in tooling for ITIL, Agile, Project Management, and SDLC.
  • Extensive knowledge of AWS environments and services such as EC2, EBS, RDS, Aurora, S3, Route 53, ELB, IAM.
  • Experience with HashiCorp Terraform, Consul, and Vault, as well as Ansible.
  • Automation experience, preferably with GitLab.
  • Proficiency in scripting languages, preferably Python, for automated processes.
  • Experience supporting infrastructure for large multi-service applications.
  • Experience working with continuous deployment in microservices architectures.
  • Familiarity with fault injection/experimentation and system attacks.
  • Knowledge of best practices in chaos engineering process and implementation.

Nice-to-haves

  • Experience with monitoring and measuring KPIs with a focus on root cause analysis and corrective action.
  • Familiarity with fault injection tooling such as AWS Fault Injection Simulator, Gremlin, Chaos Toolkit, and Chaos Monkey.
  • Experience with observability tools like CloudWatch, Dynatrace, Grafana, and Prometheus.

Benefits

  • Diverse and inclusive workplace
  • Equal employment opportunities
  • Comprehensive security screening process including background checks and drug screening