Federal Reserve Bank - Dallas, TX
posted 3 months ago
As a Senior Cloud Reliability Engineer in the SRE chapter, you will be accountable for implementing reliability practices using software as means for the cloud foundational product line in the Federal Reserve. The SRE Chapter is part of the Cloud Solutions & Services department and has the overall responsibility for reliability of the numerous cloud foundational environments in the FRS. You will work as part of cloud foundational platform squads to demonstrate and champion site reliability culture and practices and exert technical influence throughout your team. Your role will involve solving reliability issues of cloud platforms using software engineering principles, developing and maintaining automations, scripts, and code associated with automating manual work, and improving the reliability and stability of the cloud platform. Additionally, you will develop, integrate, and maintain synthetics (canaries) code to establish the health of the platform, lead SLIs, SLOs, and Error budgets efforts in collaboration with the product team to instrument and visualize for proactively managing the stability of cloud platforms. You will also implement observability (logs, metrics, traces) and monitoring for cloud foundational platforms, define chaos experiments in collaboration with product owners, and conduct experiments. Furthermore, you will be responsible for developing and mentoring junior engineers in the team, along with other duties as assigned.