Lead Site Reliability Engineer

$110,000 - $190,000/Yr

Royal Bank of Canada - Jersey City, NJ

posted 3 months ago

Full-time - Mid Level
Jersey City, NJ
Credit Intermediation and Related Activities

About the position

The Lead Support Site Reliability Engineer (SRE) at City National Bank (CNB), an RBC company, plays a pivotal role in the development and implementation of Site Reliability Engineering solutions across all applications. This position requires a collaborative approach, working closely with various teams across multiple lines of business and technology partners to ensure the success of the SRE mandate. The ideal candidate will possess advanced knowledge and experience in application development, support, and technology operations, and will be expected to take on a production support role while collaborating with the SRE team in Consumer Banking, Commercial Banking, and Wealth Management.

In this role, the Lead Support SRE will perform application production support, including off-hours support, and spearhead the development of SRE solutions such as monitoring and alerting, machine learning anomaly detection, self-healing, and reliability testing. The individual will be responsible for running the production environment, monitoring availability, and maintaining a holistic view of system health. Additionally, the Lead Support SRE will build software and systems to manage platform infrastructure and applications, improve reliability and quality, and enhance the time-to-market of software solutions.

The position also involves leading incident management and problem management for applications, ensuring compliance with service level objectives, and maintaining technology currency through server patching and certificate renewal. The Lead Support SRE will implement monitoring and alerting systems, support automation solutions, and adopt a design-thinking and agile mindset while collaborating with SREs, Scrum Masters, and partner team leads. Continuous learning and staying abreast of technology changes are essential components of this role, as is the ability to automate tasks to reduce toil and increase operational efficiency.

Responsibilities

  • Perform application production support, including off-hours support.
  • Spearhead the development of SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing).
  • Run the production environment by monitoring availability and taking a holistic view of system health.
  • Build software and systems to manage platform infrastructure and applications.
  • Improve reliability, quality, and time-to-market of our suite of software solutions.
  • Lead and assist in incident management and problem management for applications in scope.
  • Maintain technology currency (manage server patching, certificate renewal, etc.) with a keen eye for automation opportunities.
  • Ensure availability and uptime of applications in scope, as per service level objectives.
  • Ensure compliance of all systems and applications in scope, including maintaining segregation of duties.
  • Implement monitoring and alerting, anomaly detection, self-healing and reliability testing for applications in scope.
  • Support the unit's goals to adopt automation solutions for applications in scope.
  • Adopt a design-thinking and agile mindset in working with SREs, Scrum Masters and partner team leads.
  • Stay abreast of technology change and learn constantly, through official training assignments and self-assigned learning.

Requirements

  • At least 4 years of related experience in application support, software development (SDLC; working knowledge of at least two of C/C++, Java, Golang, Python, .NET), and operations (SRE, DevOps, Cloud, Data).
  • Advanced knowledge of industry practice (Financial Institution) with a focus on SRE.
  • Advanced experience in a variety of environments (Linux, Windows, Databases, Cloud, distributed and mainframe, business workflows, and Services/APIs).
  • Able to automate simple tasks to reduce toil and increase operational efficiency.
  • Hands-on experience in a variety of SRE languages and tools (Ansible, Dynatrace, Moogsoft, PagerDuty, ServiceNow, Elastic, Logstash, Kibana, Blue Prism, Catchpoint, Grafana).
  • Effective negotiation and stakeholder management skills.
  • Excellent communication skills with a direct style.
  • Consumer banking experience.

Nice-to-haves

  • Experience working as an SRE within the Financial Trading Industry.

Benefits

  • Comprehensive Total Rewards Program including competitive compensation, bonuses, and flexible benefits.
  • Continued opportunities for career advancement.
  • World-class sales training, coaching, and development opportunities.
  • Support from a dynamic, collaborative, progressive, and high performing team, as well as world-class tools and training.
  • Opportunity to achieve great success and grow your career with RBC.