Fidelity Investments - Jersey City, NJ

posted 2 months ago

Full-time - Senior
Jersey City, NJ
Securities, Commodity Contracts, and Other Financial Investments and Related Activities

About the position

The Principal Site Reliability Engineer will be a key member of the TechOps SRE team, collaborating closely with engineering partners to drive initiatives from design to implementation. This role focuses on managing highly available multi-region Kubernetes environments on AWS EKS, supporting mission-critical workloads. The position offers an opportunity to refine technical skills, collaborate across teams, and influence the infrastructure strategies of Fidelity Digital Assets.

Responsibilities

  • Work closely with engineering partners to enable and drive initiatives from design to implementation.
  • Manage and maintain Kubernetes clusters on AWS EKS.
  • Build and deploy Docker images, including Docker Compose.
  • Create and deploy Helm charts and libraries.
  • Author and maintain declarative CI/CD pipelines using Jenkins Core.
  • Craft and maintain logging, monitoring, and alerting capabilities using tools like Datadog and Splunk.
  • Drive the design of highly available, secure, scalable microservices-based applications in AWS.
  • Provide technical leadership to teams of Site Reliability Engineers and Cloud Engineers.
  • Collaborate with risk, product, and engineering team leaders to deploy applications to the cloud.
  • Promote a DevOps mentality and establish development standard methodologies for AWS infrastructure-as-code.

Requirements

  • 5+ years of hands-on experience with AWS in a production environment.
  • Experience building and deploying Docker images including Docker Compose.
  • Production experience running Kubernetes workloads ideally on AWS EKS.
  • Experience managing and maintaining Kubernetes Clusters on AWS EKS.
  • Experience with Confluent or Kafka.
  • Hands-on experience with Jenkins Core, including authoring and maintaining declarative CI/CD pipelines and libraries.
  • Experience with monitoring tools e.g., CloudWatch, Datadog & Splunk Cloud.
  • Proficiency with UNIX operating systems and shell scripting.
  • Experience with Amazon Web Services (AWS), managing services and applications in a large AWS cross-account environment using IAM and federated SSO.
  • Ability to communicate at all levels with strong written and verbal communications.

Nice-to-haves

  • Experience with Apache or Confluent Kafka.
  • Experience with CDN Providers e.g., Akamai.
  • Programming experience, e.g., Python preferred.
  • Experience with distributed version control systems, Git preferred.
  • Experience with the agile software development lifecycle and Kanban preferred.

Benefits

  • Comprehensive health care coverage and emotional well-being support.
  • Market-leading retirement plans.
  • Generous paid time off and parental leave.
  • Charitable giving employee match program.
  • Educational assistance including student loan repayment and tuition reimbursement.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service