This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

iCapitalposted 2 months ago
$120,000 - $160,000/Yr
Full-time • Mid Level
Greenwich, CT
Resume Match Score

About the position

At iCapital the Site Reliability Engineering team is fundamental to ensure our platform delivers consistent, reliable service to our client base. This role will work at the intersection of software engineering and operations, applying engineering principles to infrastructure challenges. This individual will design and implement scalable systems, create observability solutions that offer actionable insights, and develop automation to improve our platform's reliability. iCapital seeks a Site Reliability Engineer who thinks systematically about reliability, can translate business requirements into technical implementations, and thrives on making complex systems more robust.

Responsibilities

  • Design, implement, and maintain service level objectives (SLOs) that align with business goals and customer expectations.
  • Develop observability strategies, focusing on meaningful metrics that drive actionable insights.
  • Architect and implement scalable infrastructure solutions using cloud-native technologies and infrastructure as code.
  • Drive automation initiatives to eliminate toil and improve system reliability.
  • Champion reliability best practices across development teams through consultation and tooling.
  • Design and operation of a Kubernetes environment for container management and orchestration.
  • Lead incident response, conduct thorough postmortems, and drive systematic improvements.
  • Participate in on-call rotations with a focus on continuous service improvement.

Requirements

  • 5+ years of SRE experience or related experience with 3+ years in AWS.
  • Strong experience with container orchestration platforms like Kubernetes and related ecosystem tools.
  • Working knowledge of databases such as MongoDB, Postgres, DynamoDB.
  • Strong foundation in reliability engineering principles and distributed systems behavior.
  • Experience defining and implementing SLOs/SLIs and using them to drive system improvements.
  • Demonstrated ability to design and implement observability solutions that provide actionable insights while minimizing alert fatigue.
  • Coding abilities in at least one IaC language, with Terraform strongly preferred and one programming language such as Python, Ruby or Java with a focus on maintainable, tested code.
  • Understand modern observability practices and experience implementing and maintaining monitoring solutions such as Prometheus/Grafana, Splunk, NewRelic, CloudWatch, and ELK in the cloud.
  • Strong incident response skills with experience leading incident retrospectives and driving improvements.
  • Excellent problem-solving abilities and experience debugging distributed systems.
  • Track record of successfully automating operations and reducing toil.
  • Strong communication skills with ability to explain complex technical concepts to diverse audiences.

Benefits

  • Base salary range of $120,000 to $160,000.
  • Compensation package includes salary, equity for all full-time employees, and an annual performance bonus.
  • Comprehensive benefits package that includes an employer matched retirement plan.
  • Generously subsidized healthcare with 100% employer paid dental, vision, telemedicine, and virtual mental health counseling.
  • Parental leave.
  • Unlimited paid time off (PTO).
  • Flexibility to work remotely on Friday.

Job Keywords

Hard Skills
  • Java
  • Kubernetes
  • MongoDB
  • Prometheus
  • Python
  • 0Bc8Y UsP2dTHq
  • 0E7a1SL
  • 0FQ C9EIJP szH1uJ6Pp7G
  • 318uAsJNQp4F eHhYgW6S3K
  • 9QKCeFt4 VpKJafGrkjmt
  • AQ3hzgS Mb7kqZfRe
  • atMQ9sfdjh
  • bdaz6SVtA QTf6CM302KlS
  • DJ6CgjUEKqMl lPfLvc0VWiqz
  • e4sXnGD 6QAXEjy
  • g6y0n
  • h5ivbxqmOoG TV61g3EA2
  • HthCE8o sNAG7FEDfWk
  • iu2hZRNW
  • JdqGzU63 PeV1XL
  • jZOzY4sK0 rB604HSgTYs2t
  • NUP4zc0jKZiw b3yRmolFMSig
  • PGHf7kjYTX0ur1F XfK CGVOi
  • pXYDh9WUNV GCHaXnwbhvE
  • rX1y5mbSNGJt 3odFQBq4bEgn
  • Ss2FBERqn sCEXc0VK
  • uoQCIMXelPpd fbsX1ZpACn68
  • VCXgKuj96 tuB DMehaSOHVE
  • VXgISvcte AwkQHMY4C
  • xoPA71Sw 5VlDsTp
  • YkmPSBcQbq 1UGgz9TE8D4
Soft Skills
  • UIobRKSx mSEgtqh3
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service