SITE RELIABILITY ENGINEER

$104,000 - $145,600/Yr

Motion Recruitment - Phoenix, AZ

posted 2 months ago

Full-time - Mid Level
Remote - Phoenix, AZ
Administrative and Support Services

About the position

The Site Reliability Engineer (SRE) position is a fully remote role focused on improving the observability of a new cloud platform that a financial client is developing. The company is moving to a cloud-first mindset, and the SRE team, still in its early stages, is tasked with identifying and closing observability gaps across shared services such as MuleSoft, Collibra, Confluent, Kafka, and APIC. The team will not be responsible for day-to-day monitoring; instead, it will build tools and utilities that improve visibility into these services and provide critical metrics for them. The ideal candidate is a problem solver with a strong background in Infrastructure as Code (IaC), configuration management, and cloud technologies, particularly Google Cloud Platform (GCP), although experience with other cloud providers is also acceptable. The role emphasizes hands-on technical skills, particularly with Linux systems, and requires proficiency in scripting, especially in Python. The SRE will play a pivotal role in shaping the observability strategy for the infrastructure and platform and in keeping its systems robust and reliable.

Responsibilities

  • Identify gaps in observability within the infrastructure and platform.
  • Develop tools that send metrics to Splunk and provide key metrics for shared services.
  • Collaborate with the SRE team to enhance the overall observability strategy.
  • Utilize Infrastructure as Code (IaC) and configuration management tools to automate processes.
  • Work hands-on with Linux systems to ensure reliability and performance.
  • Engage in problem-solving to address issues related to cloud services.

Requirements

  • 5+ years of experience working with Linux systems.
  • 3+ years of experience in a DevOps or Site Reliability Engineering position.
  • Proficiency in IaC/configuration management technologies such as Terraform, Ansible, Puppet, or Chef.
  • Experience with at least one major cloud provider (AWS, Azure, GCP).
  • Strong Python scripting experience.
  • Familiarity with monitoring or observability tools.

Nice-to-haves

  • Experience with Kubernetes container orchestration.
  • Experience working with message broker technologies like Kafka, RabbitMQ, or Kinesis.

Benefits

  • Medical Insurance - Four medical plans to choose from for you and your family.
  • Dental & Orthodontia Benefits.
  • Vision Benefits.
  • Health Savings Account (HSA).
  • Health and Dependent Care Flexible Spending Accounts.
  • Voluntary Life Insurance, Long-Term & Short-Term Disability Insurance.
  • Hospital Indemnity Insurance.
  • 401(k), including match, with pre-tax and post-tax options.
  • Paid Sick Leave.
  • Legal and Identity Protection Plans.
  • Pre-tax Commuter Benefit.
  • 529 College Saver Plan.