Crunchyroll - San Francisco, CA

posted 2 months ago

Full-time - Senior
Remote - San Francisco, CA

About the position

As a Staff Site Reliability Engineer for the Data Engineering team at Crunchyroll, you will be responsible for maintaining and enhancing the reliability of our data infrastructure. This role is crucial for ensuring the availability and performance of our data services, which directly impacts the organization's ability to make informed decisions. You will collaborate with data engineers and software engineers to drive automation and best practices in monitoring and alerting, ultimately supporting millions of anime fans worldwide.

Responsibilities

  • Maintain and enhance the reliability of data infrastructure.
  • Collaborate with data engineers and software engineers to develop automation and best practices.
  • Standardize and implement monitoring and alerting across all datastores.
  • Track key metrics like errors, latency, and throughput.
  • Lead efforts to keep databases up-to-date and implement Infrastructure as Code (IaC).
  • Automate key processes to enhance operational efficiency.
  • Define and document operational requirements and develop incident response processes.
  • Continuously improve load testing and optimize data governance practices.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms.
  • Extensive experience with AWS cloud platform and their data-related services.
  • Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights).
  • Proficiency in one or more programming languages (e.g., Python, Java).
  • Proficiency in automation frameworks (e.g., Terraform, Cloud Formation).
  • Strong understanding of various performance metrics and database internals.
  • Experience in identifying and eliminating bottlenecks in the system.
  • Strong understanding of database systems (e.g., SQL, NoSQL) and managing large-scale data infrastructures.
  • Hands-on implementation of CI/CD pipelines and DataOps practices.

Nice-to-haves

  • Experience with data governance, compliance, and lifecycle management.
  • Ability to own and execute projects while collaborating with the team.

Benefits

  • Great compensation package including salary plus performance bonus earning potential.
  • Flexible PTO and time off policies.
  • Generous medical, dental, vision, STD, LTD, and life insurance options.
  • Health saving account (HSA) program plus healthcare and dependent care FSA programs.
  • Employer match on 401(k) plan.
  • Employer paid commuter benefit for eligible employees.
  • Generous support program for new parents.
  • Pet insurance and pet-friendly offices.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service