Google - San Francisco, CA

posted 21 days ago

Full-time - Senior
Remote - San Francisco, CA
Web Search Portals, Libraries, Archives, and Other Information Services

About the position

The Senior Software Developer in Site Reliability Engineering (SRE) at Google Cloud is responsible for managing the lifecycle of services, from inception and design to deployment and operation. This role focuses on optimizing existing systems, building infrastructure, and automating processes to enhance performance and reliability. The SRE team values diversity, intellectual curiosity, and problem-solving, fostering a collaborative and supportive environment for engineers to thrive and innovate.

Responsibilities

  • Engage in and improve the whole lifecycle of services from inception and design to deployment, operation, and refinement.
  • Support services before they go live through system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
  • Scale systems sustainably through automation and evolve systems by advocating for changes that improve reliability and velocity.
  • Practice sustainable incident response and conduct blameless postmortems.

Requirements

  • Proficiency in coding, algorithms, complexity analysis, and large-scale system design.
  • Experience in software development and system optimization.
  • Strong understanding of system capacity and performance management.

Nice-to-haves

  • Experience with cloud infrastructure and services.
  • Familiarity with automation tools and practices.
  • Knowledge of incident response and postmortem analysis.

Benefits

  • Generous parental and caregiver leave.
  • Fertility and growing family support.
  • Flexible work options including a hybrid work model and remote work opportunities.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service