Lead Site Reliability Engineer

$112,151 - $262,854/Yr

Comcast - York, PA

posted 5 months ago

Full-time - Mid Level
York, PA
501-1,000 employees
Broadcasting and Content Providers

About the position

FreeWheel, a Comcast company, is seeking a Site Reliability Engineer (SRE) to join our team. In this role, you will be responsible for ensuring the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for the FreeWheel platforms. You will engage in designing, analyzing, and troubleshooting large-scale distributed systems, debugging and optimizing code, and automating routine tasks. As part of a diverse team with both software and technology infrastructure backgrounds, you will provide subject matter expertise, resolve complex break/fix scenarios, and collaborate with engineering, vendors, and client services to deliver successful technical solutions. This position requires a high degree of independence and the ability to develop non-routine solutions while following operational practices. Your core responsibilities will include overseeing the reliability and technical operations of the FreeWheel TV Platform Ad-Serving components, leading technical solutions to measure and improve reliability, quality, and efficiency of FreeWheel platforms. You will conduct complex analytical duties in the planning, deployment, testing, and evaluation of FreeWheel products, and support high-profile live events such as the Super Bowl, Olympic Games, March Madness, and FIFA World Cup. You will also be involved in the software release cycle, ensuring that releases are well designed, planned, implemented, and monitored. Additionally, you will lead the design and implementation of infrastructure as code, advocate for engineering best practices, and provide training and coaching to peers and junior SRE team members. This role requires a commitment to regular attendance and the ability to work nights and weekends as necessary.

Responsibilities

  • Be responsible for reliability and technical operations of FreeWheel TV Platform Ad-Serving component(s).
  • Lead technical solutions in measuring and improving reliability, quality, and efficiency of FreeWheel platforms.
  • Conduct complex analytical duties in the planning, deployment, testing, and evaluation of FreeWheel products.
  • Support FreeWheel powered live events such as Super Bowl, Olympic Games, March Madness, and FIFA World Cup.
  • Engage in the software release cycle, working closely with developers and tech leads to ensure software releases are well designed, planned, implemented, released, and monitored.
  • Lead in design and implementation of infrastructure as code with best practices, tool use, and quality assurance.
  • Lead technical solutions for infrastructure and application management, monitoring, and operations with a focus on standardization and automation.
  • Perform code level debugging on issues escalated to the team.
  • Lead on-call shifts, incident prevention, response, and retrospectives.
  • Advocate for engineering and technical operations procedures, policies, processes, and SRE best practices.
  • Partner with developers and vendors to identify and drive improvements in production quality, operational efficiency, and engineering productivity.
  • Provide support for the Cybersecurity program needs, including patching, vulnerability cleanup, secure server configuration, and incident remediation efforts.
  • Provide training and coaching to peers and junior SRE team members.

Requirements

  • Bachelor's degree in computer science, a related engineering field, or equivalent practical experience.
  • 7 years of experience in software engineering with programming languages such as Python, Golang, or JavaScript.
  • 5 years of technical operation experience for business-critical applications over public cloud services, preferably AWS.
  • 5 years of experience with SDLC tools including Containers, Kubernetes, Docker, Salt/Ansible/Chef/Puppet, Jenkins, and Git.
  • Experience in Linux administration, network security, and system infrastructure.
  • Excellent communication and collaboration skills across teams and continents.

Nice-to-haves

  • Prior experience in supporting business-critical services before they go live through system design consulting and capacity planning.
  • Demonstrated technical leadership and influence in focused product/tech areas.
  • Prior experience in providing technical solutions at an internet company.

Benefits

  • Comprehensive health insurance coverage.
  • 401(k) retirement savings plan with company matching.
  • Paid time off including holidays and vacation days.
  • Tuition reimbursement for further education.
  • Flexible work hours and remote work options.
  • Employee discounts on Comcast services.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service