Cable
posted 3 months ago
As the Lead Software Engineer - Site Reliability Engineering (SRE) at Comcast, you will play a pivotal role in ensuring the reliability and performance of the FreeWheel platforms. This position requires a deep understanding of large-scale distributed systems, and you will be responsible for their availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. You will design, analyze, and troubleshoot these systems, debug and optimize code, and automate routine tasks.

You will be part of a dynamic team that combines software engineering and technology infrastructure expertise. Your responsibilities will include leading technical solutions to enhance the reliability and efficiency of FreeWheel platforms, supporting high-profile live events, and collaborating closely with developers and tech leads throughout the software release cycle. You will also author infrastructure as code, dedicate approximately 30% of your time to developing tools in Python or Golang, and advocate for best practices in engineering and technical operations.

In this role, you will lead on-call shifts and incident prevention and response efforts, and you will provide training and coaching to junior team members. Your ability to exercise independent judgment and discretion will be crucial as you navigate complex technical challenges and drive improvements in production quality and operational efficiency. This position requires a commitment to working Eastern Standard Time hours, including weekends during on-call rotations, and a proactive approach to problem-solving and collaboration across teams.