Comcast - Reston, VA

posted 4 months ago

Full-time - Mid Level
Reston, VA
Broadcasting and Content Providers

About the position

FreeWheel, a Comcast company, is seeking a Cloud Site Reliability Engineer to join its Advanced Advertising organization. This role is pivotal in managing and governing FreeWheel's global cloud infrastructure, primarily utilizing AWS, with plans to expand into GCP and Azure. The ideal candidate will possess a strong background in cloud strategy, design, development, and implementation of large-scale projects. They will be responsible for ensuring the efficiency and security of cloud operations, leveraging their expertise in emerging technologies and platforms to enhance business operations. The Cloud SRE will play a crucial role in maintaining the infrastructure's reliability and scalability, directly impacting the company's operational efficiency and growth. The responsibilities of the Cloud Site Reliability Engineer include designing and building tools for scalable and reliable infrastructure, maintaining governance and security standards for AWS, and writing code to support Infrastructure as Code (IaC) and automated incident resolution. The engineer will also participate in on-call rotations, provide documentation and guidance to engineering teams, and collaborate with security teams to address threats and misconfigurations. This position requires a proactive approach to problem-solving and a commitment to keeping documentation current with changes in the AWS environment. The Cloud SRE will work closely with development teams to understand application requirements and optimize system performance, ensuring that FreeWheel's cloud infrastructure remains robust and efficient.

Responsibilities

  • Design and build tooling to provide highly scalable, reliable infrastructure using industry best practices.
  • Build and maintain baseline governance and security standards for AWS infrastructure across all AWS accounts.
  • Write code and scripts to support Infrastructure as Code (IaC), configuration management, and automated incident resolution.
  • Participate in on-call rotations and serve as an escalation contact for service incidents.
  • Write systems documentation, playbooks, and other instructional manuals.
  • Provide advice and guidance to engineering teams on deploying AWS technologies and architectures, focusing on best practices and security models.
  • Collaborate with the Security organization to build tools that provide information to react to threats and misconfigurations in the infrastructure.
  • Maintain detailed documentation for AWS configurations, procedures, and troubleshooting steps.
  • Create and maintain Low-Level Design and assist with developing high-level design documents.
  • Keep documentation up to date with changes in the AWS environment.
  • Work closely with development teams to understand application requirements and provide AWS infrastructure support.
  • Collaborate with cross-functional teams to resolve issues and optimize system performance.

Requirements

  • Bachelor's degree in computer science, computer engineering, relevant technical field, or equivalent practical experience.
  • Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
  • At least three (3) years of administration experience with Linux, AWS, and Windows.
  • At least three (3) years of experience in configuration management using Cloud Formation, Terraform, and Ansible or similar tools.
  • AWS cloud governance experience using AWS Organizations, AWS SSO, EC2 Connect, Guard Duty, IAM, CloudTrail, Security Hub, Config, etc.
  • At least three (3) years of experience coding in higher-level languages, such as Python, Java, or C++.
  • An analytical approach to problem-solving, with a belief that the best decisions are data-backed.
  • DevOps experience solving operational problems with automation, scripting, and software development.
  • Demonstrated desire to learn new technologies and programming languages as responsibilities evolve over time.
  • Strong verbal and written communication skills.

Nice-to-haves

  • Postgraduate degree in computer-related fields.
  • Prior experience with large scale, highly distributed applications.
  • Certified AWS Solutions Architect Pro or DevOps Pro.
  • Certified AWS Security and GCP Architect certified is a plus.

Benefits

  • Comprehensive health insurance coverage.
  • 401k retirement savings plan.
  • Paid time off and holidays.
  • Tuition reimbursement for further education.
  • Professional development opportunities.
  • Flexible scheduling options.
  • Employee discounts on Comcast services.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service