Comcast - York, PA

posted 4 months ago

Full-time - Mid Level
York, PA
Broadcasting and Content Providers

About the position

FreeWheel, a Comcast company, is seeking a Cloud Site Reliability Engineer to join its Advanced Advertising organization. This role is pivotal in managing and governing FreeWheel's global cloud infrastructure, primarily utilizing AWS, with plans to expand into GCP and Azure. The ideal candidate will possess a strong background in cloud strategy, design, development, and implementation of large-scale projects. They will be responsible for ensuring the security and efficiency of our cloud operations, leveraging their expertise in emerging technologies and platforms. The Cloud SRE will play a crucial role in enhancing the daily efficiency of cloud operations and supporting ongoing growth within the organization. The responsibilities of the Cloud Site Reliability Engineer include designing and building tooling for scalable and reliable infrastructure, maintaining governance and security standards for AWS, and writing code to support Infrastructure as Code (IaC) and automated incident resolution. The engineer will also participate in on-call rotations, provide guidance to engineering teams on AWS best practices, and collaborate with the Security organization to address threats and misconfigurations. Additionally, the role involves maintaining detailed documentation for AWS configurations and working closely with development teams to understand application requirements and optimize system performance. Candidates should have a Bachelor's degree in computer science or a related field, along with at least three years of experience in Linux, AWS, and Windows administration. Proficiency in configuration management tools such as Cloud Formation, Terraform, and Ansible is essential, as is experience with AWS governance tools. Strong coding skills in languages like Python, Java, or C++ are required, along with a data-driven approach to problem-solving and a desire to learn new technologies. Excellent communication skills are also necessary for this role, which operates on Eastern Standard hours.

Responsibilities

  • Design and build tooling to provide highly scalable, reliable infrastructure using industry best practices.
  • Build and maintain the baseline governance and security standards for AWS infrastructure for all AWS accounts.
  • Write code and scripts to support Infrastructure as Code (IaC), configuration management, and automated incident resolution.
  • Participate in on-call rotations and be an escalation contact for service incidents.
  • Write systems documentation, playbooks, and other instruction manuals.
  • Provide advice and guidance to engineering teams on deploying AWS technologies and architectures based on best practices and security models.
  • Partner with the Security organization to build tools that provide information to react to threats and misconfigurations in our infrastructure.
  • Maintain detailed documentation for AWS configurations, procedures, and troubleshooting steps.
  • Create and maintain Low-Level Design and assist with developing high-level design documents.
  • Keep documentation up to date with changes in the AWS environment.
  • Work closely with development teams to understand application requirements and provide AWS infrastructure support.
  • Collaborate with cross-functional teams to resolve issues and optimize system performance.

Requirements

  • Bachelor's degree in computer science, computer engineering, relevant technical field, or equivalent practical experience.
  • Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
  • At least three (3) years of administration experience with Linux, AWS, and Windows.
  • At least three (3) years of experience in configuration management using Cloud Formation, Terraform, and Ansible or similar.
  • AWS cloud governance experience using AWS Organizations, AWS SSO, EC2 Connect, Guard Duty, IAM, CloudTrail, Security Hub, Config, etc.
  • At least three (3) years of experience coding in higher-level languages, such as Python, Java, or C++.
  • An analytical approach to problem-solving, with a belief that the best decisions are backed by data.
  • DevOps experience solving operational problems with automation, scripting, and software development.
  • Demonstrated desire to learn new technologies and programming languages as responsibilities evolve over time.
  • Strong verbal and written communication skills.

Nice-to-haves

  • Postgraduate degree in computer-related fields.
  • Prior experience with large scale, highly distributed applications.
  • Certified AWS Solutions Architect Pro or DevOps Pro.
  • Certified AWS Security - GCP Architect certified is a plus.

Benefits

  • Comprehensive health insurance coverage.
  • 401k retirement savings plan.
  • Paid time off and holidays.
  • Tuition reimbursement for further education.
  • Professional development opportunities.
  • Employee discounts on products and services.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service