Comcast - Philadelphia, PA

posted 4 months ago

Full-time - Senior
Philadelphia, PA
Broadcasting and Content Providers

About the position

As the Senior Database Administrator at FreeWheel, a Comcast company, you will play a crucial role in ensuring the availability, performance, and reliability of our database systems. Your primary focus will be on the FreeWheel Database, where you will engage in designing, analyzing, and troubleshooting large-scale distributed systems. This position requires a deep understanding of database management, as you will be responsible for monitoring, emergency response, and capacity planning for the database components of FreeWheel platforms. You will work closely with a diverse team that combines software and technology infrastructure expertise, providing subject matter expertise in database management and resolving complex issues that may arise within the database layer. In this role, you will implement technical solutions aimed at improving the reliability, quality, and efficiency of the database layer. You will review all database data and schema changes, ensuring that all modifications are executed safely and in accordance with our release cadence. Your responsibilities will also include optimizing database parameters for enhanced performance and stability, as well as evolving the database architecture to address high availability, disaster recovery, data consistency, and security needs. You will enforce security measures and support the implementation of all database-related processes, ensuring compliance with best practices. Additionally, you will design and enhance monitoring and alert systems from both system and database perspectives. The position requires you to fulfill on-call duties, potentially extending to 24/7 coverage as needed, to meet service level agreements (SLA) and service level objectives (SLO). You will also develop and implement automation tools to boost operational efficiency and support cybersecurity initiatives, including database patching and incident remediation. Training and coaching junior team members will be part of your responsibilities, ensuring knowledge transfer and skill development within the team. Regular attendance, including the ability to work nights, weekends, and overtime as necessary, is expected.

Responsibilities

  • Be responsible for database reliability and technical operation of FreeWheel Platforms.
  • Implement technical solutions for measurement and improvement on reliability, quality, and efficiency of FreeWheel platforms' DB layer.
  • Review all DB data, schema changes, and application changes requiring database access.
  • Safely execute all DB changes according to release cadence and ad-hoc needs.
  • Optimize database parameters for better performance and stability.
  • Enhance DB architecture evolution, including high availability, disaster recovery, data consistency, security, version selection & upgrade, cost analysis, and control.
  • Enforce security measures, support, review, and implement all DB-related processes.
  • Design, implement, and enhance monitoring & alert systems from system and DB perspectives.
  • Fulfill 12X7 (potentially extend to 24x7 as needed) on-call duties to meet SLA/SLO.
  • Develop and implement DB automation tools to boost operational efficiency.
  • Provide support for the Cybersecurity program needs such as DB patching, vulnerability cleanup, secure server configuration, testing and validation, technical controls implementation and cybersecurity DB incident remediation efforts.
  • Provide training and coaching to peers and more junior SRE team members.
  • Regular, consistent and punctual attendance. Must be able to work nights and weekends, variable schedule(s) and overtime as necessary.

Requirements

  • Bachelor's degree in computer science, a related engineering field, or equivalent practical experience.
  • 7+ years managing, tuning, backing up (full/incremental) MySQL databases (or other popular RDBMS) replication cluster.
  • Expert level MySQL DB knowledge required & preferred experience with Aurora/DynamoDB/MongoDB/Cassandra/ClickHouse/SQL Server/PostgreSQL/Areospike/Snowflake/Hive/HBase/Elasticsearch/NEO4J.
  • Must have software development experience in GoLang or Python. Experience with Ansible, VUE.js, CSS, JavaScript, Java, C/C++ is a plus.
  • Must have AWS Cloud platform experience. GCP & Azure experience is a plus.
  • Knowledge and experiences of Linux/Unix like: Redhat, Debian, CentOS, RockyLinux, and very basic network knowledge (Router, Switch, Firewall etc.) is must to have.
  • Familiar with monitoring tools such as Prometheus, Kibana/Grafana, ELK, Nagios etc.
  • Experienced at working with geographically distributed (WAN based) systems.
  • A working knowledge of Nginx, NetScaler, VPLS, GSLB, Storage (like: PureStorage, Cohesity ...) is a plus.
  • Experienced at troubleshooting, designing and building reliable, fault tolerant systems is a plus.
  • Good communication and collaboration skills, within/across team(s) and continents.

Nice-to-haves

  • Prior experience in supporting business-critical DB services before they go live through activities such as DB system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Technical leadership and influence demonstrated in focused product/tech areas and practices.
  • Prior experience of providing DB technical solutions at an internet company.

Benefits

  • Comprehensive health insurance coverage
  • 401k retirement savings plan
  • Paid time off and holidays
  • Tuition reimbursement
  • Professional development opportunities
  • Flexible work hours
  • Employee discounts on Comcast services
  • Wellness programs
  • Life insurance and disability insurance
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service