Senior Manager, Site Reliability

$115,000 - $150,000/Yr

Foot Locker - Irving, TX

posted 2 months ago

Full-time - Senior
Irving, TX
Clothing, Clothing Accessories, Shoe, and Jewelry Retailers

About the position

This position will be hybrid (3 days in office in Dallas, TX).

Our global house-of-brands inspires and empowers youth culture. Relentlessly committed to fueling a shared passion for self-expression, we create unrivaled experiences at the heart of the sport and sneaker communities through the power of our people. If you want to be a part of something bigger than you can imagine, you've come to the right place.

Are you passionate about sneakers and eager to make your mark at the dynamic intersection of technology and sneaker culture? Foot Locker, Inc. is on the lookout for a Sneaker-Inspired Sr. Manager for the IT Tools and Observability Services to elevate and revolutionize how we build and run our software. As a Sneaker-Inspired Sr. Manager of a Platform Engineering team, you'll be a creative force in driving innovative solutions at the crossroads of technology and sneaker culture.

The Sr. Manager for the IT Tools and Observability team is responsible for leading a talented team of automation engineers dedicated to transforming the alerting, monitoring, automation, and cognitive compute space within Foot Locker. This team is responsible for facilitating, managing, and providing self-service, “as-a-service” solutions for highly resilient and available application monitoring and observability in a hybrid infrastructure. You will use agile methodologies with the latest in automation practices and tools.

Responsibilities

  • Lead a team of skilled automation engineers to develop our Observability-as-a-Service platform in a hybrid cloud environment.
  • Oversee the creation, orchestration, and automation of monitoring and observability solutions (e.g., effective alerting, monitoring, self-healing) that support availability, reliability, scalability, recoverability, and flexibility.
  • Collaborate with cross-functional technology and business units to understand and continually improve the services provided.
  • Interface with software vendors to evaluate and ensure adequate ongoing support, licensing, and governance.
  • Drive required upgrades, changes, and patching for the tools under your leadership.
  • Lead and manage the performance of a team of platform engineers in the software engineering discipline to deliver high-performance, proactive monitoring and observability solutions to product development teams.
  • Report on team progress, dependencies, and other key metrics to stakeholders and executive leadership.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience.
  • Minimum of 8 years of experience in Information Technology.
  • Minimum of 5 years in Monitoring, Observability, Automation, or equivalent technology experience.
  • Minimum of 8 years of coding experience, with strong expertise in Python, Golang, or Java, and RESTful services, focusing on building high-throughput, high-volume distributed systems.
  • Strong expertise in Unix, container orchestration (e.g., Kubernetes), container runtimes, and optimization.
  • Experience in observability of hybrid host environments and modern cloud-native application architectures using event-driven microservices backed by RESTful APIs.
  • Strong technical acumen in Cloud Architecture, Performance Benchmarking, and Capacity Planning.
  • Experience managing and growing engineers and teams.
  • Proven ability to concentrate, learn technical concepts, and adapt quickly to new technologies.
  • Strong Cloud (AWS, GCP, Azure, etc.) platform knowledge.
  • Understanding of the Software Development Lifecycle and automation using CI/CD pipelines while integrating monitoring & observability throughout.
  • Strong knowledge of systems, networks, hardware, and software from an automation & monitoring perspective.
  • Strong understanding of observability and performance monitoring tools in distributed cloud computing, virtualization, and microservices architectures.

Nice-to-haves

  • Experience within retail or a technology company.
  • Expertise with modern observability technologies like New Relic, Catchpoint, and SolarWinds.
  • Software engineering/development background.
  • Familiarity with IaC (Infrastructure as Code).
  • Ability to work across teams and time zones, managing large infrastructure and cross-organizational projects.

Benefits

  • Employee Discount
  • Paid Time Off
  • Medical | Dental | Vision Coverage
  • 401(k) | Roth 401(k)
  • Stock Purchase Plan
  • Life Insurance
  • Flexible Spending Account
  • Opportunities for Advancement
  • Tuition Reimbursement for Qualified Courses
  • Strong Company Culture
  • Employee Resource Groups