Nvidia - Santa Clara, CA

posted 4 months ago

Full-time - Senior
Santa Clara, CA
Computer and Electronic Product Manufacturing

About the position

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. Our legacy of innovation is driven by great technology—and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Join the team and see how you can make a lasting impact on the world. NVIDIA is looking to hire a deeply technical, creative, and experienced Principal Site Reliability Engineer (SRE) with expertise in Content Delivery Networks (CDN). This role will be crucial in building, supporting, and maintaining the next generation of AI-powered enterprise products that enhance engineering efficiency. Additionally, this role is responsible for the CDN configurations that NVIDIA uses for interacting with all its customers, partners, and employees.

Responsibilities

  • Lead the technical roadmap and cross-organizational projects.
  • Design and implement scalable, reliable, and efficient distributed systems.
  • Manage CDN infrastructures and ensure robust security configurations.
  • Analyze and troubleshoot complex distributed systems, promoting best practices.
  • Innovate to tackle operational challenges and lead AI technology implementation.
  • Mentor junior engineers and promote professional growth.
  • Execute multiple projects efficiently, ensuring timely delivery and high-quality outcomes.

Requirements

  • 15+ years in cloud, platform, or SRE roles, with over 7 years focused on CDN configurations and management.
  • Bachelor's degree or equivalent experience.
  • Strong knowledge of networking concepts and application protocols (TCP/IP, DNS, TLS, HTTP/S) with focused experience on CDNs and HTTP cache/proxy technologies.
  • Proficiency in programming languages such as Python, with skills in automation.
  • Strong infrastructure skills, including networking, DNS, SSL, and firewalls, with expertise in public cloud architecture (AWS, Azure, Google).
  • Advanced troubleshooting skills and proficiency in data analytics using tools like CDN vendor portals and Splunk.
  • Outstanding communication skills, problem-solving, negotiation, and interpersonal skills.
  • Expert-level knowledge of managing and debugging Unix/Linux systems at scale.

Nice-to-haves

  • Passion for and experience with AI methodologies.
  • Systematic problem-solving approach with a proactive attitude.
  • Strong understanding of client behavior, including browsers and automated clients.

Benefits

  • Equity and benefits eligibility based on position and experience.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service