Site Reliability Engineer - 30768

$146,400 - $201,300/Yr

Splunk - McLean, VA

posted 3 months ago

Full-time - Mid Level
McLean, VA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

Splunk is seeking a Site Reliability Engineer to join our Splunk Cloud's Traffic Engineering team. This role is pivotal in scaling and securing our global Cloud networking infrastructure, which is essential for hosting Splunk's enterprise software. As a Site Reliability Engineer, you will be responsible for developing and deploying software that enhances the availability, performance, efficiency, and security of Splunk's services. You will work with various technologies, including DNS, Cloud Connectivity, Load Balancing (IPVS, L4, L7), API Gateways, IP address management (IPAM), VPN, and TLS infrastructure. In this position, you will collaborate with cloud providers to integrate their networking products into our ecosystem and build a control plane to scale and secure Kubernetes deployments across multiple cloud platforms such as Amazon AWS, Google Cloud Platform, and Microsoft Azure. A strong commitment to automation is essential, as you will be expected to embrace and master new technologies to automate routine tasks, allowing more time for innovation. You will also engage in distributed systems programming, working on and debugging systems like CDNs, Kubernetes, databases, and data replication. Your role will require a focus on technical excellence, utilizing continuous delivery, testing, and security best practices. Operational excellence is key; you will make data-driven decisions and strive to identify issues before they impact our customers. You should be adept at managing product outages, identifying performance bottlenecks, and determining the root causes of incidents. This position is ideal for someone with a passion for technology and a desire to contribute to a resilient digital world.

Responsibilities

  • Develop and deploy software to improve the availability, performance, efficiency, and security of Splunk's services.
  • Work with cloud providers to integrate their networking products into our ecosystem.
  • Build a control plane to scale and secure Kubernetes deployments on various cloud platforms.
  • Commit to automation and master new technologies to automate routine tasks.
  • Engage in distributed systems programming and debug systems like CDNs and databases.
  • Utilize continuous delivery, testing, and security best practices for technical excellence.
  • Make data-driven decisions to ensure operational excellence and preemptively identify issues.
  • Manage product outages and identify performance bottlenecks.

Requirements

  • 8+ years of relevant industry experience; Bachelor's degree in Computer Science, Computer Engineering, or equivalent work experience.
  • Experience deploying production cloud networking and infrastructure solutions following DevOps principles.
  • Experience handling SaaS and/or On-prem applications for a large customer base.
  • Experience with one or more public cloud providers (AWS, Azure, GCP).
  • Proficiency in coding with Python and Go.
  • Experience with CI/CD tools (Gitlab, Jenkins) and automation technologies (Terraform or equivalent).
  • Ability to mentor junior engineers and provide technical direction.

Benefits

  • Medical insurance
  • Dental insurance
  • Vision insurance
  • 401(k) plan and match
  • Paid time off
  • Incentive compensation
  • Equity or long-term cash awards
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service