Splunk - McLean, VA
posted 3 months ago
Splunk is seeking a Site Reliability Engineer to join the Splunk Cloud Traffic Engineering team. This role is pivotal in scaling and securing our global cloud networking infrastructure, which is essential for hosting Splunk's enterprise software. As a Site Reliability Engineer, you will develop and deploy software that improves the availability, performance, efficiency, and security of Splunk's services. You will work with technologies including DNS, Cloud Connectivity, Load Balancing (IPVS, L4, L7), API Gateways, IP address management (IPAM), VPN, and TLS infrastructure.

In this position, you will collaborate with cloud providers to integrate their networking products into our ecosystem and build a control plane to scale and secure Kubernetes deployments across multiple cloud platforms, such as Amazon Web Services (AWS), Google Cloud Platform, and Microsoft Azure. A strong commitment to automation is essential: you will be expected to embrace and master new technologies to automate routine tasks, leaving more time for innovation. You will also engage in distributed systems programming, working on and debugging systems such as CDNs, Kubernetes, databases, and data replication.

The role requires a focus on technical excellence, applying continuous delivery, testing, and security best practices. Operational excellence is key; you will make data-driven decisions and strive to identify issues before they impact our customers. You should be adept at managing product outages, identifying performance bottlenecks, and determining the root causes of incidents. This position is ideal for someone with a passion for technology and a desire to contribute to a resilient digital world.