Splunk

posted 3 months ago

Full-time - Entry Level
Remote
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

Splunk is dedicated to building a safer and more resilient digital world, and as a Site Reliability Engineer (SRE) early in your career, you will play a crucial role in this mission. The Cloud organization at Splunk is focused on developing and maintaining robust and resilient platform solutions for the Software as a Service (SaaS) hosting of Splunk's enterprise software. This position is part of the TechOps team, which is responsible for monitoring and resolving issues that affect the availability and performance of Splunk for our cloud customers 24/7. As a member of this team, you will be the authority on customer experience, providing support and guidance to ensure that all technical issues are addressed promptly and effectively. In this role, you will work on a 4 x 10 shift schedule from Wednesday to Saturday, 4 PM to 2 AM. Your primary responsibilities will include providing technical support for the Splunk Cloud fleet, performing impact assessments, documenting issues and remediation steps, and leading support cases. You will also communicate with TechOps engineers and business partners regarding cloud-related issues, assist colleagues with complex tasks, and represent the TechOps team in meetings to recommend new procedures and processes. Your goal will be to restore normal service operations as quickly as possible during escalated incidents, ensuring a quality customer experience at all times. You will thrive in this position if you have a passion for large complex systems and enjoy working on distributed systems. You will be expected to think critically about how to automate processes and improve efficiency across thousands of machines. Data-driven decision-making is key, and you will strive to identify and resolve issues before they impact customers. This is a fully remote position, but candidates must be U.S. citizens working on U.S. soil and able to support FedRAMP High requirements.

Responsibilities

  • Provide technical support for the Splunk Cloud fleet
  • Perform impact assessments and problem solving according to established procedures
  • Document issues, remediation steps, and help with follow up problem management
  • Lead support cases and ensure queue management
  • Communicate with TechOps engineers and business partners around Cloud through email, chat, and in person
  • Assist other TechOps engineers on your shift with complex tasks
  • Represent the TechOps team in meetings and recommend new procedures/processes
  • Use internal tools to restore normal service operations during escalated incidents
  • Drive the core values of the company and ensure a quality customer experience
  • Work nights, weekends, and swing shifts as required

Requirements

  • Minimum of 2 years of related experience with a technical Bachelor's degree or equivalent practical experience
  • Ability to obtain an adjudicated Single Scope Background Investigation (SSBI) and SECRET clearance
  • Experience with monitoring and troubleshooting Splunk environments
  • Understanding of administering or architecting distributed Splunk environments
  • Familiarity with the development and deployment of a hosted cloud environment, preferably Azure
  • Experience with Python, Golang, or Shell for scripting, and Git or similar version control systems
  • Understanding of systems programming (network stack, file system, OS services) and networking (L2 vs. L3, network architecture, VLANs, etc)
  • Knowledge of standard methodologies related to security, performance, and disaster recovery
  • Knowledge of the Linux Operating System

Nice-to-haves

  • Understanding of monitoring and troubleshooting Splunk environments
  • Understanding of administering or architecting distributed Splunk environments
  • Understanding of the development and deployment of a hosted cloud environment, preferably Azure
  • Experience with Python, Golang or Shell for scripting, and Git or similar version control systems
  • Knowledge of standard methodologies related to security, performance, and disaster recovery
  • Knowledge of the Linux Operating System

Benefits

  • Medical insurance
  • Dental insurance
  • Vision insurance
  • 401(k) plan and match
  • Paid time off
  • Employee Stock Purchase Plan (ESPP)
  • Competitive benefits package
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service