Site Reliability Engineer - Early in Career (4 x 10 shifts)

Splunk

posted 2 months ago

Full-time - Entry Level

Remote

Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

Splunk is dedicated to building a safer and more resilient digital world, and as a Site Reliability Engineer (SRE) early in your career, you will play a crucial role in this mission. The Cloud organization at Splunk is focused on developing and maintaining robust platform solutions for the Software as a Service (SaaS) hosting of Splunk's enterprise software. This position is part of the TechOps team, which is responsible for monitoring and resolving issues that affect the availability and performance of Splunk for our cloud customers around the clock. As a member of this team, you will be the authority on customer experience, providing support and guidance to ensure that all technical issues are addressed promptly and effectively. In this fully remote position, you will work 4 x 10 shifts from Wednesday to Saturday, 4 PM to 2 AM. Your primary responsibilities will include providing technical support for the Splunk Cloud fleet, performing impact assessments, documenting issues and remediation steps, and leading support cases. You will also communicate with other TechOps engineers and business partners, assist with complex tasks, and represent the TechOps team in meetings to recommend new procedures and processes. Your role will require you to restore normal service operations quickly during escalated incidents, ensuring a quality customer experience at all times. You will thrive in this role if you have a passion for large complex systems and experience working with distributed systems. You will be expected to think critically about automation and data-driven decision-making, always striving to identify and resolve issues before they impact customers. This position requires a proactive approach to problem-solving and a commitment to maintaining the high standards of service that Splunk is known for.

Responsibilities

Provide technical support for the Splunk Cloud fleet
Perform impact assessments and problem solving according to established procedures
Document issues, remediation steps, and help with follow up problem management
Lead support cases and ensure queue management
Communicate with TechOps engineers and business partners around Cloud through email, chat, and in person
Assist other TechOps engineers on your shift with complex tasks
Represent the TechOps team in meetings and make recommendations on new procedures/processes
Use internal tools to restore normal service operations quickly during escalated incidents
Drive the core values of the company and ensure a quality customer experience
Work nights, weekends, and swing shifts as required

Requirements

Requires a minimum of 2 years of related experience with a technical Bachelor's degree or equivalent practical experience
Ability to obtain an adjudicated Single Scope Background Investigation (SSBI) and SECRET clearance
Experience with monitoring and troubleshooting Splunk environments
Understanding of administering or architecting distributed Splunk environments
Experience with the development and deployment of a hosted cloud environment, preferably Azure
Proficiency in Python, Golang, or Shell for scripting, and Git or similar version control systems
Understanding of systems programming (network stack, file system, OS services) and networking (L2 vs. L3, network architecture, VLANs, etc)
Knowledge of standard methodologies related to security, performance, and disaster recovery
Knowledge of the Linux Operating System

Nice-to-haves

Understanding of monitoring and troubleshooting Splunk environments
Understanding of administering or architecting distributed Splunk environments
Understanding of the development and deployment of a hosted cloud environment, preferably Azure
Experience with Python, Golang or Shell for scripting, and Git or similar version control systems
Knowledge of standard methodologies related to security, performance, and disaster recovery
Knowledge of the Linux Operating System

Benefits

Medical insurance
Dental insurance
Vision insurance
401(k) plan and match
Paid time off
Employee Stock Purchase Plan (ESPP)
Competitive benefits package

Site Reliability Engineer - Early in Career (4 x 10 shifts)

About the position

Responsibilities

Requirements

Nice-to-haves

Benefits

Tools

Career Hubs

Guides

Company