Dev Technologyposted about 2 months ago
Mid Level
Ashburn, VA

About the position

Dev Technology Group is recruiting for a Site Reliability Engineer who will be a part of a dynamic, energetic, and mission-oriented team responsible for the continuous monitoring (24x7x365) of multiple applications, responding to alerts and potential issues. This position will cover the Tuesday – Saturday from 11:00 PM – 07:00 AM shift, onsite in Ashburn, 5 days a week. Additionally, the position requires the willingness to work various shifts to accommodate coverage as needed; this role is an hourly paid position. To be successful the Site Reliability Engineer must be able to multitask and manage multiple systems individually or as part of a team. This work includes leveraging automated and manual performance monitoring tools to determine if an issue exists and its severity. A detailed record of all incidents and their resolutions shall be tracked for future evaluation and trending purposes.

Responsibilities

  • Provide outstanding support of the Passenger Services Program Directorate’s (PSPD) suite of customer-facing applications
  • Timely reporting of events and performance statistics; contacting the entity responsible for the application and recommending a resolution
  • Monitor various applications to proactively identify system disruptions and preempt enterprise outages
  • Ensure that required Service Level Agreements (SLAs) are met
  • Notify internal and external departments of performance issues and trends
  • Support maintenance and scheduled outages
  • Review and update tickets with most current status information
  • Understand applications and their interdependencies
  • Incorporate monitoring of any new applications or systems
  • Review and suggest monitoring tools as needed
  • Perform application triage during active incidents
  • Perform mitigation services for Mission Essential applications during scheduled changes and unplanned events
  • Monitor and support scheduled change activity in the production environment and escalate unexpected issues
  • Provide application verification support to support teams upon completion of scheduled changes in the production environment
  • Root Cause Analyst (RCA) and follow up both internal and external
  • Provide updated reports for monthly PMR presentation and review
  • Provide daily executive reports detailing the health of the Passenger Services Program Directorate’s (PSPD) environment and any pending changes which may potentially impact PSPD applications
  • Provide documentation and presentation support as needed
  • Effectively document incidents describing the issue, business impact, root cause and fix actions
  • Identify areas where improvements in processes or documentation will increase the team’s overall proficiency

Requirements

  • At least 5+ years of experience supporting an IT system through troubleshooting, applications maintenance, and network operations
  • ITIL experience with the ability to convey demonstrated strong knowledge in ITIL
  • Experience with automated monitoring and performance management tools (i.e.: Auto Ops, App Dynamics, Splunk, etc.)
  • Outstanding communications skills---both written and oral
  • Ability to work in a collaborative environment as well as manage individual tasks

Nice-to-haves

  • Experience with US Government systems (preferred)
  • ITIL certification preferred; OR willingness to sit for the ITIL certification exam within the first year of hire

Benefits

  • Generous and flexible time-off policy
  • Flexible work schedules and telework options, including remote work availability for eligible projects
  • Career development opportunities including a mentorship program, technical and management training through Dev University, hands-on learning through DevLab, tuition reimbursement, and paid training opportunities
  • Industry-leading benefits including a choice of two health plans that include dental and vision, flexible spending account, commuter benefits, life insurance, and more
  • 401K matching with a 5% matching contribution
  • Regular team and company social events including our annual party, happy hours, fitness challenges, and more
  • A focus on community engagement including company wide support activities, employer match for donations, and time off for volunteer efforts

Job Keywords

Hard Skills
  • Application Development
  • Data Management
  • Mobile Data
  • Reliability Engineering
  • Splunk
  • 2ohgRHeDq NiMjyKS9x3g
  • 728aLMw EXsbPIMaHum1
  • 7WHiufE ft2SDMJZ
  • Cd65JKn8 3NixW2bXFvz
  • cUdMpQ2ifkx UIzhox7KTkq
  • E4jR2fruQP FwE5bzk1
  • gEXz5s bT5Is2QBw4Y
  • IpdKPMGk7bwW acC5IEz2
  • IZtyVmjMYvgL uDbY56tz
  • kjQs1CEtX W82sLg4rJ uXo5w4dGHR7
  • Lbhrdyq8aGug 46uQJcyNzHM
  • loIz2 59DdXLSZxEK
  • MsCHjczVB x1kYo478y36
  • oKRNOv gA9UbiWL7YO
  • qUWysneSoO kroWsPXAgN
  • qXzO1PL qTOQcXZL0jR97h plSQnUw
  • VMY26ovUwsbC meYFAg7Iky5l
  • XNtTSked HrKidb
  • ZwRYdT9si uqn58KTR
Soft Skills
  • zfxIP s39qELNae
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service