Walmart - Bentonville, AR

posted 19 days ago

Full-time - Mid Level
Remote - Bentonville, AR
General Merchandise Retailers

About the position

The Senior Systems and Infrastructure Engineer, Site Reliability role at Walmart focuses on developing and implementing best-in-class Disaster Recovery (DR) solutions. This position is responsible for ensuring that critical business services have adequate protection plans in place to respond efficiently to emergencies. The engineer will work cross-functionally to establish and maintain the technology DR plan, coordinate DR strategies, and lead testing exercises, all while innovating the future of Disaster Recovery at Walmart.

Responsibilities

  • Work with Architects/engineers to build Disaster Recovery solutions.
  • Ensure systems have adequate protection plans for emergencies.
  • Define, plan, and coordinate the creation, reporting, and testing of Disaster Recovery plans.
  • Develop DR framework, including best practices and guidelines.
  • Provide long-term vision and strategic direction for the disaster recovery program.
  • Establish and maintain the technology DR plan, processes, and procedures.
  • Act as a consultant to teams in applying a DR framework to their technology stacks.
  • Assist in setting DR strategy, coordinating DR runbooks, and leading DR test exercises.
  • Assess potential vulnerabilities and develop procedures to minimize downtime.
  • Develop and implement strategies for Disaster Recovery capability in the Cloud and On-premises.
  • Lead Disaster Recovery testing activities including preparation, execution, and documentation.
  • Evaluate and research new technologies to increase DR capability and reduce expenses.
  • Present and discuss pertinent topics and issues appropriate to the audience.

Requirements

  • 4 years of experience in technology infrastructure engineering or related experience.
  • Experience designing and implementing Disaster Recovery Solutions in Azure Cloud and VMware virtual platform.
  • Experience installing, configuring, automating, and monitoring Cloud Services (IaaS, PaaS & SaaS).
  • Experience administering Microsoft Windows Server 2012, 2016, 2019 & Linux operating systems (RHEL & SLES).
  • Experience implementing and managing Disaster Recovery as a Service (DRaaS) in Cloud using Azure Site Recovery.
  • Proficiency in automation/scripting languages such as PowerShell and Azure CLI.
  • Experience supporting Microsoft MSSQL, Oracle & SAP HANA Databases.
  • Experience with Storage & Backup solutions like NetApp and Azure Blob.
  • Familiarity with replication technology such as NetApp SnapMirror.
  • Knowledge of Identity and Access Management including Microsoft Active Directory, Azure RBAC, and Single Sign-On.
  • Understanding of Networking concepts including DNS, DHCP, IP Addressing, Routing, Load Balancer, and VPN.
  • Knowledge of Security measures including Certificate Services, Azure Network Security Group (NSG), and Firewall.

Benefits

  • Competitive pay and performance-based incentive awards.
  • Health benefits including medical, vision, and dental coverage.
  • Financial benefits including 401(k), stock purchase, and company-paid life insurance.
  • Paid time off benefits including PTO, parental leave, family care leave, bereavement, jury duty, and voting.
  • Short-term and long-term disability benefits.
  • Education assistance with 100% company-paid college degrees.
  • Company discounts and military service pay.
  • Adoption expense reimbursement.
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service