Walmart - Sunnyvale, CA

posted about 1 month ago

Full-time - Principal
Sunnyvale, CA
General Merchandise Retailers

About the position

The Principal, Software Engineer - Cloud Storage Architect at Walmart will focus on designing and optimizing cloud and enterprise storage solutions. This role involves collaborating with various teams to ensure high availability, fault tolerance, and cost-effective storage services across public and private clouds. The architect will leverage data analytics to enhance storage efficiencies and will be responsible for driving architectural decisions that improve performance and reduce costs.

Responsibilities

  • Design, evangelize, advocate for, and optimize cross-functional Cloud and Enterprise Storage solutions that are highly performant and cost-effective.
  • Enable Application teams to follow best practices when deploying on Cloud Platforms, optimizing Storage spend and improving efficiency and performance based on utilization.
  • Initiate and drive significant architectural solutions and decisions for our Public and Private Clouds that will improve efficiency and performance and reduce Cloud Storage spend.
  • Drive cost optimization initiatives across the storage services organization, with a focus on improving performance and next-gen tech stack adoption.
  • Design automation for storage optimization and other processes on various cloud platforms.
  • Partner with architects in Application and other platform teams and ensure alignment of solutions across the stack.
  • Implement comprehensive monitoring solutions to track performance, availability, and cost metrics across the multi-cloud ecosystem.
  • Identify and resolve performance bottlenecks and implement proactive measures to maintain high availability and service continuity.
  • Leverage data analytics and AI tools to predict usage patterns, optimize resource allocations and enhance infrastructure performance.
  • Build, tune, and troubleshoot block, file, and object storage systems in high availability storage clusters.
  • Participate in the on-call escalation chain during incidents and proactively troubleshoot issues.
  • Create and maintain technical documentation for operational readiness.
  • Design and maintain cloud storage best practices.
  • Provide troubleshooting expertise for storage performance, availability and other issues.
  • Train and educate others within Technology about Cloud technologies.
  • Solve business needs by evaluating different storage technology options and vendor products.
  • Develop and integrate provisioning and lifecycle tools for storage services components.
  • Contribute to an environment that promotes and reinforces the highest standards of integrity and ethics.
  • Demonstrate creativity and strength in the face of change, obstacles, or adversity.
  • Adapt to competing demands and shifting priorities.

Requirements

  • Extensive experience with Microsoft Azure and Google Cloud, and an understanding of various cloud storage abstraction layers.
  • Experience supporting large scale, highly available, production Cloud Storage deployments in public and private cloud environments.
  • Experience with Azure Blob Storage, Google Cloud Storage, S3, Azure managed disks, Google persistent disks.
  • Experience with cloud storage services, resource management, and cloud architecture.
  • Experience troubleshooting issues during incidents and driving down MTTR across the platform.
  • Experience with enterprise storage solutions (such as Pure, NetApp, Portworx) is desirable.
  • Experience with software defined storage systems such as Ceph is an added advantage.
  • Software development experience using Python/Go.
  • Experience with Containers (Kubernetes, Docker, etc.).
  • Experience architecting infrastructure solutions for applications.
  • Experience with monitoring, reporting tools and data analytics.
  • Experience with managing cloud budgets and tools for analysis.
  • Good understanding of clustered/distributed systems.
  • Experience working with cloud deployments (scaling, resiliency, load balancing, etc.) and a solid understanding of service monitoring, KPIs, SLAs, and disaster recovery.
  • Deep experience with the Linux ecosystem, automation of common tasks, and configuration of systems monitoring tools.
  • Experience with capacity/performance management, monitoring and tuning.
  • Experience with Network Storage, Replication and Backups (SAN, iSCSI, NFS, etc.) is a plus.
  • Strong interpersonal skills to coordinate with other organizations across the business while managing customer expectations.
  • Bachelor's or master's degree in CS or a similar field of study, or equivalent work experience.
  • Work equivalent of 8+ years of experience in cloud and storage, and more than 15 years of work experience in the software engineering industry.

Nice-to-haves

  • Master's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 3 years' experience in software engineering or related area.
  • Background in creating inclusive digital experiences, with demonstrated knowledge of implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, working with assistive technologies, and integrating digital accessibility seamlessly.

Benefits

  • 401(k) match
  • Stock purchase plan
  • Paid maternity and parental leave
  • PTO
  • Multiple health plans
  • Incentive awards for performance
  • Live Better U education benefit program covering tuition, books, and fees.