Wells Fargo - Irving, TX

posted 7 days ago

Full-time - Senior
Irving, TX
Credit Intermediation and Related Activities

About the position

The Sr. Site Reliability Engineer (SRE) role at Wells Fargo involves solving complex problems through innovative solutions that impact change at scale across a diverse environment. The position focuses on advancing SRE practices across multiple applications and business lines, driving technology transformation, and ensuring high availability and observability of applications. The SRE will collaborate with various teams to automate processes, improve operational insights, and lead the strategy for resolving complex challenges in a multi-cloud ecosystem.

Responsibilities

  • Instantiate Site Reliability Engineering and AIOPs capabilities at Wells Fargo Enterprise Functions Technology (EFT).
  • Assist in training skilled peer engineers and grow the SRE practice within EFT.
  • Introduce and mature the adoption of enterprise capabilities, tools, and innovation to improve availability.
  • Evolve AIOPS by introducing self-healing and autonomic capabilities to solve operational issues.
  • Automate key SRE metrics and IT Service Operations processes.
  • Share support responsibilities for critical applications and lead technical resolution of high priority incidents.
  • Conduct blameless post mortems and root cause analysis to introduce continuous improvement.
  • Collaborate with EFT application development teams to drive stability and SRE aligned capability.
  • Act as an advisor to leadership on complex business and technical needs.
  • Lead the strategy and resolution of highly complex challenges across the enterprise.

Requirements

  • 10+ years of Engineering experience or equivalent.
  • 7+ years of experience in Java, C#, Python or other object-oriented software engineering.
  • 5+ years of experience on Linux/Unix and Windows Servers.
  • 3+ years of experience with Cloud technologies.
  • 3+ years of experience supporting enterprise-level complex applications in Production.
  • 5+ years of experience designing and building observability solutions.
  • 5+ years working with configuration and monitoring technologies such as Ansible, Grafana, Elastic, Splunk, Prometheus.
  • Strong verbal, written, and interpersonal communication skills.

Nice-to-haves

  • A Master's degree or higher in computer science or engineering.
  • Experience with AI, Natural Language Processing, or Machine Learning Architecture.
  • Experience with Agile Scrum and Kanban methodologies.

Benefits

  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Scholarships for dependent children
  • Adoption reimbursement
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service