Wells Fargo - New York, NY

posted 3 months ago

Full-time - Senior
New York, NY
Credit Intermediation and Related Activities

About the position

The Sr. Site Reliability Engineer (SRE) role at Wells Fargo involves solving complex problems through innovation and technology transformation across multiple applications and business lines. The position focuses on advancing SRE practices, ensuring application availability, and integrating automation and observability into the operational processes. The engineer will collaborate with various teams to enhance system reliability, drive continuous improvement, and lead the adoption of enterprise capabilities in a multi-cloud environment.

Responsibilities

  • Instantiate Site Reliability Engineering and AIOPs capabilities at Wells Fargo Enterprise Functions Technology (EFT).
  • Assist in training peer engineers and grow the SRE practice within EFT.
  • Introduce and mature the adoption of enterprise capabilities, tools, and innovation to improve availability.
  • Evolve AIOPS by introducing self-healing and autonomic capabilities to solve operational issues.
  • Automate key SRE metrics and IT Service Operations processes.
  • Share support responsibilities for critical applications and lead technical resolution of high priority incidents.
  • Conduct blameless post mortems and root cause analysis to introduce continuous improvement.
  • Collaborate with EFT application development teams to drive stability and SRE aligned capability.
  • Act as an advisor to leadership on complex business and technical needs.
  • Lead the strategy and resolution of highly complex challenges across the enterprise.

Requirements

  • 10+ years of Engineering experience or equivalent through work experience, training, military experience, or education.
  • 7+ years of experience in Java, C#, Python, or other object-oriented software engineering.
  • 5+ years of experience performing engineering and support tasks on Linux/Unix and Windows Servers.
  • 3+ years of experience with Cloud technologies.
  • 3+ years of experience supporting enterprise-level complex applications and platforms in Production.
  • 5+ years of designing and building complex observability solutions.
  • 5+ years working with configuration and monitoring technologies such as Ansible, Grafana, Elastic, Splunk, Prometheus.

Nice-to-haves

  • A Master's degree or higher in computer science or engineering.
  • Experience with design, implementation, and governance of AI, Natural Language Processing, or Machine Learning Architecture.
  • Experience with Agile Scrum and Kanban methodologies.

Benefits

  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Scholarships for dependent children
  • Adoption reimbursement
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service