Wells Fargo - New York, NY

posted 2 months ago

Full-time - Senior
Credit Intermediation and Related Activities

About the position

The Principal Engineer - Sr. Site Reliability Engineer role at Wells Fargo involves solving complex problems through innovation and advancing Site Reliability Engineering (SRE) practices across multiple applications and business lines. The position focuses on technology transformation, automation, and ensuring high availability of services while collaborating with various teams to enhance operational efficiency and reliability.

Responsibilities

  • Instantiate Site Reliability Engineering and AIOps capabilities at Wells Fargo Enterprise Functions Technology (EFT).
  • Assist in training peer engineers and growing the SRE practice within EFT.
  • Introduce and mature the adoption of enterprise capabilities, tools, and innovation to improve availability in a multi-cloud ecosystem.
  • Evolve AIOps by introducing self-healing and autonomic capabilities to solve complex operational issues.
  • Automate key SRE metrics and IT Service Operations processes to reduce customer impact and improve availability.
  • Share support responsibilities for critical applications and lead technical resolution of high-priority incidents.
  • Conduct blameless postmortems and root cause analyses to drive continuous improvement.
  • Collaborate with EFT application development teams to drive stability and SRE-aligned capabilities.
  • Act as an advisor to leadership on complex business and technical needs across multiple groups.
  • Lead the strategy and resolution of highly complex challenges requiring in-depth evaluation.

Requirements

  • 10+ years of engineering experience, or equivalent demonstrated through work experience, training, military experience, or education.
  • 7+ years of experience in Java, C#, Python, or other object-oriented software engineering.
  • 5+ years of experience performing engineering and support tasks on Linux/Unix and Windows Servers.
  • 3+ years of experience with Cloud technologies.
  • 3+ years of experience supporting complex enterprise-level applications and platforms in production.
  • 5+ years of experience designing and building complex observability solutions.
  • 5+ years of experience working with configuration and monitoring technologies such as Ansible, Grafana, Elastic, Splunk, and Prometheus.
  • Strong verbal, written, and interpersonal communication skills.

Nice-to-haves

  • A Master's degree or higher in computer science or engineering.
  • Experience with design, implementation, and governance of AI, Natural Language Processing, or Machine Learning Architecture.
  • Experience with Agile Scrum and Kanban methodologies.

Benefits

  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Scholarships for dependent children
  • Adoption reimbursement