Principal Engineer - Sr. Site Reliability Engineer

$144,400 - $300,000/Yr

Wells Fargo - New York, NY

posted 2 months ago

Full-time - Senior

New York, NY

Credit Intermediation and Related Activities

About the position

The Principal Engineer - Sr. Site Reliability Engineer role at Wells Fargo involves solving complex problems through innovation and advancing Site Reliability Engineering (SRE) practices across multiple applications and business lines. The position focuses on technology transformation, automation, and ensuring high availability of services while collaborating with various teams to enhance operational efficiency and reliability.

Responsibilities

Instantiate Site Reliability Engineering and AIOPs capabilities at Wells Fargo Enterprise Functions Technology (EFT).
Assist in training peer engineers and grow the SRE practice within EFT.
Introduce and mature the adoption of enterprise capabilities, tools, and innovation to improve availability in a multi-cloud ecosystem.
Evolve AIOPS by introducing self-healing and autonomic capabilities to solve complex operational issues.
Automate key SRE metrics and IT Service Operations processes to enhance customer impact and availability.
Share support responsibilities for critical applications and lead technical resolution of high priority incidents.
Conduct blameless post mortems and root cause analysis to introduce continuous improvement.
Collaborate with EFT application development teams to drive stability and SRE aligned capabilities.
Act as an advisor to leadership on complex business and technical needs across multiple groups.
Lead the strategy and resolution of highly complex challenges requiring in-depth evaluation.

Requirements

10+ years of Engineering experience or equivalent through work experience, training, military experience, or education.
7+ years of experience in Java, C#, Python, or other object-oriented software engineering.
5+ years of experience performing engineering and support tasks on Linux/Unix and Windows Servers.
3+ years of experience with Cloud technologies.
3+ years of experience supporting enterprise-level complex applications and platforms in Production.
5+ years of designing and building complex observability solutions.
5+ years working with configuration and monitoring technologies such as Ansible, Grafana, Elastic, Splunk, Prometheus.
Strong verbal, written, and interpersonal communication skills.

Nice-to-haves

A Master's degree or higher in computer science or engineering.
Experience with design, implementation, and governance of AI, Natural Language Processing, or Machine Learning Architecture.
Experience with Agile Scrum and Kanban methodologies.

Benefits

Health benefits
401(k) Plan
Paid time off
Disability benefits
Life insurance, critical illness insurance, and accident insurance
Parental leave
Critical caregiving leave
Discounts and savings
Commuter benefits
Tuition reimbursement
Scholarships for dependent children
Adoption reimbursement

Principal Engineer - Sr. Site Reliability Engineer

About the position

Responsibilities

Requirements

Nice-to-haves

Benefits

Tools

Career Hubs

Guides

Company