This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Senior Site Reliability Engineer

Fidelity Investmentsposted about 2 months ago

Mid Level

Durham, NC

About the position

Monitors and analyzes performance metrics and application logs by leveraging application server technologies -- Tomcat, Node, or Apache. Works with the latest performance testing tools -- LoadRunner, CloudTest, Datadog, Grafana and JMeter. Supports testing efforts across multiple business units supported by Enterprise Infrastructure (EI) to deliver services at high scale, high availability with resilience by using automation and Infrastructure Code. Builds ecosystem reliability by applying best practices in Resiliency Engineering, Automation, Observability, and Chaos Testing. Defines and executes a comprehensive reliability and observability strategy, ensuring systems are always available when customers need them across the enterprise. Ensures platforms support and can scale to meet the needs of multiple business units. Coordinates systems using infrastructure code tools (IAM, ARM, Terraform, and Chef). Builds, operates, monitors, logs, and alerts services of distributed systems at scale. Implements advanced observability practices and techniques at scale. Configures dashboards using Datadog, Splunk, Grafana and Prometheus to identify system resource utilization and for all BPM metrics.

Responsibilities

Computes and submits performance Test Reports and Execution Summary using dashboards.
Recommends designs for new systems based on requirements gathered during the requirements analysis phase.
Documents objectives, use cases, requirements, and specifications.
Diagrams business processes and system workflows.
Documents specifications describing solutions to meet requirements.
Establishes project plans for projects of moderate scope.
Supports complex assignments and multi-phase projects.
Performs independent and complex technical and functional analysis for multiple projects.
Troubleshoots stack-wide engineering issues related to hardware, software, network, applications, and cloud service providers.
Configures alerts in PROD regions.
Identifies, removes bottlenecks, and avoids memory leaks in the JVM using monitoring tools.
Monitors and analyzes performance metrics and application logs.
Triages defects with development partners and project management teams.
Works with application architects to identify performance bottlenecks and make tuning recommendations.
Prepares and effectively communicates performance results to Director-level management.
Ensures timely escalation of critical issues to the development, project and performance engineering teams.
Coordinates activities of offshore engineers as/when required.
Coordinates and interprets large datasets using query languages and visualization tools.

Requirements

Bachelor’s degree (or foreign education equivalent) in Computer Science, Engineering, Information Technology, Information Systems, Mathematics, Physics, or a closely related field and three (3) years of experience as a Senior Site Reliability Engineer (or closely related occupation) designing and developing container and Cloud-based platform products and infrastructure solutions within a financial services environment.
Or, alternatively, Master’s degree (or foreign education equivalent) in Computer Science, Engineering, Information Technology, Information Systems, Mathematics, Physics, or a closely related field and one (1) year of experience as a Senior Site Reliability Engineer (or closely related occupation) designing and developing container and Cloud-based platform products and infrastructure solutions within a financial services environment.

Nice-to-haves

Demonstrated Expertise in performance testing Online Transaction Processing Applications and webservices within Java or .NET environments using HP LoadRunner.
DE scripting Web based multi-tier applications using Web HTTP, Web HTML, Webservices, Java, RDP, or Truclient Protocols in HP LoadRunner.
DE designing and developing automated financial applications to classify and extract data from documents in a Windows or Unix environments, using Object Oriented Programming, Spring MVC Framework, Clojure, or Drools programming languages and client-side technologies (Angular.js, Node.js, Bootstrap, or Express.js).
DE developing distributed, rich, low-latency internet applications within the financial services industry, using Angular, JavaScript, Web security technologies (OAuth and SAML), Web services, or Agile methodologies; and performing unit testing of Web applications, using Junit or Karma open-source frameworks.

Job Keywords

Create a Teal account and upgrade to Teal+ to unlock all 58 keywords

Create an Account

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder

Senior Site Reliability Engineer

About the position

Responsibilities

Requirements

Nice-to-haves

Job Keywords

Tools

Career Hubs

Guides

Company