Senior Site Reliability Engineer

Fidelity Investments, Durham, NC

This job is no longer available


About The Position

Monitors and analyzes performance metrics and application logs using application server technologies such as Tomcat, Node, or Apache. Works with the latest performance testing tools, including LoadRunner, CloudTest, Datadog, Grafana, and JMeter. Supports testing efforts across the multiple business units served by Enterprise Infrastructure (EI), delivering highly scalable, highly available, and resilient services through automation and Infrastructure as Code. Builds ecosystem reliability by applying best practices in Resiliency Engineering, Automation, Observability, and Chaos Testing. Defines and executes a comprehensive, enterprise-wide reliability and observability strategy so that systems are available when customers need them. Ensures platforms support multiple business units and can scale to meet their needs. Coordinates systems using Infrastructure as Code tools (IAM, ARM, Terraform, and Chef). Builds and operates distributed-system services at scale, including their monitoring, logging, and alerting. Implements advanced observability practices and techniques at scale. Configures dashboards in Datadog, Splunk, Grafana, and Prometheus to track system resource utilization and all BPM metrics.
