Cigna - Bloomfield, CT

posted 9 days ago

Full-time
Bloomfield, CT
Insurance Carriers and Related Activities

About the position

The Observability & Alerting Specialist plays a crucial role in ensuring the reliability, availability, and performance of applications within the Medicare Technology Operations domain. This position involves close collaboration with development and operations teams to build and maintain monitoring capabilities and dashboards that align with the company's business objectives. The specialist will also be responsible for troubleshooting issues, implementing automation, and optimizing system performance.

Responsibilities

  • Design and implement observability and alerting solutions across various technology platforms, including real-time and synthetic user monitoring of customer-facing applications, API health, and microservice responsiveness.
  • Collaborate with cross-functional teams to define and establish service level objectives (SLOs) and service level agreements (SLAs) for critical systems.
  • Monitor systems and applications, proactively identifying and resolving performance bottlenecks or availability issues.
  • Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
  • Create and maintain documentation for system architecture, configuration, and troubleshooting procedures.
  • Assist with capacity planning and resource allocation to ensure optimal system performance and scalability.
  • Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability and performance standards.
  • Stay up to date with industry best practices, new technologies, and emerging trends in observability engineering.

Requirements

  • Strong knowledge of Linux/Unix systems and command line tools.
  • Familiarity with cloud platforms like AWS or Azure.
  • Understanding of networking principles and protocols (TCP/IP, HTTP, DNS, etc.).
  • Knowledge of containerization technologies (Docker, Kubernetes) and orchestration tools.
  • Experience with monitoring and logging tools such as Dynatrace, Splunk, Prometheus, or Grafana.
  • Strong problem-solving and troubleshooting skills, with the ability to analyze and resolve complex technical issues.
  • Excellent communication skills.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service