Insight Global - Herndon, VA

posted 2 months ago

Full-time
Remote - Herndon, VA
Administrative and Support Services

About the position

As a Remote Cloud Operations Analyst at Insight Global, you will play a crucial role in managing and overseeing performance and security monitoring tools. Your primary responsibilities will include responding to alerts, triggers, and other warning conditions to ensure the smooth operation of cloud services. You will closely coordinate with the Engineering team to generate root cause analyses (RCAs), update tickets, and resolve problems and incidents within established performance Service Level Agreements (SLAs). Following established documented methods, practices, and standard operating procedures (SOPs) will be essential to deliver effective, efficient, and professional operations support. In this position, you will participate in shift-transition calls to ensure that all open tickets and tasks are properly managed and addressed. You will also be responsible for creating and updating standard operating procedures (SOPs) for Operations and Maintenance (O&M) support. Maintaining the confidentiality, integrity, and availability of data across physical and logical solution boundaries in multi-Agency environments will be a key aspect of your role. Additionally, you will coordinate with government engineering resources and Original Equipment Manufacturers (OEMs) to patch, upgrade, or refresh tool and sensor software and hardware, ensuring that all systems are up to date and functioning optimally.

Responsibilities

  • Manage and oversee the performance and security monitoring tools, responding to alerts, triggers, and other warning conditions.
  • Closely coordinate with Engineering to generate root cause analyses (RCAs), update tickets, and resolve problems and incidents within established performance SLAs.
  • Follow established documented methods, practices, and standard operating procedures (SOPs) to deliver effective, efficient, and professional operations support.
  • Participate on shift-transition calls to ensure all open tickets and tasks are properly managed and addressed.
  • Create and update standard operating procedures (SOPs) for Operations and Maintenance (O&M) support.
  • Maintain the confidentiality, integrity, and availability of data across physical and logical solution boundaries in multi-Agency environments.
  • Coordinate with government engineering resources and OEMs to patch, upgrade or refresh tool and sensor software and hardware.

Requirements

  • 4+ years of experience in a technical discipline, preferably with a Bachelor's Degree in computer science, data science, engineering, applied mathematics, or a closely related field, or equivalent on-the-job experience.
  • Familiarity with and exposure to Elasticsearch and Kibana or other similar data aggregation and analytics platforms.
  • Familiarity with automated monitoring tools such as Dynatrace, Azure Sentinel, Zabbix, Nagios, Datadog, etc.
  • Familiarity with the Elasticsearch and preferably Elastic Cloud Enterprise (ECE) and Elastic Cloud on Kubernetes (ECK) platforms.
  • Strong analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy.
  • Understanding of containerized PaaS platforms such as Azure Kubernetes Service or Elastic Kubernetes Service as well as IaaS hosted platforms such as Docker and Podman.
  • Cloud platform certifications (AWS Practitioner / Sysops admin, Azure Fundamentals / Admin).
  • Security certification such as Security-Solid customer-facing communication skills, both verbal and written.
  • Ability to manage multiple tasks and work with cross-functional teams.
  • Excellent time management and organizational skills with the ability to prioritize workload.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service