Datamaxis LTD - Chicago, IL

posted 20 days ago

Full-time - Mid Level
Chicago, IL

About the position

The Sr. Site Reliability Engineer will be responsible for managing and monitoring systems and infrastructure both on-premises and in the cloud. This role requires a strong understanding of application layers, system design, and various DevOps tools, with a focus on ensuring the reliability and performance of services. The position involves working in a hybrid environment, requiring on-site presence three days a week, and providing 24x7 on-call support as needed.

Responsibilities

  • Manage and monitor systems and infrastructure hosted on-premises and in the cloud.
  • Understand different layers of application and system design, including networking concepts and microservice architectures.
  • Install, configure, test, and maintain complex technical systems and architectures.
  • Administer Kubernetes platform, deployments, and services.
  • Utilize common DevOps tools such as Docker, GitHub, Jenkins, Terraform, SonarQube, and JFrog.
  • Proficiently use at least one APM tool like Datadog, Dynatrace, Splunk Signal Fx, AppDynamics, or Azure Monitor.
  • Write simple to moderately complex scripts and programs for automation, tools, frameworks, dashboards, and alarms, preferably in Bash, Python, Groovy, or PowerShell.
  • Provide 24x7 on-call, 2nd and 3rd level support to troubleshoot day-to-day issues.

Requirements

  • BS/MS in Computer Science, Information Technology, or related disciplines.
  • At least 6 years of experience in software engineering environments with 7 years of cloud and microservices experience.
  • Experience in administering Kubernetes resources and associated AKS services.
  • Understanding of Azure subscriptions and cost models.
  • Good understanding of DevOps and SRE principles and concepts.
  • Strong verbal and written communication skills.
  • Demonstrate self-learning capabilities and initiative in a fast-paced environment.
  • Ability to work effectively and professionally with cross-functional groups and multiple time zones.

Nice-to-haves

  • Preferred certification areas: Azure Cloud Fundamentals, any industry-recognized Site Reliability Engineering or DevOps Certifications.

Benefits

  • Salary range of $125,000 to $140,000 with a 10% yearly bonus.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service