HR Pundits - Plano, TX

posted 7 days ago

Full-time - Mid Level
Plano, TX
1-10 employees
Professional, Scientific, and Technical Services

About the position

The Site Reliability Engineer (SRE) role involves ensuring the reliability and performance of systems and applications for a leading IT company. The position requires a strong technical background, particularly in scripting, event-driven engineering, and cloud platforms, with a focus on automation and continuous integration/deployment practices. The SRE will work closely with teams to enhance system reliability and efficiency while fostering a collaborative work environment.

Responsibilities

  • Ensure the reliability and performance of systems and applications.
  • Implement and manage CI/CD pipelines using tools like Gitlab and Jenkins.
  • Utilize automation tools such as Terraform, Ansible, Chef, and Puppet.
  • Manage OS-level containerization and virtualization techniques using Docker, VMware, and Kubernetes.
  • Collaborate with teams to document code and catalogue data transformations.
  • Take ownership of work and deliver results effectively.

Requirements

  • Bachelor's Degree in Computer Science, IT-related field, or equivalent experience.
  • At least 3+ years of scripting experience in Python and JavaScript.
  • 3+ years of event-driven engineering experience, preferably with AIOps using AI/ML platforms/tools.
  • 3+ years of experience with Source Code Management, CI/CD tools, and automation tools.
  • 3+ years of experience building CI/CD pipelines and system testing with Gitlab and Jenkins.
  • 3+ years of experience with containerization techniques using Docker, VMware, and Kubernetes.
  • 3+ years of experience with cloud platforms such as AWS, Azure, and Google Cloud Platform.
  • 5+ years of technical, hands-on experience in AWS Cloud Engineering, 5G ORAN, 5G Core, or Data and Transport Engineering.
  • Excellent communication skills and a team player.

Nice-to-haves

  • 5+ years of experience with platforms like Data Dog, Grafana, ServiceNow, and SolarWinds.
  • Experience with log file analysis using LOKI, Elasticsearch, and Prometheus.
  • Experience with systems tracing using Tempo, Jaeger, and Open tracing.
  • Intermediate understanding of utilizing Rest APIs, Apache Spark, and Kafka.

Benefits

  • Paid leaves
  • Medical insurance
  • Continuous learning opportunities
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service