OREGON EMPLOYMENT DEPARTMENT - Salem, OR

posted 8 days ago

Part-time - Mid Level
Salem, OR

About the position

The Site Reliability Engineering (SRE) Cloud Architect position is an engineering role focused on designing, developing, and troubleshooting software programs for databases, applications, and networks. This role specifically supports the development of a modernized cloud-native Session Border Controller (SBC) product, utilizing DevOps automation and Agile methodologies. The position requires extensive experience in operational and development roles, particularly in cloud-native environments and CI/CD practices.

Responsibilities

  • Develop and support SRE framework and automation
  • Develop metric collection and analytics for failure events
  • Analyze failure events, breaking failures down by infrastructure layer, service stack, and application component, and by the relationships among them
  • Provide recommendations to improve product development
  • Provide support for components going onto Cloud infrastructure
  • Provide support on other Dev Test and System Test infrastructure
  • Provide best practices on frameworks, automation, and methodologies
  • Be a team player and encourage cross-learning and cross-functional support

Requirements

  • Minimum 10 years of hands-on operational, development, DevOps or SRE experience
  • Experience in a technical leadership role with a history of embracing automated processes, cloud native application design principles and a CI/CD DevOps model
  • Experience with production operations and best practices for deploying quality code in production and troubleshooting issues when they arise
  • Experience with operational support of containerized, microservice-based applications in a production-level Kubernetes environment
  • Experience deploying, configuring, managing, and debugging cloud infrastructure and platform software such as OpenStack and Kubernetes
  • Experience with commercial Kubernetes on-prem products or public cloud managed Kubernetes
  • Experience with cloud-native administration and monitoring technologies such as Docker, Helm, Prometheus, Grafana, EFK/ELK, and Jaeger
  • Knowledge of Infrastructure as Code (IaC), Configuration as Code (CaC), GitOps, and tools such as Terraform, Argo CD, and Flux
  • Experience designing and implementing CI/CD pipelines, platforms and components such as Jenkins
  • Working knowledge of scripting languages such as Python, Perl, or shell
  • Knowledge of configuration management and orchestration tools such as Ansible and Chef
  • Knowledge of version control using Git
  • Knowledge and understanding of REST Architecture and JSON is a plus
  • Experience with application frameworks such as Spring, Helidon, Micronaut, etc. is a plus
  • Experience in Linux/Unix environments
  • Strong troubleshooting skills for complex problems in remote systems
  • Strong communication skills required
  • Strong writing skills required
  • Ability to multi-task and handle changing priorities
  • Excellent team skills, can-do attitude, focus on quality
  • BS or MS in Computer Science, Computer Engineering, or equivalent

Nice-to-haves

  • Experience developing or designing telecommunications software is a plus
  • Experience working in Agile/Scrum development process is a plus