Dairy Farmers of America • posted 3 months ago
Full-time • Mid Level
Kansas City, KS
Food Manufacturing

About the position

Lead efforts on application deployment, reliability, scalability, availability, and performance alongside the engineering and infrastructure teams. Apply software engineering practices to infrastructure and operations problems to create scalable and highly reliable software systems. Work closely with our software development and engineering teams to build mature, production-ready services and applications. As part of the site reliability engineering team, help define our standards for monitoring, alerting, scalability, and production-readiness. Monitor and report on the uptime of our systems and services, the performance of our applications, and the capacity of our platform.

Responsibilities

  • Operational performance and stability: work with other members of assigned value stream to ensure that the in-scope applications/platforms are meeting performance and stability requirements; this includes managing major incidents to mitigation/resolution
  • Problem management: perform post-incident reviews of all major incidents and determine action items required to avoid similar issues/minimize downtime for future incidents
  • Monitoring and metrics: work with the application development team to ensure that assigned applications/platforms have the monitoring and metrics in place to measure performance and stability accurately; identify functional and nonfunctional improvements; act as the operations representative in value stream planning and prioritization sessions to ensure that the operational needs of the assigned applications/platforms are addressed; hold quarterly operational performance reviews with value stream management
  • Operational readiness: ensure that applications/platforms in the value stream are operationally ready for production; this includes an annual review of all SOPs/knowledge articles and a monitoring review for any new feature launch or other significant change that may affect monitoring
  • Release planning and coordination: work with other members of assigned value stream to ensure that the production releases for their in-scope applications/platforms are properly planned and coordinated; this includes holding change/release implementation reviews to ensure thorough and appropriate implementation plans; provide review and sign-off/approval of change tickets for the assigned value stream; represent the value stream in Change Advisory Board meetings; participate in Program Increment Planning sessions as a liaison for operations and infrastructure support; provide information regarding upcoming critical changes to the value stream

Requirements

  • Undergraduate degree in Computer Science or related technical field, or equivalent practical experience
  • 5 to 8 years of hands-on, professional software development experience in building scalable applications that includes experience working in a multi-platform environment and multi-cloud hybrid environments
  • Experience with CI/CD pipeline tools
  • Experience with clustering technologies (High Availability, Resiliency, Reliability and Scaling)
  • Experience with DevOps practices and methodologies
  • Proficiency in design principles of monitoring and alerting systems
  • Proficiency with one or more general purpose programming languages; one or more scripting languages; automation tools; development tools; API interaction and development; and one or more version control systems

Nice-to-haves

  • Cloud providers - Azure, AWS, GCP
  • Operating systems - Linux, Windows
  • General programming languages - Python, Go, TypeScript
  • Scripting languages - Bash, PowerShell
  • CI/CD - Azure DevOps, Flux, GitHub
  • Automation tools - Terraform, Ansible
  • Development - Visual Studio, Git, FastAPI, React, TypeScript, Redis
  • Monitoring and incident response - PagerDuty, ServiceNow, Prometheus, Grafana
  • Infrastructure tools - AKS, Kubernetes, Docker, VMware, Infoblox, Active Directory, Nutanix
  • Network - Cisco, HyperFlex, F5, Azure Front Door, Azure Application Gateway

Job Keywords

Hard Skills
  • Active Directory
  • Ansible
  • Azure Application Insights
  • Azure DevOps
  • TypeScript