Dentsply Sirona - Waltham, MA

posted 20 days ago

Full-time - Mid Level
Hybrid - Waltham, MA
Miscellaneous Manufacturing

About the position

Dentsply Sirona, the world's largest manufacturer of professional dental products and technologies, is seeking a Sr. Site Reliability Engineer to join its Waltham, MA location. The role is crucial to ensuring system reliability and performance as part of a global team that provides 24/7 emergency support for products. The engineer will be responsible for restoring services as quickly as possible when downtime occurs, and will work in a hybrid environment that combines remote work with in-office responsibilities. The position sits within the Software Engineering & User Experience (SWUX) organization and reports to the Team Lead of Site Reliability Engineering.

The ideal candidate is an experienced technologist capable of optimizing system performance and driving continuous improvement initiatives. The role involves gathering and analyzing metrics from operating systems and applications to assist in performance tuning and fault finding, as well as partnering with development and operations teams to enhance services through rigorous testing and release procedures.

In addition to system monitoring and management, the engineer will improve existing systems through automation, participate in system design consulting, and balance feature development speed with reliability. The successful candidate will also be expected to measure and optimize system performance, helping the company stay ahead of customer needs and innovate for continual improvement.

Responsibilities

  • Gather and analyze metrics from operating systems and applications for performance tuning and fault finding.
  • Partner with development and operations teams to improve services through rigorous testing and release procedures.
  • Perform root cause analyses and implement solutions to enhance system reliability.
  • Collaborate with architecture teams to optimize system performance.
  • Improve existing systems through automation and uplifts.
  • Participate in system design consulting and platform management.
  • Monitor availability and take a holistic view of system health in the production environment.
  • Build software and systems to manage platform infrastructure and applications.
  • Measure and optimize system performance, pushing capabilities forward and innovating for continual improvement.
  • Provide 24/7 emergency second- and third-level support for products, restoring services as quickly as possible during downtime.

Requirements

  • Bachelor's or Master's degree in Computer Science or Software Engineering, or relevant experience.
  • At least 5 years' experience in a Site Reliability Engineering / Platform Engineering / DevOps role or similar.
  • Excellent troubleshooting skills with proven experience resolving production downtime.
  • Deep understanding of algorithms, data structures, complexity analysis, and software design.
  • Good analytical skills and excellent communication skills; professional English is required, German is a bonus.
  • Google Associate Cloud Engineer certification at minimum; higher-level certifications are a bonus.

Nice-to-haves

  • Experience with Kubernetes and Google Cloud Platform (GCP) as both an administrator and a user.
  • Previous software development experience in Golang, C++, or any modern programming language; Flutter experience is a bonus.
  • Familiarity with monitoring tools (e.g., Datadog) and project tracking software (e.g., Jira).

Benefits

  • Professional development opportunities
  • High-performance culture
  • Innovative work environment
  • Ability to shape the dental industry
  • Work-life balance with a hybrid work model