Sr HA-Systems Engineer

$103,050 - $179,700/Yr

Datasite - Ontario, CA

posted 18 days ago

Full-time - Mid Level
Remote - Ontario, CA
Professional, Scientific, and Technical Services

About the position

The High Availability Systems Engineer will be responsible for designing, implementing, and optimizing systems to ensure high availability, reliability, and scalability across Datasite's platforms. This role is crucial for maintaining the performance of mission-critical applications and will involve collaboration with cross-functional teams to develop robust cloud-based solutions.

Responsibilities

  • Architect and build highly available, fault-tolerant systems to support mission-critical applications.
  • Collaborate with cross-functional teams to design scalable, robust, and secure cloud-based solutions.
  • Develop strategies for disaster recovery, data replication, and failover processes.
  • Analyze system performance, identify bottlenecks, and implement optimizations to maximize uptime and performance.
  • Conduct load testing, capacity planning, and performance tuning to meet high availability requirements.
  • Utilize monitoring tools to proactively detect issues and minimize downtime.
  • Develop and maintain infrastructure as code (IaC) using tools like Terraform and Ansible.
  • Implement automation for deployments, scaling, and configuration management to reduce manual intervention and increase system reliability.
  • Lead incident response and root cause analysis for system outages, ensuring quick resolution and prevention of future incidents.
  • Build and maintain robust monitoring, alerting, and diagnostic systems for proactive issue identification.
  • Provide technical leadership, mentorship, and guidance to junior engineers and other team members.
  • Stay updated on the latest trends in high availability and distributed systems, and share knowledge within the team.

Requirements

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • 8+ years of experience in systems engineering, infrastructure architecture, or related fields.
  • Proven track record of designing and implementing highly available, fault-tolerant systems in cloud or on-prem environments.
  • Experience with distributed systems, microservices architecture, and high availability patterns (e.g., active-active, active-passive).
  • Proficiency with cloud platforms (Azure, GCP, AWS) or on-prem data centers, and with cloud-native technologies.
  • Deep knowledge of Linux systems.
  • Experience using monitoring and observability tools (Prometheus, Grafana, Loki, etc.).
  • Strong coding/scripting skills in Python, Go, or Shell for automation.
  • Excellent problem-solving skills with a focus on resilience and scalability.
  • Strong communication skills with the ability to convey complex technical concepts to diverse stakeholders.
  • Ability to work independently and take ownership of projects from inception to deployment.

Nice-to-haves

  • Strong experience with containers and orchestration (Docker, Kubernetes).
  • Familiarity with CI/CD pipelines and DevOps practices.
  • Advanced knowledge of networking, load balancers, and distributed data storage solutions (e.g., Cassandra, Elasticsearch, Kafka).
  • Experience with multi-region deployments and global scaling strategies.
  • Certification in cloud platforms (e.g., AWS Certified Solutions Architect, Google Professional Cloud Architect).
  • Background in security best practices, including compliance frameworks (e.g., SOC 2, ISO 27001).
  • Experience in agile methodologies and DevOps culture.

Benefits

  • Competitive salary and performance-based bonuses.
  • Comprehensive benefits package (health, dental, vision, 401k match).
  • Opportunities for professional growth and career advancement.
  • Flexible work environment, including remote options.
  • A dynamic, collaborative team environment where your ideas matter.