Data Center and Network Engineer

$168,000 - $194,400/Yr

Sage City Syndicate, Incorporated - San Jose, CA

posted about 2 months ago

Full-time - Mid Level
San Jose, CA

About the position

As a Data Center and Network Engineer, you will play a crucial role in maintaining and enhancing our public and private cloud-based infrastructure and SaaS applications. This position is integral to ensuring that our systems are highly available and scalable, which is essential for the success of our business operations. You will be part of a team that possesses a wide range of expertise in Systems Engineering, Cloud Infrastructure Management, networking, and monitoring. Your primary responsibility will be to develop, extend, and maintain our mission-critical infrastructure while ensuring its reliability and performance. This role requires collaboration with cross-functional teams to manage our existing and upcoming technology stack effectively.

In your day-to-day activities, you will engage in Linux server administration, both physical and virtual, alongside storage administration, network configuration, and application support. You will be responsible for health and performance monitoring, ensuring quick turnaround times, and maintaining performance levels, availability, and security.

Your tasks will include assisting with hardware rack and stack installations, monitoring rack-level space and power utilization, and capturing data in a centralized location. You will also be tasked with monitoring infrastructure and configuring automated alerts for any arising issues, managing hot spares for production hosts and services, troubleshooting network issues, and performing intermediate-level remote troubleshooting of hardware, OS, and network problems. Additionally, you will conduct daily inspections and maintain comprehensive documentation and diagrams related to the infrastructure.

Responsibilities

  • Handle day-to-day Linux server administration (physical and virtual), storage administration, network configuration, application support, and health and performance monitoring.
  • Ensure quick turnaround times and maintain performance levels, availability, and security.
  • Assist with hardware rack-and-stack and OS installations to maintain our existing data center infrastructure.
  • Keep track of rack-level space and power utilization and capture data in a centralized location.
  • Monitor infrastructure and configure automated alerts for issues as they arise.
  • Manage hot spares for all production hosts and services.
  • Troubleshoot and support network issues.
  • Serve as intermediate-level remote hands for troubleshooting hardware, OS, and network issues.
  • Perform daily visual inspections and proactively report findings to the relevant asset owners.
  • Create and maintain detailed and comprehensive documentation and diagrams.

Requirements

  • 3+ years of experience working in a complex mission-critical data center environment with knowledge of data center, network, and compute deployments at scale.
  • Familiarity with Linux administration and troubleshooting.
  • Working knowledge of system imaging (Kickstart), DHCP, VLANs, and networking.
  • Experience in creating and maintaining setup diagrams and system hand-over, inventory, and spare-management documentation.
  • Experience managing hardware installation, troubleshooting, parts replacements, and vendor support.
  • Availability for off-hours and weekend maintenance activities and for participation in a 24x7 on-call rotation.
  • Experience configuring Cisco or Juniper switches manually or via automated means (e.g., Ansible).

Nice-to-haves

  • Working programming and scripting skills in Python and Bash.
  • Experience with logging and monitoring tools and best practices.
  • Working knowledge of routing protocols such as BGP and OSPF.
  • Working knowledge of firewalls (Fortinet, Cisco ASA) and switches (Juniper).
  • Intermediate-level knowledge of networking concepts (TCP/IP, VLANs, DNS, IPsec).
  • Intermediate-level system admin skills in managing Linux systems.
  • Knowledge of public cloud technologies like AWS/Azure.
  • Proximity to the downtown San Jose office and the data center in the South Bay.

Benefits

  • 21 days paid time off to start.
  • 5 days of paid time off to volunteer and give back to the community.
  • Company Bonus program.
  • Comprehensive medical, dental, and vision coverage.
  • $5,250 tuition reimbursement per calendar year.
  • 401(k) contribution match (100% up to 4%).
  • $360 FitBucks per calendar year.
  • Colleague Stock Purchase Plan.
  • Peer recognition and rewards program.
© 2024 Teal Labs, Inc