Yoh Services - Fort Worth, TX

posted about 2 months ago

Full-time
Fort Worth, TX
Administrative and Support Services

About the position

The Data Center Operations Engineer plays a critical role in managing the lifecycle of data center operations, ensuring optimal performance and scalability. This position involves engaging in all aspects of the data center lifecycle, including design, build, secure, operate, improve, and maintain processes. The engineer will coordinate data center builds, expansions, and modifications, working closely with internal teams and external partners to ensure seamless integration and adherence to project timelines and specifications. In addition to project coordination, the engineer will lead incident management efforts, conducting root-cause analysis and postmortem reviews to identify underlying issues and drive long-term operational improvements. The role also emphasizes process optimization, where the engineer will analyze process gaps and implement automation solutions to enhance efficiency and reduce operational toil. The position requires participation in a 24/7 on-call rotation, providing critical support during off-hours and responding to emergencies to maintain continuous data center operations. The engineer will oversee inventory and capacity management, ensuring the availability of spare parts and managing data center capacity planning, including space, power, and cooling. Compliance with company policies and industry standards is paramount, as is the ability to investigate and resolve technical issues while analyzing data for trends and systemic problems. Contributing to the global data center knowledge base and leading teams in deploying new infrastructure to support organizational growth are also key responsibilities of this role. The Data Center Operations Engineer must possess a strong technical background, customer focus, and the ability to manage complex projects effectively.

Responsibilities

  • Actively engage in all aspects of the data center lifecycle, including design, build, secure, operate, improve, and maintain processes, ensuring optimal performance and scalability.
  • Coordinate data center builds, expansions, and modifications with internal teams and external partners, ensuring seamless integration and adherence to project timelines and specifications.
  • Lead root-cause analysis and postmortem reviews to identify underlying issues and drive long-term operational improvements, reducing the likelihood of recurrence.
  • Analyze process gaps and implement automation solutions to accelerate execution and minimize manual intervention, thereby improving efficiency and reducing operational toil.
  • Participate in a 24/7 on-call rotation, providing critical support during off-hours and responding to emergencies to maintain continuous data center operations.
  • Oversee the tracking of spare parts inventory, ensuring availability and readiness for all hardware components. Manage data center capacity planning, including space, power, and cooling, to optimize resource utilization.
  • Ensure strict adherence to company policies and procedures, maintaining compliance with industry standards and regulatory requirements.
  • Investigate and resolve technical issues, while analyzing data to identify trends and systemic problems, providing actionable insights for ongoing improvements.
  • Contribute to the development and expansion of the global data center knowledge base, and lead teams in deploying new data center infrastructure to support organizational growth.

Requirements

  • 6+ years of experience in operating technical production environments, with extensive hands-on knowledge of data centers and their critical systems.
  • Unwavering commitment to customer success, ensuring that all operational activities align with client needs and expectations of 100% Site-Up is our goal.
  • Expertise in managing tasks and priorities through a ticketing system, consistently meeting or exceeding SLA targets (Jira preferred).
  • Proven experience in managing complex projects, from conception through completion, ensuring alignment with business objectives and timelines.
  • Skilled in hardware troubleshooting, component replacement, power distribution units, CDU s, racking, stacking, and cabling. Solid understanding of storage devices, Linux, and networking concepts.
  • Ability to lift up to 75 lbs, with a strong understanding of electrical, mechanical, and HVAC systems, essential for maintaining data center infrastructure.
  • Willingness to travel both domestically and internationally as needed, with experience in managing physical site locations to ensure operational readiness.
  • Capable of monitoring repair costs, technician efficiency, and operational metrics, providing data-driven recommendations for continuous improvement.
  • Assist with co-location capacity planning and growth initiatives, including the design and optimization of rack and server layouts.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service