Datacenter Operations Engineer

$164,000 - $327,750/Yr

Nvidia - Santa Clara, CA

posted about 2 months ago

Full-time
Santa Clara, CA
Computer and Electronic Product Manufacturing

About the position

NVIDIA is seeking a motivated Datacenter Operations Engineer to join our IPP (Infrastructure, Planning, and Process) Datacenters team. In this pivotal role, you will be responsible for leading datacenter and lab buildouts, overseeing DC Engineering, and managing related planning activities. Your primary focus will be to develop a comprehensive schedule and execution plan, ensuring that all tasks are driven to completion through various Project Implementation Coordinators (PICs). This position also involves optimizing efficiency within our datacenter environment by influencing bin packing strategies based on collated data, while ensuring that all deployments occur within defined Service Level Agreements (SLAs). Your responsibilities will include handling the buildouts and retrofits of datacenters, which encompasses reviewing requirement gathering processes, evaluating site selection criteria, conducting financial analyses, and securing internal approvals. You will provide recommendations for datacenter and lab layouts, perform network Bill of Materials (BOM) analyses, and develop construction specifications. Additionally, you will maintain our internal Datacenter Infrastructure Management (DCIM) tool to reflect new installations and changes accurately. You will also be tasked with addressing customer concerns related to datacenter floor issues, including racks, site conditions, power, cooling, and any floor accidents. Establishing standards, best practices, and Standard Operating Procedures (SOPs) for datacenters will be a key part of your role. Furthermore, you will assist in the rollout of new datacenter management products and drive audits, such as Energy Audits (ISO50001), while supporting Environmental Health and Safety (EHS) initiatives in SOPs. Collaboration with product and hardware teams will be essential for aligning new systems product specifications and roadmap planning.

Responsibilities

  • Handle the datacenter buildouts/retrofits including reviewing the requirement gathering, review site selection criteria, financial analysis, internal approvals.
  • Provide recommendations in DC/Lab layouts, network BOM analysis, construction specifications.
  • Maintain the internal DCIM tool with the new installs/changes.
  • Address any customer concern for DC floor related issues such as racks, site, power, cooling, floor accidents.
  • Come up with the standards, best practices, SOPs for DCs.
  • Assist in the new DC management product rollout projects.
  • Drive the DC audits such as Energy Audit (ISO50001) and support EHS initiatives in SOPs.
  • Work closely with Product and hardware team for the new systems product specifications & roadmap planning.

Requirements

  • Bachelors Degree in a Tech related Major or equivalent experience.
  • 5+ years of experience leading DC buildouts and operations.
  • 3+ years of hands-on experience in Systems administration.
  • Experience in capacity planning, developing roadmaps.
  • Good interpersonal and presentation skills.

Nice-to-haves

  • Visio and CAD experience for Lab R&D projects and Rack Management.
  • Lab/Datacenter Procurement Experience.
  • Experience with handling PDUs and Power in Labs.
  • Working knowledge of basic SQL queries (for DCIM reporting).

Benefits

  • Equity and benefits eligibility.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service