National Radio Astronomy Observatory - Socorro, NM

posted 2 months ago

Full-time - Mid Level
Socorro, NM
Performing Arts, Spectator Sports, and Related Industries

About the position

Join the design and development team creating the software architecture for the Next Generation Very Large Array (ngVLA) at the National Radio Astronomy Observatory (NRAO). The ngVLA project is a transformational radio telescope array with more than 200 radio telescope antennas throughout North America. The telescope presents significant data infrastructure challenges, with real-time data transmission rates in excess of 168 Tbps, sustained data processing of 60 PFLOP/sec, and a data archive that grows by more than 240 PB each year. We are seeking a Data Hardware Infrastructure Engineer to join the ngVLA Computing and Software (CSS) Integrated Product Team (IPT) and to work jointly with the Data Management and Software (DMS) Division within NRAO. The successful applicant will be responsible for designing, and eventually leading a team building and maintaining, the data infrastructure and hardware that supports data collection, storage, processing, and analysis for ngVLA. The data ingestion, transmission, and storage needs of the ngVLA project are significantly larger in scope than any other NRAO projects and require the application of leading-edge technologies. This role is crucial for ensuring that data systems are scalable, reliable, and performant. The successful applicant will be NRAO's subject matter expert on these systems. Initially, the position will be focused on the system design and hardware infrastructure planning. The Data Hardware Infrastructure Engineer will need to design and architect data infrastructure solutions that are scalable, maintainable, secure, and meet the ngVLA requirements over the project lifecycle. In addition, the successful applicant will need to plan the technical infrastructure needed for data storage, processing, and analytics for the ngVLA super-computing data center. The successful applicant will have the opportunity to lead significant design and architectural decisions by interfacing with colleagues at other big data institutions, designing peta/exascale prototyping storage, processing, and software technologies, closely collaborating with other scientists and engineers at NRAO contributing to the design of the ngVLA, and realizing a full implementation of the system during the construction phase. The position will be based in Charlottesville, VA; Albuquerque, NM; or Socorro, NM.

Responsibilities

  • Understand and analyze ngVLA requirements
  • Perform gap analyses
  • Define interfaces and test plans to validate the proposed design
  • Provide cost and budget assessments for different architectural solutions
  • Design hardware infrastructure that supports data flow, caching, processing, distribution, and storage
  • Articulate hierarchical data storage solutions for the ngVLA archive
  • Define the hardware necessary to support 1000+ node parallelization in a proposed on-premises data center, including processors, GPUs, networks, memory
  • Collaborate with existing data centers to explore off-site data processing and storage solutions
  • Propose hardware suggestions for buffering and caching for both the ngVLA correlator and data center
  • Explore cloud platforms for scalable and cost-effective data distribution, processing and storage solutions
  • Work with big data technologies to handle large-scale data distribution and processing
  • Develop and implement data models and database schemas that support ngVLA requirements
  • Collaborate with software architects and system engineers within DMS and ngVLA to ensure design choices align with the next generation data architecture
  • Build collaborations with external institutions with similar needs and goals
  • Work with software engineers, database experts, and system architects to implement the adopted design during the construction phase
  • Liaise with other teams and departments to ensure that team goals and vision align with those of DMS and ngVLA.

Requirements

  • Bachelor's degree in computer science, engineering, or related field
  • At least five years of experience in demonstrated dataflow, system engineering, or related experience
  • Strong knowledge of data transmission, storage, and back-up systems
  • Experience designing, implementing and maintaining infrastructure solutions
  • Experience with data modeling tools
  • Familiar with latest trends in data storage, backup strategies, and network strategies
  • Experience using the Linux CLI
  • Hands-on experience building scalable systems
  • Familiarity developing software in Python or Java
  • Experience debugging and profiling software systems
  • Familiarity with the Atlassian Suite of applications
  • Strong interpersonal and communications skills
  • Experience working with large, existing applications in a distributed team.

Nice-to-haves

  • Strong software development skills in Python or Java
  • Scripting experience with Bash
  • Familiarity with Cloud Services
  • Familiarity with big data technologies
  • Familiarity with storage solutions
  • Experience with time-series visualization tools.

Benefits

  • Excellent paid time off (13 holidays, annual accrual of up to 24 vacation days and 15 sick days, additional time off for doctor/dentist visits, and 8 weeks of paid parental leave)
  • Medical, dental and vision plans effective on the first day of employment
  • Retirement benefit contributes an amount equal to 10 percent of a qualified participant's base pay with no required employee contribution
  • Optional supplemental, tax-deferred plan for employer retirement contributions.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service