The Vanguard Group - Philadelphia, PA

posted 2 months ago

Full-time - Mid Level
Remote - Philadelphia, PA
Securities, Commodity Contracts, and Other Financial Investments and Related Activities

About the position

The Machine Learning Engineer, Specialist position at The Vanguard Group, Inc. in Philadelphia, PA, involves writing complex production-level data pipelines and engineering code essential for machine learning models. The role requires the utilization of PySpark and the AWS Cloud Tech Stack, including Glue and SageMaker, alongside various data frameworks. The specialist will engage in solution architecture while building end-to-end machine learning data pipelines, leveraging their expertise in cloud-based architectures and technologies to deliver optimized machine learning models at scale. In this position, the engineer will be responsible for reviewing and optimizing code produced for artificial intelligence and machine learning models, employing a modern technology stack. The role emphasizes driving engineering innovation through effective collaboration with AI research functions and technology teams, as well as fostering independent innovation ideas. Candidates must possess a Master's degree in Computer Science, Data Science, Machine Learning Engineering, Statistics, or a closely related IT field, along with two years of experience in the job offered or in related IT positions such as Data Engineer or Software Engineer. The ideal candidate will have a strong background in PySpark, Python, and the AWS tech stack, particularly in building machine learning pipelines and utilizing various AWS services and ML packages like XGBoost and TensorFlow. Additionally, familiarity with relational database management systems and data integration tools is essential, as is experience in solution architecture for complex machine learning models. The company operates on a hybrid model, requiring three days in the office and allowing work-from-home options for two days.

Responsibilities

  • Write complex production-level data pipelines and engineering code for machine learning models.
  • Utilize PySpark and AWS Cloud Tech Stack (Glue, SageMaker) for data frameworks.
  • Engage in solution architecture while building end-to-end machine learning data pipelines.
  • Deliver optimized machine learning models at scale using cloud-based architectures and technologies.
  • Review and optimize code for AI/ML models using a modern technology stack.
  • Drive engineering innovation through collaboration with AI research functions and technology teams.

Requirements

  • Master's degree in Computer Science, Data Science, Machine Learning Engineering, Statistics, or a closely related IT field.
  • Two years of experience in the job offered or in IT positions such as Data Engineer or Software Engineer.
  • Proficiency in PySpark and Python.
  • Experience with AWS services and ML packages like XGBoost and TensorFlow.
  • Working knowledge of relational database management systems and data integration tools.
  • Experience with solution architecture for complex machine learning models.
  • Experience in building machine learning data pipelines utilizing AWS tech stack (Glue, SageMaker, Athena).
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service