Senior Engineer, ML Ops/DevSecOps

$144,501 - $217,311/Yr

Intel - Santa Clara, CA

posted 3 months ago

Full-time - Mid Level
Santa Clara, CA
Computer and Electronic Product Manufacturing

About the position

We are actively seeking a Senior Engineer specialized in ML Ops and DevSecOps! This role requires a comprehensive understanding of machine learning operations, DevSecOps practices, and the integration of security within the CI/CD pipeline. Ideal candidates will have a strong software engineering background, exceptional automation skills, and a passion for improving system reliability and security. This position offers the opportunity to significantly impact our projects by integrating cutting-edge ML and security practices into our operation. In the ML Ops domain, you will be responsible for developing scalable and reliable ML infrastructure, automating ML model deployment, monitoring, and management, and implementing best practices for data versioning, model training, and experimentation. You will collaborate with engineers to streamline the ML workflow and enhance model reproducibility and performance. In the DevSecOps area, your responsibilities will include integrating security measures into the CI/CD pipeline, conducting security audits and vulnerability assessments, and utilizing tools for static code analysis, SAST, and Trivy. You will implement and enforce security standards across development teams and develop and maintain dashboards for monitoring pipeline health and security metrics. You will also design and maintain robust CI/CD pipelines, optimize build processes for enhanced efficiency, and implement strategies for Continuous Build and Test. Automating testing, deployment, and monitoring using GitHub Actions will be a key part of your role, along with analyzing and improving pipeline performance metrics. Collaboration and communication are essential, as you will work closely with cross-functional teams to fulfill their requirements, mentor junior engineers, and participate in architecture reviews with a focus on security and performance. You will effectively communicate project status, risks, and metrics. In project management, you will plan and schedule the development and implementation of multiple long-term and complex projects, often requiring consultative direction. Your problem-solving skills will be utilized to resolve significant issues or analyze unique problems requiring evaluation of intangibles and design innovative solutions. You will make consistent decisions regarding the overall goals and objectives of significant programs or segments of the organization, serving as the primary contact on projects and interacting with senior internal and external personnel on complex or widespread issues requiring coordination across organizational lines.

Responsibilities

  • Develop scalable and reliable ML infrastructure.
  • Automate ML model deployment, monitoring, and management.
  • Implement best practices for data versioning, model training, and experimentation.
  • Collaborate with engineers to streamline the ML workflow.
  • Enhance model reproducibility and performance.
  • Integrate security measures into the CI/CD pipeline.
  • Conduct security audits and vulnerability assessments.
  • Utilize tools for static code analysis, SAST, and Trivy.
  • Implement and enforce security standards across development teams.
  • Develop and maintain dashboards for monitoring pipeline health and security metrics.
  • Design and maintain robust CI/CD pipelines.
  • Optimize build processes for enhanced efficiency.
  • Implement strategies for Continuous Build and Test.
  • Automate testing, deployment, and monitoring using GitHub Actions.
  • Analyze and improve pipeline performance metrics.
  • Work closely with cross-functional teams to fulfill their requirements.
  • Mentor junior engineers and contribute to their professional growth.
  • Participate in architecture reviews with a focus on security and performance.
  • Effectively communicate project status, risks, and metrics.
  • Plan and schedule the development and implementation of multiple long-term and complex projects.
  • Utilize extensive expertise to resolve significant issues or analyze unique problems requiring evaluation of intangibles and design innovative solutions.
  • Make consistent decisions regarding the overall goals and objectives of significant programs or segments of the organization.
  • Serve as the primary contact on projects, interacting with senior internal and external personnel on complex or widespread issues requiring coordination across organizational lines.

Requirements

  • Bachelor's Degree in a STEM related field and 6+ years of experience -OR- a Master's Degree in a STEM related field and 4+ years of experience -OR- a PhD in a STEM related field with 2+ years experience in MLOps or DevOps or DevSecOps roles.
  • Proficiency in Python/Java, or Go.
  • Experience with CI/CD tools like GitHub Actions or Jenkins, or equivalent.
  • Familiarity with Docker, Kubernetes, AWS/Azure or equivalent.
  • Knowledge of machine learning frameworks and libraries.

Nice-to-haves

  • Experience with Infrastructure as Code (IaC) tools like Ansible, BiCep.
  • Familiarity with monitoring and logging tools.
  • Understanding of regulatory requirements and compliance.
  • Certifications such as Certified Kubernetes Administrator (CKA), AWS/Azure Certified DevOps Engineer, or similar.
  • Familiarity with tools and practices for deploying and managing ML models in production environments.
  • Experience with advanced security frameworks or certifications and disaster recovery strategies.

Benefits

  • Competitive pay
  • Stock options
  • Bonuses
  • Health insurance
  • Retirement plans
  • Vacation time
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service