This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Tenstorrent - Santa Clara, CA

posted about 2 months ago

Full-time
Hybrid - Santa Clara, CA
Merchant Wholesalers, Durable Goods

About the position

As an Acceleration Kernel Developer at Tenstorrent, you will play a crucial role in optimizing low-level workloads, kernel development, and enhancing software performance for machine learning applications. This position involves collaborating with a team of skilled engineers to ensure peak efficiency and high-quality results for clients and users.

Responsibilities

  • Participate in the design, development, and maintenance of kernel-level software components for applications.
  • Develop and optimize kernels and kernel libraries for efficient machine learning and HPC applications.
  • Implement tensor compute and tensor data movement optimization kernels with a heavy focus on optimizations.
  • Analyze and optimize low-level code to improve performance and efficiency, emphasizing tensor optimization.
  • Collaborate with machine learning engineers and data scientists to integrate optimized kernels and low-level routines into machine learning frameworks and pipelines.
  • Identify performance bottlenecks, conduct performance profiling, and develop strategies to address and resolve them.
  • Write comprehensive unit tests, conduct thorough debugging, and ensure the stability and reliability of kernel-level code.
  • Create clear and concise documentation for code, APIs, and best practices to facilitate collaboration within the team.
  • Stay up-to-date with the latest developments in kernel development, tensor optimization, and machine learning to propose innovative solutions and improvements.

Requirements

  • Bachelor's degree in Computer Science, Software Engineering, or a related field.
  • Proven experience in kernel development, with a strong focus on low-level optimizations and tensor optimization.
  • Proficiency in C/C++ programming languages.
  • Familiarity with machine learning frameworks and concepts.
  • Strong problem-solving skills and the ability to analyze and debug complex issues.
  • Experience with performance profiling and optimization tools.
  • Excellent communication and teamwork skills.
  • Self-motivated, detail-oriented, and able to work independently as well as in a team.

Nice-to-haves

  • Experience with GPU programming (CUDA, OpenCL) is a plus.
  • Knowledge of operating system internals is a plus.
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service