Nvidia - Santa Clara, CA
posted 4 months ago
NVIDIA is seeking Senior Applied Deep Learning Research Scientists to join our innovative applied deep learning research team, which has been at the forefront of advancements in multi-modal large generative models. Our team has pioneered significant technologies such as Megatron, DLSS, and various audio models including BigVGAN, One TTS Aligner To Rule Them All, RADMM, ZenFlow, and P-Flow. We are dedicated to pushing the boundaries of "Generative AI" by developing state-of-the-art multi-modal generative models that address real-world challenges. If you are enthusiastic about the latest research and technologies that are revolutionizing "Generative AI" and are eager to explore creative new paradigms for multi-modal generative models, particularly focusing on large language models (LLMs) and audio, this position is an excellent opportunity for you. In this role, you will be responsible for researching, designing, implementing, and publishing novel, large-scale generative models that enhance multi-modal LLMs with a specific emphasis on audio. You will design and implement machine learning techniques to adapt generative models for downstream tasks such as audio understanding and synthesis. Additionally, you will construct and curate datasets for large-scale machine learning applications across various domains. Collaboration is key; you will work closely with other team members, research teams, and product teams to integrate your research and developments into industry-leading products. This position offers a unique chance to contribute to groundbreaking research while having a tangible impact on real-world applications.