Pubmatic - Redwood City, CA
posted about 2 months ago
PubMatic is seeking a Senior Machine Learning Engineer with big data experience who can work on building the next generation ML platform. The ideal candidate is a self-motivated problem solver with a strong background in big data tech stack, software design, and development. If you get excited about building a highly impactful machine learning platform that processes large datasets in a creative and fast-paced open cultured environment, then you should consider applying for this position. In this role, you will be responsible for building, designing, and implementing our highly scalable, fault-tolerant, and highly available big data platform to process terabytes of data and provide customers with in-depth analytics. You will develop Big Data pipelines using modern technology stacks such as Spark, Hadoop, Kafka, HBase, and Hive. Additionally, you will be tasked with developing analytics applications from the ground up using modern technology stacks such as Java, Spring, Tomcat, Jenkins, REST APIs, JDBC, Amazon Web Services, and Hibernate. You will work collaboratively with the Machine Learning and monetization teams to democratize data for analysis and impact. Your role will also involve building solutions to help the monetization team run experiments at a fast pace and analyze data accurately to calculate impact. A good understanding of the engineering tech stack and ML algorithms will be essential to make data processing jobs powering these algorithms more efficient and scalable. Moreover, you will develop systems to objectively monitor the impact of various experimental changes on machine learning algorithms, clearly highlighting both positive and negative outcomes. Managing Hadoop MapReduce and Spark Jobs, solving ongoing issues with operating the cluster, and participating in Agile/Scrum processes such as Sprint Planning, Sprint Retrospective, and Backlog grooming will also be part of your responsibilities. You will keep in regular touch with the quality engineering team to ensure the quality of the platforms/products and performance SLAs of Java-based microservices and Spark-based data pipelines. Supporting customer issues over email or JIRA, providing updates, patches to customers, and discussing technical documents with the Technical Writing team will round out your duties.