Lightcast - Milan, TN

posted 9 days ago

Full-time - Entry Level
Milan, TN
Publishing Industries

About the position

As a data analyst for the linguistics team, you will be responsible for European languages assigned to you based on fluency. Your role will be to foster these languages and develop them for a multitude of products delivered to customers. Your job will be to build and maintain these languages per our Lightcast standards and help in the development of further features. To fill this role, we are looking for a dynamic and multilingual person that will quickly learn the ins and outs of the role in order to become an active part of a multicultural team.

Responsibilities

  • Analyze and improve data quality of multilingual text classifiers
  • Work with linguistics and engineering teams to build out new parsers across languages
  • Translate various taxonomies such as Skills, Titles, and Occupations
  • Create crosswalks from origin language titles and skills to Lightcast taxonomies
  • Use SQL for data handling and database management
  • Annotate data used for model training and validation

Requirements

  • Competency in multiple European languages (preferred: three or more)
  • Understanding of syntax and structural analysis of languages
  • Microsoft Excel experience (including vlookups, data cleanup, and functions)
  • Knowledge of query languages such as SQL
  • Knowledge of text analysis or machine learning principles
  • Experience with data analysis using tools such as Excel or Python
  • Knowledge of RegEx
  • Strong linguistics knowledge

Nice-to-haves

  • Experience with data analysis using tools such as Python
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service