Robert Half - Chicago, IL

posted 5 days ago

Full-time - Mid Level
Remote - Chicago, IL
Administrative and Support Services

About the position

The Data Scientist IV position is a permanent role based in Chicago, Illinois, focusing on advanced data science and machine learning applications. The role is primarily remote, with a preference for candidates located in the Midwest. The successful candidate will leverage their expertise in Python, SQL, and various machine learning models to drive data-driven solutions in a collaborative environment.

Responsibilities

  • Utilize Python and its associated libraries to handle various data science tasks.
  • Employ SQL for database management and data manipulation purposes.
  • Develop and implement machine learning models such as Random Forest, KNN, and Time Series Forecasting.
  • Work with neural network architectures, including Transformers, ANN, BERT, GPT models, and open source foundational models like LLama2/3.
  • Use mlFlow for machine learning lifecycle management.
  • Leverage cloud-based machine learning managed services, specifically Azure OpenAI and Azure ML, though familiarity with AWS/Google Cloud Platform is also valued.
  • Work with vector databases and LM/LLM/GEN-AI based tools, libraries, and frameworks such as LangChain, Agentic, Semantic Kernel.
  • Evaluate the performance of ML/DL/LLM systems and handle any drift issues.
  • (Optional) Experience with PySpark/Databricks and Elasticsearch, along with AI component integrations, is a plus.
  • Demonstrate excellent communication and presentation skills, work well within a team, and show a high level of self-motivation.
  • Take LLM-based systems or Neural Network-based systems or mixture of experts-based systems to production.

Requirements

  • Strong communication skills to effectively convey complex data findings to team members and stakeholders
  • Proficiency in database management and integration for seamless data handling
  • Ability to create and deliver presentations to various audiences, showcasing data insights and solutions
  • Experience in production environments, ensuring smooth operation and maintenance of data systems
  • Engineering background for understanding and designing data models and structures
  • Expertise in SQL and Python for data manipulation and analysis
  • Familiarity with Cloud Technologies such as Microsoft Azure and AWS Technologies for scalable data storage and processing
  • Experience in component selection and integration within various libraries and frameworks
  • Solid understanding of statistics and mathematics for advanced data analysis
  • Degree in Computer Sciences or a related field
  • Knowledge of Artificial Intelligence (AI) and Machine Learning techniques, including Neural Nets and Random Forest algorithms
  • Experience with Elasticsearch Technologies for efficient data searching and analytics
  • Ability to manage services and coordinate with different teams for project completion
  • Proficiency in PySpark for distributed data processing
  • Understanding of LLM (Latent Logistic Markov) model for sequence prediction tasks.

Nice-to-haves

  • Experience with PySpark/Databricks and Elasticsearch, along with AI component integrations.

Benefits

  • Medical insurance
  • Vision insurance
  • Dental insurance
  • Life insurance
  • Disability insurance
  • 401(k) plan
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service