Onesource Regulatory - Seattle, WA

posted 2 months ago

Full-time
Seattle, WA
Professional, Scientific, and Technical Services

About the position

OneSource Regulatory Technology is seeking an experienced data engineer to join their product solutions team. This full-time contractor role involves pulling data from various sources, cleaning, normalizing, and loading it into databases, while ensuring data integrity and quality. The ideal candidate will have a strong background in data engineering, particularly in the pharmaceutical space, and will be responsible for developing strategies to maintain high data quality standards.

Responsibilities

  • Parse and synthesize XML and/or JSON documents.
  • Curate data through intermediate to advanced web scraping techniques.
  • Fetch data via SFTP, FTP, Wget, Curl, REST APIs, and GraphQL queries.
  • Utilize Linux command line tools such as grep, wc, sed, awk, and others.
  • Have basic knowledge of SQL with databases like PostGres, MySQL, and Google BigQuery.
  • Understand No-SQL databases like MongoDB or similar.
  • Familiarity with cloud technologies, including storage buckets and serverless functions.
  • Extract text and images from PDF files.
  • Use Puppeteer or other automatable web client technologies.
  • Understand JavaScript, HTML/CSS, and HTTP methods for web scraping.

Requirements

  • At least 4+ years of experience in data engineering.
  • Solid experience with Python and libraries such as Pandas and requests.
  • Basic knowledge of SQL and No-SQL databases.
  • Familiarity with cloud technologies and web scraping tools.
  • Strong English communication skills and attention to detail.

Nice-to-haves

  • Experience within the Pharmaceutical Space.
  • Ability to expose data via C# .NET Core and/or GraphQL.
  • Experience with Google Cloud Platform services.
  • Knowledge of Python multi-threading for data manipulation and scraping.
  • Familiarity with Docker and Kubernetes for data processing.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service