Senior Data Engineer

$135,900 - $166,500/Yr

Spokeo - Pasadena, CA

posted about 2 months ago

Full-time - Senior
Pasadena, CA
101-250 employees
Professional, Scientific, and Technical Services

About the position

As a Senior Data Engineer at Spokeo, you will play a crucial role in developing, optimizing, and enhancing our data technologies, including ETL processes, data pipelines, and entity resolution. This position involves working with AWS infrastructure and various data tools to build and improve data products and automation features, ensuring alignment with organizational goals.

Responsibilities

  • Build infrastructure and data automation pipelines for the extraction, preparation, and loading of data from various sources.
  • Automate and integrate new components into the data pipeline.
  • Work with stakeholders and data science to develop data products including entity resolution and best selection to efficiently execute product vision and strategy.
  • Create unit and stress test components to monitor technical performance and ensure identified issues are resolved.
  • Develop data analysis tools to provide data insights and capture key metrics.
  • Research solutions and maintain technical documentation.
  • Follow best practices for data governance, quality, cleansing, and other ETL-related activities.

Requirements

  • 7+ years of development experience in data engineering.
  • 5+ years of hands-on programming experience with Python.
  • 5+ years of professional experience working in big data ecosystems, Spark is required; PySpark is preferable.
  • 3+ years experience with SQL, schema design, and dimensional data modeling.
  • 2+ years of professional experience working with dataflow orchestration tools, such as Airflow.
  • 2+ years of development experience in highly scalable, distributed systems and cluster architectures using AWS.
  • Prior experience working with large data sets (>100M+ records) is required.
  • A B.S. in Computer Science, Information Systems, or related fields is required.

Nice-to-haves

  • 2+ years of experience with non-relational databases (e.g., DynamoDB, Elasticsearch, etc.) is preferred.

Benefits

  • Participation in an individual annual bonus
  • Stock options
  • 401K
  • 100% medical/dental/vision coverage
  • Unlimited PTO
  • Mental health resources
  • Paid home office equipment
  • Fitness reimbursements
  • Support paying for courses
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service