Sibitalent Corp - Fremont, CA

posted 3 months ago

Full-time - Mid Level
Fremont, CA

About the position

The Sr. Python Developer position is a critical role within our team, which is dedicated to developing big data projects. The team operates in a unique environment where cloud services such as Google or AWS are not utilized due to the sensitivity of client data. Instead, we are leveraging an on-premise cloud infrastructure that allows us to utilize all big data frameworks while managing terabytes of data. This role is designed for an individual who is not only technically proficient but also capable of collaborating effectively with various stakeholders and global teams to ensure alignment on common goals. In this position, the developer will spend approximately six hours coding and two hours collaborating with team members each day. The role requires advanced-level experience, specifically over eight years, in Python programming, particularly in handling big data. The ideal candidate will have experience with frameworks such as Django and/or Flask, as well as proficiency in PySpark. Additionally, familiarity with implementing data science algorithms and data munging/wrangling is highly desirable. The candidate will also be expected to utilize third-party Python libraries, including but not limited to Pandas, Pyarrow, and Numpy, and have experience with machine learning libraries like scikit-learn. Developing REST APIs that handle large volumes of data sets is a key responsibility, along with experience in relational database management systems (RDBMS) and NoSQL stores such as HBase and Cassandra. A strong emphasis on writing high-quality code, automated testing, and adhering to engineering best practices is essential, as is the ability to create reusable code components. Continuous partnership with stakeholders and coordination with both internal and external teams, including onsite and offshore resources, is a fundamental aspect of this role.

Responsibilities

  • Develop and maintain big data projects using Python programming language.
  • Collaborate with team members and stakeholders to ensure alignment on project goals.
  • Write high-quality, reusable code components and implement automated testing.
  • Develop REST APIs to handle large volume data sets.
  • Utilize big data frameworks and manage terabytes of data in an on-premise cloud environment.
  • Implement data science algorithms and perform data munging/wrangling as needed.
  • Work with third-party Python libraries such as Pandas, Pyarrow, and Numpy.
  • Engage with RDBMS systems and NoSQL stores like HBase and Cassandra.

Requirements

  • 8+ years of experience in Python programming, particularly with big data.
  • Proficiency in Django and/or Flask frameworks.
  • Experience with PySpark for big data processing.
  • Familiarity with implementing data science algorithms.
  • Experience with data munging/wrangling techniques.
  • Proficient in using third-party Python libraries including Pandas, Pyarrow, and Numpy.
  • Experience with machine learning libraries such as scikit-learn.
  • Ability to develop REST APIs involving large volume data sets.
  • Experience with RDBMS systems and NoSQL databases (HBase, Cassandra).
  • Strong emphasis on writing high-quality code and following engineering best practices.

Nice-to-haves

  • Experience with cloud services (even if not currently used) for future scalability considerations.
  • Knowledge of additional machine learning frameworks or libraries.
  • Familiarity with data visualization tools.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service