SAIC - McLean, VA
posted about 2 months ago
The position involves supporting a program that utilizes integrated discrete technologies to manage massive data processing, storage, modeling, and analytics across thousands of unique data sources. The primary goal is to facilitate threat identification and analysis while aiding in the achievement of both tactical and strategic objectives. The data platform capability, which serves as the backbone for various applications, accelerates operations by leveraging technologies and systems for data processing and modeling. This role requires the development and application of data processing technologies such as Python, SPARK, Java, SQL, Jenkins, PyPi, Terraform, Cloudera, ElasticSearch, Pentaho, Apache NiFi, and Apache Hop. The successful candidate will be responsible for performing data processing and developing methodologies to meet analytic requirements in clustered computing environments. Additionally, the role includes supporting downstream systems and capabilities of external customer organizations that rely on the data platform. This involves creating integration plans that utilize new data processing, modeling, and storage technologies, including cloud environments. The candidate will evaluate data collections to assess their potential value to the customer’s data platform and generate assessments to support data acquisition and engineering activities. This will ensure that data is effectively integrated into the data platform systems to maximize its value. The position also entails performing and supporting data modeling and engineering activities, refining existing models, and creating new models and data feeds to support both existing and new analytic methodologies, all under customer oversight.