Spectrum - Englewood, CO

posted 2 months ago

Full-time - Principal
Remote - Englewood, CO
10,001+ employees
Telecommunications

About the position

The Principal Architect III - Telemetry & ML role at Spectrum involves designing, developing, and implementing telemetry solutions and AI/ML architectures. This position is crucial for driving innovation and optimizing performance through advanced data analytics and real-time telemetry data. The successful candidate will leverage their expertise in software development, programming, and time-series databases to create scalable systems that enhance operational excellence.

Responsibilities

  • Lead the design and implementation of scalable telemetry systems and AI/ML infrastructure.
  • Collaborate with data scientists and engineers to integrate AI/ML models into software systems.
  • Design software systems for real-time telemetry data collection, processing, and analysis, ensuring scalability, security, and efficiency.
  • Develop frameworks to monitor and optimize the performance of AI/ML models in production using real-time data.
  • Architect solutions that leverage time-series databases to handle large-scale, high-frequency telemetry data and feed it into AI/ML models.
  • Architect and implement end-to-end telemetry systems to capture and analyze large-scale, real-time data from distributed systems.
  • Build and maintain data pipelines that support the collection, processing, and storage of telemetry data for AI/ML model training and monitoring.
  • Design and optimize data storage using time-series databases like VictoriaMetrics, Prometheus, or TimescaleDB for AI/ML workloads.
  • Enable predictive analytics through data-driven insights from telemetry systems and ensure data integrity for AI/ML applications.
  • Implement and configure Grafana for real-time monitoring and visualization of telemetry data.
  • Develop tools and processes to automate the deployment of machine learning models in production environments.
  • Implement monitoring and alerting systems to track the performance and accuracy of AI/ML models.
  • Utilize Grafana, time-series databases, and other monitoring tools to ensure visibility into model performance, system health, and real-time telemetry data.
  • Collaborate with DevOps teams to ensure continuous integration and deployment (CI/CD) of AI/ML models and telemetry solutions.
  • Lead the implementation of event management frameworks to handle telemetry-driven alerts and events.
  • Leverage event management platforms to automate and streamline the detection, response, and resolution of telemetry or model-related issues.
  • Develop proactive event management systems to identify anomalies and improve incident response time.
  • Implement mechanisms for automating event correlation and reducing false positives through intelligent alerting systems.
  • Utilize strong programming skills (Python, Java, Go, etc.) to design and implement robust, reusable, and high-performance software components.
  • Develop and optimize data pipelines that feed real-time telemetry data into AI/ML models using time-series data and other relevant data sources.
  • Work with software engineers to integrate AI/ML models and telemetry data into existing applications.
  • Ensure software architectures are aligned with business goals and industry best practices.
  • Provide technical leadership and mentorship to engineering teams in telemetry, AI/ML, software architecture, and event management.
  • Work closely with stakeholders, including product teams and business leaders, to understand and translate business needs into technical solutions.
  • Stay current with emerging technologies and trends in AI/ML, telemetry, time-series databases, and software development, and drive innovation within the organization.

Requirements

  • 10+ years of cumulative experience with system installation, configuration, operations, software development, and/or database development.
  • BA/BS in Information Technology, Computer Science, MIS or equivalent combination of education and experience.
  • Expert knowledge of software development and delivery (cloud computing, containerization, high-availability, mobile apps, big data, data at rest and in motion, AI, and machine learning).
  • Advanced knowledge of software development and delivery (cloud computing, containerization, high-availability, mobile apps, big data and machine learning).
  • Proven analytical skills to solve complex technology and business problems.
  • Proven ability to present technical concepts to non-technical audiences.
  • Effective written, verbal, and presentation skills for communicating with all levels of the organization, including executive leadership, and in external forums.
  • Effective organizational and leadership skills.
  • Strong business sense and a sense of urgency in achieving business results.

Nice-to-haves

  • Experience with cloud platforms (AWS, Azure, GCP).
  • Familiarity with containerization and orchestration tools (Docker, Kubernetes).
  • Knowledge of additional programming languages (e.g., Scala, Rust).
  • Experience with data governance and compliance frameworks.

Benefits

  • Comprehensive pay and benefits package.
  • Hybrid work policy allowing work from home up to one day each week.
  • Opportunities for career growth and development.
  • Supportive and inclusive workplace culture.