Verint Systems - Trenton, NJ

posted 7 days ago

Full-time - Mid Level
Trenton, NJ
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

The Sr. Engineer, Site Reliability at Verint is responsible for applying engineering principles to enhance system reliability and performance through automation and collaboration. This role focuses on improving outage response, establishing best practices, and driving continuous improvement in a DevOps environment. The engineer will work closely with various teams to implement solutions that ensure system availability and resilience.

Responsibilities

  • Document, agree on, and implement a consequence practice for when an error budget is breached
  • Drive our transformation to 'everything as code'
  • Partner with R&D and Operations teams to enhance telemetry
  • Work with Architects to design for availability and performance
  • Provide intelligence on system performance for continuous improvements
  • Use operational intelligence from observability and telemetry to auto-remediate availability issues
  • Gather data and use AI to drive predictive alerts and actions
  • Design, automate and run anti-fragility tests, and conduct fire drills to ensure resiliency
  • Provide production sizing guidelines for cost-based autoscaling
  • Create, manage and maintain SLIs and SLOs
  • Participate in operations, rapid emergency response efforts, and blameless postmortems
  • Support incremental and continuous deployments into production
  • Provide guidance to lower-level engineers
  • Develop and provide input into new operational standards and best practices
  • Lead and drive participation in process improvement, training & tool development

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent experience
  • 5 years of experience with high-level languages such as Go, Python, Java, C#
  • 5+ years of experience with build, source, and editing tools, including make, vi, bash
  • Demonstrable experience automating build, testing, deployment, alerting, and similar work
  • 3 years supporting a multi-region, multi-tenant, SaaS or PaaS environment
  • 3 years of experience with AWS (preferably AMI, EC2, EBS, ELB, IAM, KMS, RDS, S3, SNS, VPC, Route 53, CloudWatch, Lambda)
  • 3 years of experience with automated delivery tools, e.g. Harness, Jenkins, Azure DevOps
  • 2 years of experience with Infrastructure as Code, e.g. Terraform, Helm
  • 3 years of experience with Docker, Kubernetes, Helm, YAML
  • 3 years of experience with Git, branching strategies, and automated config management, e.g. GitHub, GitLab, Chef, Ansible
  • Excellent problem-solving skills
  • 2 years of experience with observability frameworks (telemetry, log aggregation, APM, synthetic testing), e.g. Datadog, AppDynamics, Splunk

Nice-to-haves

  • AWS Developer, DevOps, SysOps, or Solution Architecture certifications
  • Detail-oriented and highly organized with the ability to manage multiple priorities and parallel projects
  • Excellent written and verbal communication skills
  • Experience with Identity Solutions such as Auth0, Keycloak
  • Experience with the HashiCorp suite, including Vault, Consul, and Boundary
  • Demonstrated experience working in agile environments
  • Experience implementing and managing data processing platforms that meet regulatory compliance requirements such as PCI DSS, HIPAA, and GDPR
  • Excellent organization, time management, and project skills
  • Previous success operating in a matrix environment
  • Experience successfully leading a DevOps or SRE transformation from a technical perspective
  • Strong analytical skills

Benefits

  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401k plan
  • Paid holidays
  • Flexible scheduling
  • Professional development opportunities
  • Employee discount programs