Expert Application Engineer SRE

$110,500 - $154,900/Yr

Discover Financial Services - Deerfield, IL

posted about 1 month ago

Full-time - Mid Level
Remote - Deerfield, IL
Credit Intermediation and Related Activities

About the position

As an Expert Application Site Reliability Engineer (SRE) at Discover Financial Services, you will play a crucial role in ensuring the availability and performance of critical applications, including the Card and Bank websites and mobile application. This position focuses on applying software engineering principles to IT infrastructure and operations, with an emphasis on creating reliable and scalable software systems. You will work in an Agile environment, collaborating with development teams to enhance infrastructure and automate operational processes, while also managing risks and customer-impacting issues.

Responsibilities

  • Partner with Application Development teams to build resiliency into critical websites and mobile applications and define best practices for SLI/SLO/Error Budgets.
  • Proactively identify collaboration opportunities across the firm to promote reusable solutions at scale.
  • Build out end-to-end observability in partnership with Application Development teams and other SREs.
  • Contribute to organizational strategy for monitoring, alerting, and dashboards.
  • Provide consulting expertise across SRE best practices for architects and application teams.
  • Create and maintain technology vision and roadmap for Digital SRE.
  • Drive strategic technology decisions collaborating with internal and industry experts.
  • Evolve capacity management and performance management tools/processes to align with the company's cloud strategy.
  • Define the disaster recovery plan needed for critical applications.
  • Research industry best practices and add technical capabilities at Discover, such as chaos engineering.
  • Participate in an on-call escalation rotation.
  • Research new technology opportunities and how they can be used to add technical capabilities at Discover.
  • Drive strategic technology decisions based on collaboration with a broad field of experts outside of Discover.
  • Contribute to the external image of Discover Technology as a desired workplace to learn technology best practices.
  • Shape learning paths for Discover engineers by sharing knowledge gained from external experiences.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 5+ years of experience in Site Reliability Engineering or related roles.
  • Strong understanding of software engineering principles and practices.
  • Experience with monitoring and observability tools.
  • Proficiency in scripting and automation tools.
  • Knowledge of cloud infrastructure and services.

Nice-to-haves

  • Experience with chaos engineering practices.
  • Familiarity with Agile methodologies and practices.
  • Certifications in relevant technologies or methodologies (e.g., AWS, Azure, Kubernetes).
  • Strong communication and collaboration skills.

Benefits

  • Paid Parental Leave
  • Paid Time Off
  • 401(k) Plan
  • Medical, Dental, Vision, & Health Savings Account
  • Short-Term Disability, Life, Long-Term Disability and Accidental Death & Dismemberment Insurance
  • Recognition Program
  • Education Assistance
  • Commuter Benefits
  • Family Support Programs
  • Employee Stock Purchase Plan
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service