Site Reliability Engineer

$102,500 - $123,000/Yr

Apex Clearing Corporation - Portland, OR

posted 19 days ago

Full-time - Mid Level
Remote - Portland, OR
Credit Intermediation and Related Activities

About the position

The Site Reliability Engineer at Apex Fintech Solutions will serve as a Distributed Systems Administrator, focusing on maintaining and supporting the company's message queueing infrastructure. This role is crucial for ensuring efficient and reliable communication across systems, with responsibilities that include system maintenance, monitoring, troubleshooting, capacity planning, security compliance, and performance optimization.

Responsibilities

  • Implement, configure, and maintain message queueing systems, including Kafka, RabbitMQ, IBM MQ, and GCP PubSub.
  • Perform routine system upgrades, patches, and optimizations to ensure optimal performance.
  • Implement robust monitoring solutions to proactively identify issues and troubleshoot performance bottlenecks.
  • Respond promptly to system alerts and incidents, diagnosing and resolving issues to minimize downtime.
  • Conduct regular capacity assessments to ensure that the messaging systems meet current and future demands.
  • Recommend and implement scalability enhancements as needed.
  • Implement and enforce security best practices for message queueing systems.
  • Ensure compliance with relevant industry standards and regulations.
  • Collaborate with development and infrastructure teams to integrate messaging systems into applications and services.
  • Analyze system performance and implement optimizations to enhance throughput and reduce latency.
  • Maintain comprehensive documentation for messaging system configurations, procedures, and best practices.
  • Develop and maintain disaster recovery plans for messaging systems.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience).
  • 3+ years of proven experience as a Systems Administrator with a focus on message queueing systems, including Kafka, RabbitMQ, IBM MQ, and GCP PubSub.
  • Knowledge of the maintenance of message-oriented middleware.
  • Experience with cloud-based messaging services, particularly GCP PubSub.
  • Scripting and automation skills (e.g., Bash, Python) for system administration tasks.
  • Familiarity with infrastructure-as-code concepts and tools.
  • Troubleshooting and problem-solving skills.
  • Understanding of security practices related to messaging systems.
  • Ability to work collaboratively in a fast-paced, agile environment.
  • Strong communication skills, both written and verbal.

Benefits

  • Healthcare benefits (medical, dental, and vision)
  • Competitive PTO
  • 401k match
  • Parental leave
  • HSA contribution match
  • Paid subscription to the Calm app
  • Generous external learning and tuition reimbursement benefits
  • Hybrid work schedule allowing flexibility of working from home and office.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service