Apex Clearing Corporation - Portland, OR

posted 2 months ago

Full-time - Entry Level
Remote - Portland, OR
Credit Intermediation and Related Activities

About the position

Apex Fintech Solutions (AFS) is seeking an Associate Site Reliability Engineer to join our team in Portland, Oregon. In this role, you will serve as a Distributed Systems Administrator, playing a crucial part in maintaining and supporting our message queueing infrastructure. This infrastructure is essential for enabling efficient and reliable communication across our systems, which is vital for processing millions of transactions daily. Your expertise will be instrumental in ensuring the availability, performance, and scalability of our messaging systems, which include Kafka, RabbitMQ, IBM MQ, and GCP PubSub. As an Associate Site Reliability Engineer, you will be responsible for implementing, configuring, and maintaining these message queueing systems. This includes performing routine system upgrades, patches, and optimizations to ensure optimal performance. You will also implement robust monitoring solutions to proactively identify issues and troubleshoot performance bottlenecks, responding promptly to system alerts and incidents to minimize downtime. Capacity planning will be a key aspect of your role, as you will conduct regular assessments to ensure that the messaging systems meet current and future demands. You will recommend and implement scalability enhancements as needed, while also enforcing security best practices and ensuring compliance with relevant industry standards and regulations. Collaboration is essential in this position, as you will work closely with development and infrastructure teams to integrate messaging systems into applications and services. You will analyze system performance and implement optimizations to enhance throughput and reduce latency, while also maintaining comprehensive documentation for messaging system configurations and procedures. Additionally, you will develop and maintain disaster recovery plans for messaging systems, conducting regular testing of recovery procedures to ensure system resilience.

Responsibilities

  • Implement, configure, and maintain message queueing systems, including Kafka, RabbitMQ, IBM MQ, and GCP PubSub.
  • Perform routine system upgrades, patches, and optimizations to ensure optimal performance.
  • Implement robust monitoring solutions to proactively identify issues and troubleshoot performance bottlenecks.
  • Respond promptly to system alerts and incidents, diagnosing and resolving issues to minimize downtime.
  • Conduct regular capacity assessments to ensure that the messaging systems meet current and future demands.
  • Recommend and implement scalability enhancements as needed.
  • Implement and enforce security best practices for message queueing systems.
  • Ensure compliance with relevant industry standards and regulations.
  • Collaborate with development and infrastructure teams to integrate messaging systems into applications and services.
  • Analyze system performance and implement optimizations to enhance throughput and reduce latency.
  • Maintain comprehensive documentation for messaging system configurations, procedures, and best practices.
  • Develop and maintain disaster recovery plans for messaging systems.
  • Design application usage patterns across multiple availability regions.
  • Conduct regular testing of recovery procedures to ensure system resilience.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience).
  • 1+ years of proven experience as a Systems Administrator with a focus on message queueing systems, including Kafka, RabbitMQ, IBM MQ, and GCP PubSub.
  • Knowledge of the maintenance of message-oriented middleware.
  • Experience with cloud-based messaging services, particularly GCP PubSub.
  • Scripting and automation skills (e.g., Bash, Python) for system administration tasks.
  • Familiarity with infrastructure-as-code concepts and tools.
  • Troubleshooting and problem-solving skills.
  • Understanding of security practices related to messaging systems.
  • Ability to work collaboratively in a fast-paced, agile environment.
  • Strong communication skills, both written and verbal.

Benefits

  • Healthcare benefits (medical, dental, and vision)
  • Employee Assistance Program (EAP)
  • Competitive Paid Time Off (PTO)
  • 401k match
  • Parental leave
  • HSA contribution match
  • Paid subscription to the Calm app
  • Generous external learning and tuition reimbursement benefits
  • Hybrid work schedule allowing flexibility of working from home and one of the primary offices.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service