Unclassified - San Jose, CA

posted 4 months ago

Full-time - Senior
San Jose, CA
1,001-5,000 employees

About the position

OKX is revolutionizing world systems through our cutting-edge digital asset exchange, Web3 portal, and blockchain ecosystems. We are deeply committed to shaping a fairer, more transparent, and accessible society through blockchain technology. To date, we have over 50 million users, 3000+ employees, and a presence in 180+ countries, all believing in the same vision as us. Our platform is safe and reliable, backed by our Proof of Reserves. As strong supporters of the Arts and Sports, we take pride in our partnerships in these fields. The Cloud Infrastructure Engineering team plays a critical role in our organization. This team is responsible for building tools and infrastructure that promote early detection of production failures, which is essential for ensuring a stellar customer experience. Our work focuses on driving safety, health, and uptime of our platform, as well as the ability to remedy unforeseen problems. By alleviating some of the complex burdens associated with scaling and maintaining uptime in distributed systems, Cloud Infrastructure Engineers enable development teams to concentrate on feature development rather than the intricacies of achieving and maintaining service level commitments. We are looking for a creative and driven individual who can spearhead our efforts to implement innovative infrastructure solutions that will significantly impact our platform's stability and scalability. We are open to hiring at both senior and staff levels, providing an opportunity for experienced professionals to contribute to our mission.

Responsibilities

  • Research, architect, and implement solutions based on AWS products.
  • Maintain and configure AWS products and services, ensuring daily maintenance of each AWS cloud environment.
  • Utilize Terraform for infrastructure as code (IaC) to automate the provisioning and management of cloud resources.
  • Prepare documents related to AWS Cloud Operations & Maintenance (O&M) and formulate O&M specifications.
  • Monitor company services and handle alerts in a timely manner to ensure service stability and uptime.
  • Collaborate with development teams to ensure seamless integration and deployment of new features.

Requirements

  • Bachelor's degree or above in Computer Science or relevant domains.
  • Over 6 years of experience in DevOps, SRE, or related positions.
  • Proficient in AWS distributed management, large-scale clustering, fault tolerance, backup, load balancing, and other technologies.
  • Deep understanding of high availability architecture and capacity planning, with rich experience in handling complex problems.
  • Solid Linux platform operation and maintenance skills, including debugging capabilities and proficiency in troubleshooting, configuration tuning, and performance analysis.
  • Familiarity with Kubernetes (k8s) for container orchestration and management.
  • Experience with deployment and tuning of EC2, EKS, VPC, or big data products.
  • Experience with microservices architecture, including deployment, scaling, and maintenance.
  • Experience in monitoring, O&M, and management of AWS large-scale servers and containers.
  • Familiarity with Internet company architecture and configurations such as nginx, redis, MySQL, kafka, and Elasticsearch.
  • Proficient in using Python/Shell for development.
  • Strong engineering skills, proficient in at least one O&M or infrastructure sub-area, such as public cloud networking, SRE, DevOps, or cloud-native.
  • Excellent business analysis ability, system architecture ability, and problem-solving ability.

Nice-to-haves

  • Bilingual in English and Mandarin.
  • Familiarity with the operation and maintenance management of Alibaba Cloud, Google Cloud, Microsoft Cloud, and other cloud providers.

Benefits

  • Competitive total compensation package.
  • L&D programs and education subsidy for employees' growth and development.
  • Various team building programs and company events.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service