Bytedanceposted 30 days ago
San Jose, CA
Professional, Scientific, and Technical Services

About the position

Our Infrastructure Engineering team supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. As a Cloud Site Reliability Engineer, you will build, expand, and operate Bytedance's global infrastructures, including large-scale systems in public and private clouds, data centers, and content delivery networks. You will also build tools, automation, visualizations, and monitors to facilitate the operation and optimization of the global infrastructure. This role requires participation in technical operations and rotations in response to performance and reliability issues, and you will help improve the whole lifecycle of infrastructure services from inception and design throughout development to deployment, user support, and refinement.

Responsibilities

  • Build, expand, and operate Bytedance's global infrastructures, including large-scale systems in public and private clouds, data centers, and content delivery networks.
  • Build tools, automation, visualizations, and monitors to facilitate the operation and optimization of the global infrastructure.
  • Participate in technical operations and rotations in response to performance and reliability issues.
  • Help improve the whole lifecycle of infrastructure services from inception and design throughout development to deployment, user support, and refinement.

Requirements

  • Master's degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Electrical Engineering, Computer Science, or related major.
  • 3+ years of experience working with Unix/Linux systems from kernel to shell and beyond, with experience working with system libraries, file systems, and client-server protocols.
  • 3+ years experience with essential system-level apps, like DNS, APT, LDAP, Nginx, CI/CD, Ansible, Packer etc.
  • 2+ years experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.
  • Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
  • Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.
  • Experience in designing, analyzing, and building automation and tools for large-scale systems.

Nice-to-haves

  • Familiarity with Kubernetes techniques.
  • Familiarity with Microservices and FaaS techniques.
  • Experience in Web App or UI design and implementation.
  • Experience in DB design, usage, and DBA.
  • Experience with Unit Tests, integration tests, and performance tests.
  • Experience in system and data security.

Job Keywords

Hard Skills
  • Ansible
  • Build Tools
  • Go
  • Kubernetes
  • Linux Kernel
  • 0gvnMZBA1QJr m8Ql2r3iOoH4
  • 0NLqK rxfiMq4B
  • 0x4q2Kzy9QW ygJn B7pajG1LJ HmAVkY7TnzU9
  • 7I0eg
  • A2HmE E0g3I9JD
  • bIu751
  • cRi3 6aF yq6BuFw
  • kisUyGrq opJW6lNbZ 7bO05T28a
  • l15nmN KLHirlq
  • LJ9WhcQefdY 4PEkFtX
  • Nn93kEX
  • OdrpWk gk19pDNl3B2
  • qMYUg3FWLad xkYBeC1
  • UfH9v2zmaYL7 8RrI7uUA
  • ZKBjQw
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service