About the Team

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructure. Our SREs are tasked with ensuring that traffic services are reliable, fault-tolerant, efficiently scalable, and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale data centers and public clouds, and a global load balancer that handles Tbps of traffic.

Responsibilities

Build, expand, and operate ByteDance's global traffic platform, including large-scale systems in public and private clouds and edge data centers.
Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global traffic platform.
Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
Help improve the whole lifecycle of infrastructure services, from inception and design through development, deployment, user support, and refinement.

Qualifications

Minimum Qualifications

Bachelor's or Master's degree in Computer Engineering, Electrical Engineering, Computer Science, or a related major.
Proven multi-year experience working with Linux systems, from kernel to shell and beyond, including system libraries, file systems, and client-server protocols.
At least 3 years of experience in one or more programming languages, such as Go, Python, or shell scripting.
Familiarity with cloud and CI/CD frameworks and tools, such as Git, Docker, and Kubernetes.

Preferred Qualifications

Experience in designing, analyzing, and building automation and tools for large-scale systems.
Experience in building solutions with AWS, Google Cloud, Azure, and other cloud services.
Experience in networking technologies such as TCP/IP, HTTP, and DNS in a carrier-grade environment.
Experience in developing and operating one or more systems such as Kubernetes, Nginx, IPVS, or the ELK stack.
Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.


Confirmed 3 hours ago. Posted 30+ days ago.

Site Reliability Engineer, Traffic Platform

ByteDance

Discover Similar Jobs

Site Reliability Engineer, Traffic Platform - 2025 Start

Site Reliability Engineer, Traffic Platform - Traffic SRE - 2025 Start

Site Reliability Engineer, Traffic Platform - Traffic SRE

Site Reliability Engineer (Cloud Native Platform) - Traffic Infrastructure

CDN Senior Site Reliability Engineer - Traffic Infrastructure

Suggested Articles

Entry Level and Junior Customer Experience Jobs at Startups

Junior Software Jobs & Internships at Media Technology Firms