Tech Lead Manager: Security Platforms and Infrastructure SRE

TikTok

Education
Benefits
Qualifications
Special Commitments

Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

The Global Security Organization provides industry-leading cyber-security and business protection services to TikTok globally. Our organization employs four principles that guide our strategic and tactical operations. Firstly, we Champion Transparency & Trust by leading the charge in organizational transparency, prioritizing customer trust, and placing user needs first. Secondly, we aim to maintain Best in Class Global Security by proactively identifying and reducing risks while enabling innovative product development. We constantly work towards a sustainable world-class security capability. Thirdly, we strive to be a Business Catalyst & Enabler by embodying the DNA of technical innovation and ensuring our Global Security operations are fast and agile. Finally, we Drive Empowered & Risk-Informed Decision Making by providing our leaders with the necessary information to make agile decisions based on risk. In order to enhance collaboration and cross-functional partnerships, our organization follows a hybrid work schedule that requires employees to work in the office for 3 days a week, as directed by their manager. We regularly review our hybrid work model, and the specific requirements may change at any time.

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As the technical lead manager of the Security Platforms and Infrastructure SRE team, you will have the opportunity to build and lead a highly skilled SRE team responsible for maintaining critical technologies used to secure one of the largest data platforms in the world. You'll need to ensure the data, services and infrastructures are reliable, fault-tolerant, efficiently scalable and cost-effective. You'll also have the opportunity to design, build and deliver all kinds of systems in collaboration with your team.

Responsibilities

  • Building and managing the EU SRE team, including team recruitment, new talent training, system operation/maintenance/coordination and team culture building.
  • Establish on-call rotations, help chats, support groups, and escalation paths for your SRE team.
  • Give technical leadership and support to your team and stakeholders.
  • Coordinate with Security, Privacy, Compliance, and Legal to ensure adherence to latest regulations and internal compliance requirements
  • Engage in and improve the whole lifecycle of service, from inception and design, through to deployment, operation and refinement;
  • Ensure reliable, fault-tolerant, efficiently scalable and cost-effective data, services and infrastructures;
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health. Practice sustainable incident response and blameless postmortems;

Qualifications

  • Extensive hands-on experience operating large scale Kubernetes environments
  • Strong hands-on experience in Linux and TCP/IP Networking
  • Prior experience with configuration and maintenance of common applications such as DNS, Nginx, Docker, MySQL, etc
  • BS or MS degree in Computer Science, Electrical Engineering, Computer Engineering or related areas.
  • Experience in one or more programming languages such as Go, Java, C++, Python etc.
  • Good problem-solving, analytical thinking capabilities and exceptional attention to details.
  • Good communication and collaboration skills.

Preferred Qualifications

  • Minimum of five years of work experience in software development or SRE, particularly in the design, building, scaling, and troubleshooting of cloud systems.
  • Operational experience running a 24x7 production infrastructure at scale.
  • Proficiency working with data structures, schemas, and technologies like Hadoop, Hive, Redis, and MySQL
  • Experience in using cloud-native services like GKE, EKS, AWS/GCP load balancing, AWS/GCP cloud storage platforms (S3, storage buckets)
  • Experience in designing and analyzing large-scale distributed systems
  • Experience leading and hiring engineers

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

#LI-Hybrid

Read Full Description
Confirmed 19 hours ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles