Software Engineer - Machine Learning Infrastructure

ByteDance

Benefits

Responsibilities

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

At ByteDance, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for millions of users across all of our products. We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes. Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility. Join us and make impact happen with a career at ByteDance.

The mission of our AML team is to push next-generation machine learning algorithms and platform for the recommendation system, ads ranking and search ranking in our company. We also drive substantial impact for core businesses of the company. Currently we are looking for Software Engineer - Machine Learning Infrastructure to join our team to support and advance that mission.

Responsibilities:

  • Responsible for the design and implementation of a global-scale machine learning system for feeds, ads and search ranking models.
  • Responsible for improving the use-ability and flexibility of the machine learning infrastructure.
  • Responsible for improving the workflow of model training and serving, data pipelines and resource management for the multi-tenancy machine learning systems.
  • Responsible for designing and developing key components of ML infrastructure and mentoring interns.

Qualifications

  • Proficient in C/C++/Python, and have solid programming skills.
  • Familiar with deep learning frameworks (TensorFlow/Pytorch).
  • Experience in developing and deploying large-scale systems.
  • Ability to work independently and complete projects from beginning to end and in a timely manner.
  • Good communication and teamwork skills to clearly communicate technical concepts with other teammates.
  • Experience on improving core machine learning infrastructure(TensorFlow, Pytorch, and Jax).

Preferred Qualifications:

  • Experience contributing to an open sourced machine learning framework (TensorFlow/PyTorch).
  • Experience in big data frameworks (e.g., Spark/Hadoop/Flink), experience in resource management and task scheduling for large scale distributed systems.
  • Strong background in one of the following fields: Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/TPU/RDMA) or ML for Systems.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

ByteDance is committed to providing reasonable accommodations during our recruitment process. If you need assistance or an accommodation, please reach out to us at kayla.wong@bytedance.com.

Read Full Description
Confirmed 6 hours ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles