Research Scientist, Foundation Model, Vision and Language

TikTok

Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

About the Team

Our team's mission is to empower content understanding and creation using CV/NLP related technologies. We focus on cutting-edge R&D in areas like multi-modal understanding, vision and language, foundation models, audio/music understanding and generation with an emphasis on content creation. The team is a mix of experienced research scientists and research engineers, aiming to push the research boundaries in multi-modality and applying our research results to improve the experience of TikTok users.

Responsibilities

  • Conduct cutting-edge research and development in computer vision and natural language processing, especially in the areas of multi-modality, vision and language, etc.
  • Publish our latest research results, and help to build our brand in the research community.
  • Transfer our research results to product applications, and explore new product ideas with CV/NLP at its core.

Qualifications

Qualifications

  • Research and engineering experience in one or more areas of computer vision and natural language processing, including but not limited to
  • Experience in multi-modal understanding, vision and language, such as video captioning, VQA, Text-to-video retrieval, and other related topics.
  • Work with very large-scale datasets, and build very large-scale datasets to scale up foundation models.
  • Experience with language models and apply them in various downstream tasks.
  • Experience in audio/music understanding and generation.
  • Preferring candidates with publications in top-tier venues such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, EMNLP, ACL, COLING, etc
  • Highly competent in algorithms and programming; Strong coding skills in Python and popular deep learning frameworks.
  • Work and collaborate well with team members.
  • Ability to work independently; Strong communication skills.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at rd.accommodations@tiktok.com

Read Full Description
Confirmed an hour ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles