Research Scientist in Large Multimodal Models Applications - San Diego

ByteDance

Responsibilities

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok and Helo as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible.

Together, we inspire creativity and enrich life - a mission we aim towards achieving every day.

To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve.

Join us.

Team Introduction

Multimedia Lab's mission is to promote cutting-edge research in multimedia (including, but not limited to image/video data processing, compression and transmission), and to transfer technologies into our products for better serving our hundreds of millions of users. We are looking for exceptional individuals from all area of multimedia processing/compression/transmission, who have a track record of research excellence, a passion to shape the future of multimedia processing, and the potential to become an outstanding leader in the field.

Responsibilities

1. Contribute to the research and development of multimedia algorithms based on large multimodal models, including but not limited to video understanding, quality assessment, video processing and enhancement, and video compression.

2. Optimize and accelerate the performance of algorithms related to large multimodal models.

3. Explore the implementation of large multimodal models in multimedia applications, such as short video streaming, video transcoding, live streaming, etc.

4. Conduct advanced academic research on large multimodal models and publish findings in top international conferences and journals.

Qualifications

Minimum Qualification

1. Proficiency in Diffusion, LLM, and other advanced large multimodal models; experience with model training, tuning, and application.

2. Familiarity with computer vision (CV) algorithms, including GAN, VAE, and Diffusion for AIGC.

Preferred Qualification

1. Experience with NLP and RL algorithms, and knowledge of models such as Transformer, BERT, and GPT is preferred.

2. A history of leading impactful projects in large multimodal models or publishing in top conferences (NeurIPS, ICLR, ICML, etc.) is advantageous.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/cdpT2

Read Full Description
Confirmed 13 hours ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles