Business Unit
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers, TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia,TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
What the Role Entails
- Research and optimize content generation models (text, image, audio, 3D models, etc.) to address challenges such as generation quality, diversity, controllability, and efficiency.
- Aim to improve user experience and production effectiveness, and support the productization of algorithms.
- Conduct algorithm training and optimization in areas such as image generation, multi-modal large models, and few-shot learning.
- Based on inhouse products and business needs, improve the performance and experience of AI painting, text generation, and video generation through: Prompt optimization / Generation model R&D / Adapter development / Performance acceleration which also includes resolving algorithm bottlenecks when applying models in real business scenarios.
- Address the industrial deployment of multimodal generative models and actively explore model design and optimization in an R&D context.
Who We Look For
- PhD (preferably fulltime) in Computer Science, Artificial Intelligence, Mathematics, or related fields.
- Solid foundation in computer vision or machine learning algorithms; candidates with publications in top conferences or journals are preferred.
- Proficient in machine learning and deep learning fundamentals, and familiar with mainstream AIGC frameworks, including GAN, VAE, VQGAN, Diffusion models, etc.
- Familiar with generation model extensions such as ControlNet, LoRA, and Text Inversion.
- Familiar with multi-modal models like CLIP, ERNIE-ViL, and other transformer-based cross-modal representation models. Hands-on experience in NLP, multi-modal learning, or AI-generated content is a strong plus.
- Strong learning ability, clear logical thinking, excellent communication skills, and a high level of curiosity.
- Good teamwork and interpersonal communication skills.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Read Full Description