Job Description
1. Participate in hardware/software co-design and instruction set functional verification for AI chips, working closely with the hardware/architecture and software teams to understand hardware and software requirements.
2. Design and implement compiler algorithms, develop and maintain the toolchain, and tune the performance of network models.
3. Design and implement the deep learning software stack.
Job Requirements
1. Strong C/C++/Python programming skills, with a solid grasp of data structures and algorithms, good coding style, and sound working habits.
2. Familiarity with compiler theory and algorithm implementation, with at least two years of compiler development experience.
3. Familiarity with machine learning or deep learning is a plus.
4. Development experience with open-source compilers such as GCC, LLVM, or Open64 is a plus.
5. Experience with Shader/CUDA/OpenCL or assembly-level performance optimization is a plus.
6. Experience developing graph-level optimizations in deep learning frameworks such as Caffe, TensorFlow, PyTorch, or MXNet is a plus.
7. A background in computer architecture, familiarity with chip architecture/microarchitecture (including CPU, (GP)GPU, SoC, and FPGA), and experience with hardware/software co-design are a plus.
8. Self-motivation, the ability to solve problems independently, teamwork, and good communication skills are a must.