Job Description
1. Help design and implement the inference engine SDK, improving inference performance, usability, and product stability.
2. Help design and implement AI compilation for the inference engine, including graph fusion, other graph optimizations, kernel optimization, and auto-tuning.
3. Help design and implement the inference engine's runtime system, including memory management and resource management, so that it runs efficiently and reliably.
4. Help design and implement inference optimization for large language models (LLMs): building on the inference engine, develop and apply core LLM inference optimization techniques.
Requirements
1. Master's or higher degree in Computer Science, Computer Engineering, Electrical Engineering, Applied Mathematics, or a related computing-focused field (or equivalent experience), with 5+ years of relevant software development experience.
2. Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
3. Familiarity with deep learning frameworks such as TensorFlow or PyTorch.
4. Strong passion for technology; proactive, self-driven, and able to work without supervision.
5. Preferred qualifications:
a) Experience developing AI compilers or inference engines.
b) Familiarity with CPU/GPGPU architecture and experience developing deep learning systems, including performance modelling, profiling, and code optimization.
c) Experience with TensorRT or MLIR (highly preferred), and a habit of following industry technology trends.
Location: Hangzhou (West Tower T6, UK Center, Euro America Financial City (EFC), No. 1122 Xiangwang Street, Yuhang District, Hangzhou)