工作职责:
1. 参与基础大语言模型应用研发;
2. 结合机器学习、强化学习等技术优化基础大语言模型
3. 调研并探索SFT/RLHF方向前沿算法、框架,持续提升现有算法的效率与效果。
Responsibility:
• Lead, collaborate, and execute on research that pushes forward the state of the art in large language model research
• Use machine learning, reinforcement learning and other technologies to optimize fundamental large language models
• Directly contribute to experiments related to supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), continuously improving the effectiveness of existing algorithms
任职资格:
1. 有计算机科学、数学、统计学或相关领域的硕士或博士学位;
2. 熟悉Python与深度学习框架,具有良好的编程能力和扎实的数学理论基础;
3. 关注行业前沿进展,对技术开发及应用有热情,有自己的想法并乐于挑战自我;
4. 良好的沟通能力,跨团队协作能力,具备出色的规划、执行力,强烈的责任感,以及优秀的学习能力和自我驱动力;
Qualification:
• MS or PhD in Computer Science, Computer Engineering, Mathematics, Statistics or related fields
• Experienced in Python and deep-learning, Excellent programming skills, Knowledgeable of mathematical concepts
• Keep up with the latest scientific literature and advancements in the LLM domain, suggesting potential improvements or integrations
• Excellent communication and collaboration skills, as well as the ability to work independently and manage multiple projects simultaneously. Being self-motivated
加分项
1. 有相关领域的开源项目、竞赛获奖、顶会论文发表/在投;
2. 熟悉LangChain、DeepSpeed等LLM开源工具,工程能力较强;
Preferred experience:
• Experienced in building orchestration using tools such as LangChain, DeepSpeed and other open-source tools
• Project experience of large language models, awarded in competition, first author publications at top conferences or in the process of submission