Backend Production Support Engineer25k-45k
广东本科及以上5-10年系统运维运维技术支持工程师
五险一金#补充医疗保险#带薪年假#节日福利 #外企文化#弹性办公时间#团建聚餐
We are a team to design, develop, maintain, and improve software for various ventures projects, i.e., projects that are adjacent to our core businesses and are bootstrapped fast with a lean team. You will be actively involved in the design of various components behind scalable applications, from frontend UI to backend infrastructure.
Role Responsibilities:
1. Technical Development and Testing
a. Engage in the requirements analysis, system design, and undertake part of the development work within the technical team.
b. Write and refine integration tests to ensure the robustness and reliability of the software.
c. build close relationship with different operation team to continuously improve the operation efficiency
2. Customer Support and Problem Resolution
a. Act as the primary technical contact for resolving issues encountered by the derivs trading project in production. This involves troubleshooting and root cause analysis of issues, and implementing effective solutions
b. Participate in the daily work of the CS team. Actively receive and handle issues submitted by users, and address their requirements promptly.
c. Demonstrate a comprehensive understanding of all upstream and downstream business services related to derivs trading, including onboarding, pricing and trading, to quickly analyze and locate problems. Actively solve the problems encountered or coordinate with relevant teams to promote problem-solving.
d. Regularly organize and analyze the encountered problems, transform them into business requirements, and submit them to the Project Manager (PM) and developers (dev) to improve and optimize our products.
3. System Reliability and Maintenance
a. Collaborate with the SRE team to formulate reasonable monitoring and alerting mechanisms. By leveraging monitoring tools and techniques, identify potential issues in the production environment as early as possible to prevent system failures.
b. Provide system upgrade and optimization plans to enhance system performance and reliability.
c. In the event of a system failure, coordinate with the infra team, SRE team, and other relevant teams to identify and troubleshoot the problem, aiming to minimize downtime and the impact on users.
4. Timezone: Work mostly in EST timezone as most of our customers, but also to have couple hours overlap with HKT timezone
5. Potential Day/Night Shift
6.Working and Interview Language: English
Role Requirements:
5+ years of experience in DevOps, particularly with AWS, Kubernetes, and CI/CD pipelines.
Solid experience in a Linux environment and relevant support tools.
Experience with shell scripting, Ruby, Golang and SQL technologies.
Familiar with monitoring systems and logging systems (e.g. Datadog, Sumologic, OpenTelemetry).
Familiarity with exchange platform domain knowledge
Experience in GitOps with ArgoCD is a plus.
Strong analytical skills and the ability to proactively identify issues before they escalate.
Detail-oriented with strong ownership of work and the ability to multitask across simultaneous projects.
Excellent communication and interpersonal skills to engage both technical and non-technical stakeholders.
Proficiency in both written and spoken English and Mandarin; Cantonese is a plus.
Willingness to work flexible hours to cover US trading sessions.
Embody a proactive and positive mindset, demonstrating a strong "can-do" attitude