Job Description:
1. Responsible for the deployment, optimization, and high-availability assurance of the company's core business systems, cloud platforms (such as AWS/Aliyun/Tencent Cloud), and foundational services (such as Kubernetes, Docker, Nginx, MySQL, Redis, Kafka).
2. Plan and implement system capacity management, performance optimization, and disaster recovery solutions to ensure service stability and scalability.
3. Responsible for the construction and maintenance of CI/CD pipelines to achieve automated building, testing, deployment, and rollback.
4. Design and improve system monitoring, log collection, and alerting systems (Prometheus/Grafana/ELK/OpenSearch).
5. Participate in emergency response, troubleshooting, and post-mortem analysis for production environment incidents, summarizing and driving long-term optimization solutions.
6. Participate in the standardization and process improvement of the operation and maintenance system, and document best practices.
7. Participate in statistical evaluation of overall operation and maintenance costs and audit various IT expenditures.
Job Requirements:
1. Bachelor's degree or higher in computer-related fields (full-time, excluding associate degree upgrades), with 5+ years of experience in large-scale internet or cloud platform operations.
2. Proficient in Linux systems and at least one programming language (Shell/Python/Go).
3. Expertise in Docker, Kubernetes, and CI/CD toolchains (Jenkins, GitLab CI, ArgoCD, etc.).
4. Familiar with monitoring and logging systems such as Prometheus, Grafana, ELK/OpenSearch.
5. Experience with public cloud architectures (AWS, Aliyun, GCP, Azure).
6. Strong communication skills, teamwork spirit, and ability to quickly identify and resolve complex issues.
Bonus Points:
- Extensive experience in IT cost optimization.
- Experience in global network acceleration and security deployment on enterprise cloud platforms.
Benefits:
- Fully remote work (Full Remote) with flexible hours.
- Weekends off and statutory holidays.
- Paid sick leave and annual leave.
- Annual double performance bonuses.
- Chinese-speaking work environment.


