Job Description
Key Responsibilities
- Perform daily maintenance of mainstream cloud servers (AWS, Azure, Google Cloud) including automated deployment, configuration, optimization, backup, and troubleshooting
- Establish and optimize operation standards, workflows, emergency plans, and participate in building the operation system framework
- Develop and maintain automation scripts/tools to enhance operational efficiency and automation capabilities
- Research and implement new cloud technologies/services by tracking platform updates and applying cutting-edge practices
- Provide professional technical support with rapid issue resolution
- Manage physical hardware including installation, configuration, troubleshooting, and upgrade operations
- Optimize hardware resource allocation to ensure cluster efficiency and performance
- Collaborate with software development teams to provide technical support and optimization recommendations for high-concurrency computing scenarios
Job Requirements
- 3+ years experience in cloud operations (AWS/Azure/GCP) and hardware maintenance
- Proficient in infrastructure automation tools (Terraform, Ansible, etc.) and scripting languages (Python, Bash)
- Strong understanding of cloud architecture principles and best practices
- Experience with containerization technologies (Docker, Kubernetes)
- Knowledge of monitoring tools and performance optimization techniques
- Excellent troubleshooting skills across both cloud and physical infrastructure
- Ability to collaborate effectively with cross-functional teams
- Continuous learning mindset to stay updated with emerging technologies
Preferred Qualifications
- Cloud certification (AWS Certified Solutions Architect, Azure Administrator, etc.)
- Experience with hybrid cloud environments
- Background in DevOps practices and CI/CD pipelines
- Knowledge of networking protocols and security best practices


