Job Description
We are seeking a skilled Cloud and Hardware Engineer to maintain and optimize our cloud infrastructure and hardware systems. This role involves working with mainstream cloud platforms while ensuring efficient hardware resource utilization in a high-performance computing environment.
Key Responsibilities
- Responsible for the daily maintenance of mainstream cloud servers such as AWS, Azure, and Google, including automatic deployment, configuration, optimization, backup, and troubleshooting
- Establish and optimize operation and maintenance standards, workflow, emergency plans, etc., and participate in the construction of the operation system
- Write and maintain automation scripts and tools to improve operational efficiency and automation level
- Track new technologies and services on cloud platforms, understand cutting-edge practices, and apply them to practical work
- Provide professional technical support to quickly resolve issues
- Manage and maintain hardware, including installation, configuration, troubleshooting, and upgrades
- Optimize hardware resource utilization to ensure efficient operation of the cluster
- Collaborate with software development teams to provide technical support and optimization suggestions for high concurrency computing
Additional Requirements
The ideal candidate should have strong problem-solving skills and be able to work in a fast-paced environment. Experience with containerization technologies (Docker, Kubernetes) and infrastructure-as-code tools (Terraform, Ansible) would be advantageous.
This position requires excellent communication skills as you will be collaborating with multiple teams across the organization. A proactive approach to identifying potential issues and implementing preventive measures is highly valued.