Job Description
Key Responsibilities
- Work with the system development team to analyze and resolve day-to-day network failures, keep system infrastructure running normally, and carry out maintenance operations on large-scale clusters.
- Proactively identify issues and latent risks in the production environment, and strengthen the company's operations and maintenance (O&M) delivery capabilities using open-source or in-house O&M tools.
- Respond quickly to system failures in production, prevent potential failures, and ensure the system's high availability and reliability.
- Utilize host monitoring, log analysis, application performance monitoring (APM), and common system performance analysis commands to locate complex problems, respond promptly, and provide feedback on resolved issues.
- Independently troubleshoot issues that arise when running shell or Python scripts, keeping system processes running smoothly and resolving technical bottlenecks promptly.
- Implement and maintain blockchain network configurations, ensuring compliance with security protocols and optimizing system scalability and efficiency.
- Document operational procedures, maintenance records, and system performance metrics to support knowledge sharing and continuous improvement.
- Collaborate with cross-functional teams to design and deploy blockchain solutions, ensuring alignment with business objectives and technical standards.
Job Requirements
- Proficient in Python, shell scripting, and SQL, with strong skills in writing automation scripts and managing mainstream databases.
- Experienced in blockchain technology, with practical knowledge of network protocols, consensus mechanisms, and smart contract development.
- Deep understanding of cloud computing platforms and security products, with hands-on experience in their operation and maintenance.
- Knowledge of resource virtualization technologies (e.g., containerization, storage orchestration), with experience in building and maintaining scalable infrastructure.
- Excellent problem-solving abilities and analytical skills to diagnose and resolve complex technical issues in production environments.
- Ability to work independently and collaboratively, with strong communication skills to coordinate with development, security, and operations teams.
- Strong attention to detail and organizational skills to manage system configurations, logs, and performance metrics effectively.
- Experience with DevOps practices, CI/CD pipelines, and cloud monitoring tools (e.g., Prometheus, Grafana) for proactive system management is preferred.
- Ability to adapt to rapidly evolving blockchain technologies and continuously improve operational processes through research and innovation.
- Excellent time management skills to prioritize tasks and meet critical system maintenance deadlines under pressure.


