DevOps Engineer at Web3Auth

Full Time1 month ago
Employment Information
Job Description
This role is focused on maintaining and enhancing the reliability of our technical infrastructure. The ideal candidate will be responsible for implementing system updates and resolving critical issues to ensure seamless operations. They will also serve as a key member of the DevOps team, providing Level 2 technical support and being on-call for urgent production incidents. The position requires proactive development of tools and processes to minimize system errors and elevate customer satisfaction. Additionally, the candidate will design and execute strategies for system troubleshooting, maintenance, and integration with internal back-end systems. They will analyze root causes of technical failures, investigate complex issues, and develop automation scripts to streamline visualization tasks.
Key Responsibilities
  • Deploy updates and fixes to ensure the stability and performance of our services, including version control, testing, and rollback procedures.
  • Monitor system health and maintain high uptime by proactively identifying and mitigating potential risks.
  • Provide Level 2 technical support to resolve escalated issues, while being on-call to address urgent DevOps team needs during production outages.
  • Develop and maintain tools that automate error detection, reduce manual intervention, and improve overall operational efficiency.
  • Design and implement integration solutions for internal back-end systems, ensuring compatibility and data consistency across platforms.
  • Conduct root cause analysis for production errors, document findings, and propose preventive measures to avoid recurrence.
  • Investigate and resolve complex technical issues, including system configuration, network connectivity, and application performance bottlenecks.
  • Create and refine scripts for automating visualization tasks, such as data processing, reporting, and dashboard generation.
  • Establish standardized procedures for system troubleshooting, maintenance, and incident response to ensure consistency and scalability.
  • Collaborate with cross-functional teams to align technical solutions with business objectives and user requirements.
  • Continuously optimize system workflows and infrastructure to enhance reliability, security, and user experience.
  • Stay updated on emerging technologies and industry best practices to drive innovation in system management and automation.
Job Requirements
  • Proven experience in DevOps operations, with a strong track record of maintaining high system uptime and resolving critical issues.
  • Advanced knowledge of system administration, automation tools (e.g., Ansible, Puppet), and cloud platforms (e.g., AWS, Azure).
  • Excellent problem-solving skills and ability to analyze complex technical scenarios to identify root causes and implement effective solutions.
  • Proficiency in scripting languages (e.g., Python, Bash) for automation and visualization tasks, including API integration and data processing.
  • Strong understanding of software development lifecycle, with experience in integrating applications with internal back-end systems.
  • Ability to design and document standardized procedures for system maintenance, troubleshooting, and incident management.
  • Excellent communication skills to collaborate with teams and explain technical solutions to non-technical stakeholders.
  • Preferred: Experience with CI/CD pipelines, containerization technologies (e.g., Docker, Kubernetes), and monitoring tools (e.g., Prometheus, Grafana).
  • Ability to work independently and as part of a team, with a proactive approach to identifying opportunities for improvement.
  • Strong attention to detail and commitment to delivering high-quality, reliable technical solutions that align with business goals.
  • Preferred: Familiarity with ITIL frameworks and incident management best practices.
  • Ability to adapt to evolving technologies and continuously improve system performance and security protocols.
MyJob.one - Remote work. Real impact

New Things Will Always
Update Regularly

MyJob.one - Remote work. Real impact