Job Description
We are seeking a highly skilled Senior DevOps Engineer to join our team. The ideal candidate will be responsible for ensuring the smooth operation of our peer-to-peer (p2p) nodes and traditional applications with zero downtime. You will work closely with developers to troubleshoot issues, secure our systems, and create robust monitoring solutions.
Key Responsibilities
- Ensure our p2p nodes and traditional apps are running as smoothly as possible with zero downtime
- Troubleshoot all deviations from "zero downtime" alongside developers, including but not limited to debugging
- Communicate with external community members and partners in troubleshooting their issues and participating in the Allora ecosystem as peer nodes
- Secure our systems from common web2 vulnerabilities (e.g. DDoS)
- Create monitoring and alerting systems to preempt all potential issues
- Participate in on-call duties alongside developers
- Proactively identify the need for and create automations
- Aid in software engineering responsibilities (mostly Golang, some Python and Shell)
Required Skills
- Strong experience in DevOps practices and principles
- Proficiency in Golang, Python, and Shell scripting
- Experience with monitoring and alerting systems
- Knowledge of security best practices and vulnerability prevention
- Excellent troubleshooting and debugging skills
- Ability to work collaboratively with developers and external partners
- Experience with peer-to-peer networks and distributed systems is a plus
Additional Information
This position requires participation in on-call rotations and the ability to proactively identify areas for automation and improvement. The ideal candidate will be a self-starter with a strong problem-solving mindset and a passion for maintaining high-availability systems.