Job Description:
As a Site Reliability Engineer at ACCREVENT, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems. You will work closely with development and operations teams to build resilient infrastructure, automate workflows, and enhance system observability.
Your Day-to-Day:
- Design and implement scalable infrastructure solutions.
- Develop automation scripts to improve deployment and monitoring.
- Troubleshoot system failures and optimize performance.
- Collaborate with development teams to enhance application reliability.
- Ensure high availability and disaster recovery planning.
Must-Have Skills & Experience:
- 4+ years of experience in SRE, DevOps, or cloud engineering.
- Strong knowledge of cloud platforms (AWS, Azure, or GCP).
- Experience with infrastructure as code (Terraform, Ansible).
- Proficiency in scripting languages (Python, Bash, or Go).
- Hands-on experience with Kubernetes, Docker, and CI/CD pipelines.
Positional Attributes:
- Passion for automation and system optimization.
- Strong troubleshooting and problem-solving skills.
- Ability to work in a collaborative and high-paced environment.
If you are interested in joining ACCREVENT and believe you are a great fit for this role, we’d love to hear from you! Please send your updated resume along with a brief cover letter to info@accrevent.com with the job title in the subject line. Our team will review your application and get in touch with qualified candidates. We look forward to welcoming you to ACCREVENT!