Description
About The Role
As a Site Reliability Engineer at Abnormal Security, you are opinionated about the discipline of Reliability Engineering. You are curious and that curiosity has led you to read and explore how different organizations tackle the challenges inherent in balancing #velocity and #excellence.
At the same time, you are pragmatic and able to prioritize the most impactful work while ensuring that the reliability of the overall system is improving over time. You are committed to building systems that are resistant to failures, able to recover quickly when failures occur, and elastic based on the company's needs.
What You'll do
Build and run services that improve the overall reliability of Abnormal's platform
Serve as a subject matter expert in observability and monitoring
Collaborate with software engineering and operations teams to develop and implement robust, scalable, and efficient infrastructure solutions.
Automate infrastructure and configuration management
Document operational procedures, guidelines, and troubleshooting steps to maintain a comprehensive knowledge base.
Conduct retrospectives of production infrastructure incidents
Participate in an on-call rotation
Work with AWS, Azure, Golang, Kubernetes, Golang, Python, and Terraform
Must have Skills
1-2 years experience as an individual contributor in software engineering roles
1-2 years of programming experience, preferably in Python or Golang
Experience with git-based source control solutions such as GitHub, GitLab, Jenkins, etc.
Strong understanding of Linux/Unix systems, networking, and security principles.
Familiarity with containerization technologies such as Docker and container orchestration platforms like Kubernetes.
Knowledge in public cloud environments such as AWS, Azure, or GCP
Experience with database technologies like MySQL, PostgreSQL, or MongoDB.
Nice to have Skills
Experience providing CI/CD pipelines for an organization using tools such as GitHub Actions, GitLab, Jenkins, etc.
Experience with monitoring and observability tools such as Prometheus, Grafana, or ELK stack.
Experience with build tooling such as bazel, pants, maven, etc.
#LI-MT1