Balbix is looking for a Staff DevOps Engineer to join our growing team. This role is critical to setting up our engineers for success and to ensuring that the company delivers a best-in-class, scalable platform. It requires working in various hosting environments and technology stacks, with a specific focus on AWS and a commitment to Infrastructure as Code (IaC) and continuous deployment. You will also assist in configuring and installing IT and engineering infrastructure (on-premises and cloud).
The ideal candidate should possess a consistent track record of massively scaling applications and infrastructure, solving complex problems in AWS, IaC, and CI/CD automation, and quickly learning and adapting to constantly evolving technologies with AWS and other cloud providers.
You will
Work with the existing DevOps team to design and develop IaC components for the Balbix solution and internal engineering tools running in AWS.
Build and deploy a state-of-the-art security SaaS platform using the latest CI/CD techniques, which are fully automated, repeatable, and secure.
Administer Linux systems at scale using automation.
Secure infrastructure using security best practices (TLS, bastion hosts, certificate management, authentication and authorization, network segmentation, etc.).
Work with the existing DevOps team to design, develop and manage deployments on several Kubernetes clusters.
Manage, maintain, and monitor our infrastructure.
Work with the existing DevOps team to design and implement a consistent logging, monitoring, and diagnostic system for the Balbix solutions.
You have
8+ years of experience in DevOps/Platform Engineering
4+ years of experience setting up infrastructure in AWS for a SaaS-based product development organization
Solid understanding of the AWS infrastructure and working with services such as load balancers (NLB/ALB/ELB), IAM, KMS, Networking, EC2, CloudWatch, CloudTrail, Lambda, etc.
4+ years of experience building infrastructure using Terraform
3+ years of solid experience with Kubernetes and Helm
Excellent knowledge of configuration management systems such as Ansible
Knowledge of CI/CD code management and deployment technologies such as GitLab and Docker
Familiarity with Nginx, HAProxy, Kafka, and other components used in public cloud environments
Experience with the Grafana/Prometheus/LGTM stack is a plus
MLOps experience is a plus
Experience deploying and managing Aurora RDS (Postgres), ElastiCache, Cassandra, OpenSearch/Elasticsearch, and ClickHouse is a plus
Experience building/working on the latest AI technologies for infrastructure management is a plus
Excellent time management skills, with the ability to stay focused and calm under pressure when meeting competing deadlines and to quickly change priorities when needed
Ability to communicate clearly and effectively, both in writing and verbally