Job Description
TS/SCI w/Polygraph required Approved for 60% telework
06-10-SRE
Description: DevOps refers to a software development concept that unites and brings together developers and IT staff. The DevOps approach involves consistent, small edits to software coding. This means frequent updates and testing of software that results in very quick releases. DevOps is a culmination of two practices: Development and Operations. A Site Reliability Engineer is an expert who utilizes the DevOps methodology and integrates IT operations into software management and deployment. They ensure that the DevOps strategy is well implemented.
The Site Reliability Engineer is expected to have a good understanding of the software development lifecycle, know automation tools for developing digital pipelines (CI - Continuous Integration / CD - Continuous Deployment), and have classical system administration experience. They are expected to work across departments with managers, developers, and administrators to improve our software products for the customer.
Position Specific Skills: Experience with automating the deployment, scaling, and management of containerized applications using Kubernetes and related tools. Collaborate with developers to create and maintain CI/CD pipelines to ensure fast and efficient delivery of software. Troubleshoot and resolve issues related to Kubernetes infrastructure, applications, and networking. Experience coordinating with development teams to streamline code deployment. Conduct system tests for security, performance, availability, and reliability. Ensure the stable performance of the infrastructure in a large-scale setting, and know how to scale that infrastructure.
Required Skills: - Design and deploy Kubernetes clusters in a highly available and scalable manner. Includes unit testing, deployment, monitoring and reporting.
- Create container images and Helm Charts
- Deploying Docker images and configuring them on Kubernetes.
- Implement and maintain monitoring, logging, and alerting solutions to ensure visibility and control over the Kubernetes environment.
- Evaluate new technologies and tools to improve the Kubernetes-based infrastructure and provide recommendations to the team.
- Troubleshoot pod issues deployed in Kubernetes.
- Document internal processes and procedures related to duties and responsibilities.
- Automate the deployment, scaling, and management of containerized applications using Kubernetes and related tools.
- Collaborate with developers to create and maintain CI/CD piplines to ensure fast and efficient delivery of software.
- Troubleshoot and resolve issues related to Kubernetes infrastructure, applications, and networking.
- Coordinate with development teams to streamline code deployment.
- Conduct systems tests for security, performance, availability, and reliability.
- Ensure code quality, test and distribute code updates, and monitor the health and stability of deployed products.
- Ensure the stable performance of the infrastructure in a large-scale setting and know how to scale that infrastructure.
- Have the ability to multi-task and adapt to changes quickly
- Have high level problem-solving and excellent communication skills.
LCAT Qualifications: Five (5) years of experience in programs and contracts of similar scope, type, and complexity is required and a Master's degree in Computer Science or related discipline from an accredited college or university is required; OR Eight (8) years of experience in programs and contracts of similar scope, type and complexity and a Bachelor's degree in Computer Science or related discipline from an accredited college or university is required.
Two (2) years of additional SRE experience on projects with similar software processes may be substituted for a bachelor's degree.
Minimum three (3) years of experience with administering Docker, programming with C++ and/or Python, and programming on/and administering Linux servers is required.
Akina is a Woman Owned, Service Disabled, Veteran Owned, Small Business, looking for talented and ambitious individuals to join our team. We offer a generous compensation package that includes 24 days PTO accrued annually and 11 federal holidays. Our 401k is 100% vested on your start date and the company makes a direct contribution worth 10% of your salary. Akina covers 100% of healthcare costs for employees and 50% toward dependents. We offer educational assistance towards college classes and will cover costs associated with job related training and certifications Akina is committed to excellence and creating innovative and flexible solutions for our clients. We are a small company with an open ear to our employees' needs in order to attract and retain quality talent that enables our customer's mission.
Job Tags
Holiday work, Remote job, Flexible hours,