3Line commenced business in 2007 with the primary aim of using technology to make financial services easily available to the financially excluded. We believe this is a basic human right, and 3Line is our platform for achieving this purpose. We focus on four key areas: Electronic Banking, Issuer Processing, Agency Banking, and Identity Management, all driven by our Card Scheme. Our vision is to be the second largest driver of e-Services, second only to the internet, and through that to make financial services easily available to the financially excluded as a means of social reformation and transformation. Our mission is to operate an end-to-end electronic payment system that allows organizations, government agencies, and individuals to collect, bank, transfer, or withdraw money electronically anytime, anywhere, keeping money safe and secure at all times. To do this, we open and operate secure platforms that connect organizations with people, introduce tailor-made products and services that meet the needs of those they serve by design functions and locations, and educate the financially excluded on the steps to take to achieve financial freedom.
About the job
- A Site Reliability Engineer (SRE) is responsible for ensuring the reliability, scalability, and efficiency of systems in production. The role combines software engineering and systems administration to optimize performance, automate operations and manage incidents effectively.
Job Responsibilities
- Design, implement, and maintain highly available and scalable distributed systems.
- Define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to maintain system reliability.
- Lead incident response, root cause analysis, and post-mortems to improve system stability.
- Implement automated failure detection, alerting, and self-healing mechanisms.
- Automate infrastructure provisioning, scaling, and recovery using tools like Terraform, Pulumi, or CloudFormation.
- Set up and manage monitoring, logging, and tracing tools like Prometheus, Grafana, Datadog, and OpenTelemetry.
- Conduct capacity planning and performance tuning to optimize infrastructure and applications.
- Manage ECS & Kubernetes clusters (EKS, AKS, GKE) and optimize workloads for performance and cost-efficiency.
- Document architectures, workflows, and operational procedures to improve team efficiency.
- Enforce security best practices for cloud environments, networking, and access control (IAM, RBAC).
- Cost Optimization: Analyze and reduce cloud infrastructure costs while maintaining performance.
- Disaster Recovery: Implement resilience testing strategies and simulate failures to improve system robustness.
- Develop and enhance CI/CD pipelines for automated deployments using Jenkins, GitLab, AWS CodePipeline, and Azure Pipelines.
Qualifications
- BSc in Computer Science/Engineering, Information Technology, Computer Information Systems, or a related field.
- Must have 3-5 years of experience in a similar role, especially in the fintech industry.
- CI/CD & Automation - AWS CodePipeline, GitLab, Jenkins, Travis CI, Azure Pipelines.
- Cloud Infrastructure - AWS, Azure
- Containerization & Orchestration - Docker, Kubernetes (AKS, EKS, GKE)
- Scripting & IaC - Bash, PowerShell, Python, Terraform, CloudFormation.
- Monitoring, Logging & Observability - Prometheus, Grafana, Datadog, Splunk, etc.
- Configuration Management - Ansible, Chef, Puppet, SaltStack.
- Security & Compliance - IAM, Certificates, Secrets, Firewalls, PCI DSS, ISO 27001