Summary
Overview
Work History
Education
Skills
Websites
Certification
Infrastructure Management
Micro Services
AWS and Azure Skills
On Premise Datacenters
Internship
Interests
Timeline
Generic

Givemore Sibanda

Site Reliability Engineer At BMWITHub SA
Midrand

Summary

9 Years experienced Site Reliability Engineer, Linux administrator, Cloud professional architect. Extensive experience in designing, implementing, and managing scalable, highly available, and secure cloud-based systems. Proficient in leveraging cloud platforms such as AWS, Azure to architect robust infrastructure solutions that optimize performance, cost, and reliability. Skilled in automation, CI/CD pipelines, containerization and infrastructure-as-code to streamline operations and enhance system resilience. Strong problem-solving skills with a proven track record in reducing downtime, improving system efficiency, and driving operational excellence. Collaborative team player with expertise in bridging development and operations, fostering DevOps cultures.

Overview

8
8
years of professional experience
6
6
Certifications

Work History

SRE and Cloud Specailist

BMWITHub SA
Johanesburg
05.2023 - Current
  • Designed and implemented highly available, scalable, and fault-tolerant systems on Azure cloud and on premise environment.
  • Manage application workload and automatic resource scaling using HPA on AKS.
  • Reduced system downtime through proactive monitoring and automation using Grafana and Prometheus.
  • Developed and maintained infrastructure-as-code (IaC) using Terraform enabling rapid deployment of cloud resources.
  • Optimized CI/CD pipelines using GitHub Actions, improving deployment frequency and minimizing rollbacks
  • Lead incident response and post-mortem analyses, implementing preventive measures that reduced critical incidents through defect and PBI management.
  • Collaborated with development teams to ensure applications are designed for reliability, scalability, and observability in cloud environments
  • Managed and maintained 7 AKS clusters running over 400 service, maintaining reliability, availability and security of applications.
  • Run daily health check calls with L2 teams to review platform stability.

Site Reliability Engineer

Adumo
Johanesburg
06.2022 - 04.2023
  • Managed and implemented infrastructure and services on AWS cloud.
  • AWS Services/Infrastructure provisioning.
  • Software deployment.
  • Monitoring.
  • Hybrid connectivity configuration.
  • Cost management.
  • Access control.
  • Infrastructure as Code.
  • Python scripting.
  • Java infrastructure support.
  • Docker and Kubernetes Infrastructure administration.
  • Monitoring with Grafana, Prometheus, Nagios, PRTG, site24x7 Kibana – Configure effective monitoring based on application return codes and alerting for any abnormalities.
  • Log aggregation with Graylog/Elastic search – Configure parameters to enable effectively and informative data logging.
  • Docker – prepare the infrastructure and create docker files.
  • Create deployment pipelines and version control using Gitlabs, Bitbucket and ArgoCD.
  • Manage microservices using Kubernetes and Elastic Kubernetes.
  • Create logging patterns on Graylog.
  • Monitor using Prometheus/Grafana and Kibana.
  • Cisco switch and router management and configuration.
  • Firewall management and configuration.
  • Network connectivity, failover testing and availability.
  • Core network links monitoring and configuration.
  • VMware virtual environment management.
  • Patch deployment.

Linux Administrator / Systems Engineer

Innervation/Adumo
Johanesburg
01.2018 - 06.2022
  • Company Overview: A South African company with presence is across Africa that specializes in payment switching and various value added services and products with an annual transaction value of R66 billion across 30 000 active clients.
  • Implemented, supported and maintained all the IT services, including networking, security, email, and disaster recovery.
  • Rendered technical support for desktop, server, and networking issues for the company (worked on Gsuite, O365 Asterick phone system, laptop configuration and office and core network implementations)
  • Managed 250+ installed systems and ensured the highest levels of systems & infrastructure availability.
  • Installed, configured & tested operating systems, application software & system management tools.(worked on VMware, Veeam backup, WSUS and Desktop Central patch management, Windows and Linux server provisioning and hardening).
  • Maintained security, backup, and redundancy strategies in order to maintain data safety and security.
  • Wrote & maintained custom scripts to increase system efficiency and lower manual effort.
  • Liaised with vendors & IT personnel to ensure that all the systems are working properly; increased productivity.
  • Planned and implemented systems automation to increase the efficiency and reduce cost.
  • Maintained the technical inventory and maintained the constant availability of technical resources.
  • Ensured all IT services are properly maintained and upgraded, including anti-virus, backups, imaging, and patching.
  • Currently responsible for our growing adaptation of cloud services.
  • Managing running instance and new implementations on AWS. (provisioning of resource and IAM policies as well as finding ways to lower costs.
  • Automating repetitive tasks.
  • Migration to hosted telephony systems ( from Astericks to 3CX and from 3CX to Teleforge)
  • Migration of datacenter from office premises to Teraco datacenter.
  • Migration from Sonicwall firewall to Fortigate firewall.
  • Network planning and installation of new office systems including firewalls and switches.
  • A South African company with presence is across Africa that specializes in payment switching and various value added services and products with an annual transaction value of R66 billion across 30 000 active clients.

Operations Support Intern

Innervation VAS
Johanesburg
02.2017 - 01.2018
  • Provided technical support to end-users and installed Microsoft Software applications in 50+ systems.
  • Installed and supported in house telephony system using Asterisk software saving the company money by running free in house systems.
  • Linux administrator for over 300 remote servers with a revenue on millions.
  • 2nd line support for all our clients on technical issues.

Education

Advanced Diploma - Telecommunication Systems

City & Guilds
London

National Certificate - Electronic Communication Systems

Bulawayo Polytechnic

Skills

  • Operating Systems: Windows, Linux

  • Web Server: Apache, Nginx

  • Cloud Platforms: AWS cloud and Azure

  • Scripting: Python and bash

  • Containerization: Docker, Azure Kubernetes,

  • Infrastructure Tools: Terraform, Ansible, Helm

  • CI/CD : ArgoCD, GitHub Actions, git, bitbucket

  • Methodologies: DevOps, Agile, Operations Expect

Certification

Redhat Certified Systems Administrator

Infrastructure Management

  • Monitoring with Grafana, Prometheus, Nagios, PRTG, site24x7 Kibana – Configure effective monitoring based on application return codes and alerting for any abnormalities.
  • Log aggregation with Graylog/Elastic search – Configure parameters to enable effectively and informative data logging.

Micro Services

  • Docker – prepare the infrastructure and create docker files.
  • Create deployment pipelines and version control using Gitlabs, Bitbucket and ArgoCD.
  • Manage microservices using Kubernetes and Elastic Kubernetes.
  • Create logging patterns on Graylog.
  • Monitor using Prometheus/Grafana and Kibana.

AWS and Azure Skills

  • EKS
  • EC2
  • Lambda
  • S3
  • Transit gateway
  • Virtual Gateway
  • API Gateway
  • ECS
  • Fortigate
  • IAM
  • RDS
  • IGW
  • NAT Gateway
  • CloudWatch
  • Cloudtrail
  • Route 53
  • ELB
  • VPC

On Premise Datacenters

  • Cisco switch and router management and configuration.
  • Firewall management and configuration.
  • Network connectivity, failover testing and availability.
  • Core network links monitoring and configuration.
  • VMware virtual environment management.
  • Patch deployment.

Internship

Operations Support Intern, Innervation VAS, 02/01/17, 01/31/18, Provided technical support to end-users and installed Microsoft Software applications in 50+ systems., Installed and supported in house telephony system using Asterisk software saving the company money by running free in-house systems., Linux administrator for over 300 remote servers with a revenue on millions., 2nd line support for all our clients on technical issues.

Interests

I am a friendly easy going individual who enjoys following various sporting events like soccer, athletics and auto sports I read and watch a lot of online blogs about technology and popular people lifestyle I jog and use the gym on a weekly basis to maintain my health

Timeline

SRE and Cloud Specailist

BMWITHub SA
05.2023 - Current

Site Reliability Engineer

Adumo
06.2022 - 04.2023

Linux Administrator / Systems Engineer

Innervation/Adumo
01.2018 - 06.2022

Operations Support Intern

Innervation VAS
02.2017 - 01.2018

Advanced Diploma - Telecommunication Systems

City & Guilds

National Certificate - Electronic Communication Systems

Bulawayo Polytechnic
Givemore SibandaSite Reliability Engineer At BMWITHub SA