Summary
Overview
Work History
Education
Skills
Timeline
Comments
Contact Info
Skills Level
Profile
Contact Info
Profile
Hi, I’m

Ntsako Ngobeni

Data Engineer
Midrand

Summary

Experienced Data Engineer with a diverse background in finance, health research, education, and government services. Proficient in designing and implementing secure, high-volume data infrastructures using AWS and open-source tools for on-premise solutions. Expert in developing data pipelines (ETL/ELT), data lakes, and data warehouses, with a strong focus on ensuring data quality and security.


Skilled in collaborating with data scientists to build datasets for analysis, handling ad hoc requests, providing support, and automating processes. Recently, leveraging AI to conduct preliminary analytics on data.


Proven ability to lead teams, train junior engineers, and collaborate with stakeholders to deliver data-driven solutions. Experienced in building and maintaining data systems, backend development, and providing consultancy services to optimize data usage for strategic decision-making. Dedicated to leveraging data for impactful insights and driving project success across various sectors.

Overview

18
years of professional experience

Work History

Capitec Bank

Senior Data Engineer
3 2023 - Current

Job overview

Overview: Worked at Capitec Bank Business, focusing on designing and implementing ETL processes and leading the development of credit data products for data science and the credit data warehouse.

Responsibilities:

  • Designed and implemented high-volume data pipelines using AWS Glue, Lambda, Step Functions, Terraform, Airflow, and EMR.
  • Utilized Apache Spark for data processing within EMR.
  • Employed Redshift for data warehousing and S3 for the data lake.
  • Managed job scheduling with Control M.
  • Collaborated with data scientists in Credit Analytics, handling data acquisition, ingestion, and transformation.
  • Led the development of the credit data warehouse, translating data models into SQL queries for building dimensions and facts.
  • Provided training and support for junior data engineers.
  • Automated data loads with Apache Airflow and Control M, monitored via PowerBI.
  • Planned, guided, and led the data engineering team to ensure high performance and successful project delivery.
  • Collaborated with various teams to design solutions, ensuring alignment and consensus, with individual decisions made primarily for ad hoc requests.

Skills:

  • ETL Tools: AWS Glue, Lambda, Step Functions, Terraform, Apache Airflow, EMR
  • Data Processing: Apache Spark
  • Data Warehousing: Redshift, S3
  • Job Scheduling: Control M
  • Programming: Python, SQL
  • CI/CD: GitLab
  • Data Visualization: PowerBI

CENFRI

Consulting Data Engineer
1 2022 - 02.2023

Job overview

Overview: Cenfri is Part of the Mastercard Foundation team in Rwanda, focusing on digitizing Rwanda's economy by building and optimizing secure data infrastructures for government and private sector institutions.

Responsibilities:

  • Developed cost-efficient data lakes and analytical environments in AWS using open-source tools.
  • Utilized EC2 instances for:Apache Airflow (orchestrating Docker containers)
    Apache Spark (data processing cluster used by Airflow to get data from source)
    Metabase (reporting)
    OpenMetadata (data documentation)
    VPN (secure data access)
  • Hosted data lake on RDS Postgres.
  • Built reusable Python data pipelines for data ingestion.
  • Collaborated with data analysts and stakeholders to structure and optimize data for analysis.
  • Consulted with private and public sectors to implement and leverage data infrastructure for decision-making.

Skills:

  • Cloud Platforms: AWS (EC2, RDS)
  • Data Engineering: Python, Docker, Apache Airflow, Apache Spark
  • Data Analysis and Reporting: Metabase
  • Data Documentation: OpenMetadata
  • Database Management: PostgreSQL
  • Security: VPN configuration and management
  • Consultation: Stakeholder engagement, capacity building, infrastructure optimization

Cash Connect

Data Engineer/Analyst
04.2020 - 12.2021

Job overview

Overview: Cash Connect is a cash management and payments solution company that secures cash, provides risk cover, and facilitates supplier payments from retail cash vaults.

Responsibilities:

  • Built in-house data infrastructure on AWS for a data lake, integrating data from various business sources for historic reporting and BI.
  • Part of the software engineering team, contributing to backend system development.
  • Managed data sources including SAP, sales data, in-house applications, and data synchronization between systems.
  • Provided ongoing support and maintenance for parts of the systems as needed.
  • Utilized AWS tools and solutions similar to those used at Cenfri.

Skills:

  • Cloud Platforms: AWS
  • Data Engineering: Python, Docker, Apache Airflow, Apache Spark
  • Data Warehousing: Redshift, S3
  • ETL Tools: AWS Glue, Lambda, Step Functions, Terraform
  • Job Scheduling: Control M
  • Programming: Python, SQL
  • CI/CD: GitLab
  • Data Visualization: Metabase, PowerBI
  • Software Development: Backend system development

Yoco

Data Engineer/Analyst
01.2018 - 03.2020

Job overview

Overview: Yoco provides card machines, online payments, and business tools to help businesses start, run, and grow.

Responsibilities:

  • Built data pipelines into an existing data warehouse.
  • Worked on data requests and reporting using Tableau/Periscope.
  • Improved and enhanced the current infrastructure using AWS, Redshift, Python, and Apache Airflow.
  • Worked with large transaction databases from card payments, focusing on optimizing SQL query performance.

Skills:

  • Cloud Platforms: AWS
  • Data Engineering: Python, Apache Airflow
  • Data Warehousing: Redshift
  • Data Visualization: Tableau, Periscope
  • Programming: Python, SQL
  • Database Management: Handling large transaction databases and query optimization

STATSSA

System Developer
07.2016 - 12.2017

Job overview

Overview: Statistics South Africa is the national statistical service of South Africa, dedicated to producing timely, accurate, and official statistics to advance economic growth, development, and democracy.

Responsibilities:

  • Maintained applications and databases, implementing changes through change control and management.
  • Worked with C# applications and ASP.NET applications.
  • Managed SQL Server databases.
  • Created datasets and updated queries for analysts.
  • Performed application testing using automated testing tools.

Skills:

  • Programming: C#, ASP.NET
  • Database Management: SQL Server
  • Data Preparation: Dataset creation, query optimization
  • Application Testing: Automated testing tools
  • Change Management: Implementing application changes through controlled processes

Agincourt Research Unit

Data Specialist
01.2012 - 06.2016

Job overview

Overview: The Agincourt HDSS at the MRC/Wits Rural Public Health and Health Transitions Research Unit strengthens and extends a high-functioning health and socio-demographic surveillance system, serving as the scientific foundation for advanced research and intervention studies.

Responsibilities:

  • Developed and updated software for data entry systems.
  • Managed databases, including quality control, administration, security, and backups.
  • Extracted high-quality data for researchers and contributed to scientific publications.
  • Trained and managed junior team members.
  • Developed and managed GIS systems.
  • Created and maintained structures for GIS data storage and integrated the GIS database with the HDSS database.

Skills:

  • Software Development: Data entry systems
  • Database Management: Quality control, administration, security, backups
  • Data Extraction: Preparing data for research and publications
  • Team Management: Training and managing junior members
  • GIS Systems: Development, management, and data integration

Hoedspruit Independent

Collage Tutor
12.2010 - 12.2011

Job overview

Overview: Hoedspruit Independent College, where I tutored maths for grade 8 and taught Computer Application Technology (CAT), focusing on computer applications, MS products, and basic computer usage.

Responsibilities:

  • Tutored grade 8 maths.
  • Taught Computer Application Technology subject to Grade 10-12 learners, covering computer applications and MS products.
  • Developed a C# Windows application program for recording learner marks, which generated mark sheets, mark schedules, and learner reports used by all teachers.
  • Managed SQL Server for storing and managing data related to learner marks.

Skills:

  • Teaching: Maths, Computer Application Technology
  • Software Development: C# Windows applications
  • Database Management: SQL Server
  • Educational Tools: Development of programs for academic record-keeping and reporting

Dikgadi Projects

Data Administrator
12.2006 - 03.2009

Job overview

  • Data Collection, as a fieldworker for Municipal Mass property valuation
  • Promoted to Data capturing and Data auditing
  • Team leader for data entry clerks, and compiling reports for project manager.

Education

University Of South Africa
Pretoria, South Africa

Bachelor of Science from Computing (Computer Science & Information Systes)
04.2016 - 06.2016

Mafemani Senior Secondary
Thulamahashe

High School Diploma
04.2001

Skills

AWS Cloud Solutions (EC2 S3 Step Functions RDS Glue Lambda Redshift Security Groups VPC)

Timeline

Data Engineer/Analyst

Cash Connect
04.2020 - 12.2021

Data Engineer/Analyst

Yoco
01.2018 - 03.2020

System Developer

STATSSA
07.2016 - 12.2017

University Of South Africa

Bachelor of Science from Computing (Computer Science & Information Systes)
04.2016 - 06.2016

Data Specialist

Agincourt Research Unit
01.2012 - 06.2016

Collage Tutor

Hoedspruit Independent
12.2010 - 12.2011

Data Administrator

Dikgadi Projects
12.2006 - 03.2009

Mafemani Senior Secondary

High School Diploma
04.2001

Senior Data Engineer

Capitec Bank
3 2023 - Current

Consulting Data Engineer

CENFRI
1 2022 - 02.2023

Comments

Throughout my 16-year journey in data specialization, I've grown from being a data collector to leading data teams. With a Computer Science degree, I've embraced the shift to cloud technologies, particularly AWS. My toolkit includes SQL, EC2 for Docker containers, Postgres RDS, Redshift, S3, Kafka, and more. Python is my go-to for ETL, while Git keeps my projects organized and version-controlled. I integrate Slack for notifications and use Apache Superset/Metabase for reporting. My expertise lies in architecting data pipelines, backend engineering, and system planning, making me a valuable asset in the field. I use OpenMetaData for data documentation.

Contact Info

061 401 6487, ngobeni.pleasure@gmail.com, linkedin.com/in/plengo, facebook.com/braplengo, instagram.com/braplengo

Skills Level

Had appropriate training only, Limited Practical Experience, Solid Practical Experience, Extensive Experience, Expert Experience

Profile

Person, Pleasure Ntsako, Ntsako, Ngobeni, 37, Male, BSc. Computing, University of South Africa

Contact Info

061 401 6487, ngobeni.pleasure@gmail.com, linkedin.com/in/plengo, facebook.com/braplengo, instagram.com/braplengo

Profile

Ntsako, Ngobeni, 37, Male, BSc. Computing, University of South Africa

Ntsako NgobeniData Engineer