INITIALIZING SYSTEM...
LAT: 30.0444° N LON: 31.2357° E CAIRO — EGYPT
SYS: OPERATIONAL VER: 2025.1.0
<PORTFOLIO/>

AHMED SHABAN

|

Data Engineer · Data Scientist · AI Practitioner

GPA: 3.88 / 4.0 STATUS: AVAILABLE
PROJECTS: 020 CERTS: 09+
SCROLL
// 01

ABOUT_ME

Ahmed Shaban
ENGINEER.PROFILE

Results-driven Data Engineer and aspiring Data Scientist with a strong academic foundation in Business Information Systems.

GPA of 3.88/4.0 (Distinction Standing) at Capital University, Cairo. Currently advancing expertise through Egypt's premier Digital Egypt Pioneers Initiative (DEPI) in the Data Engineering track as a team leader.

Proven ability to design ETL pipelines, build relational databases, and deploy machine learning models, bridging technical excellence with effective leadership to deliver real-world impact.

20+
Projects
9+
Certifications
3.88
GPA / 4.0
91%
AI for Business
// 02

EDUCATION_LOG

🎓 CAPITAL UNIVERSITY
FACULTY OF COMMERCE · CAIRO
B.Sc. DEGREE ★ DISTINCTION

Business Information Systems (BIS)

🏛
Capital University – Faculty of Commerce Cairo, Egypt
DURATION September 2023 – July 2027
EXPECTED 2027
CUMULATIVE GPA
3.88 / 4.0
CORE COURSEWORK
Systems Analysis Database Management Data Visualization Business Statistics Business Intelligence Information Systems
// 03

SKILL_STACK

DATA ENGINEERING
ETL / ELT Pipelines Apache Spark Apache Kafka Apache Airflow Data Modeling Data Warehousing
🗄 DATABASES
SQL / T-SQL SQL Server PostgreSQL NoSQL Database Design Normalization ERD Design
🤖 AI & MACHINE LEARNING
Scikit-learn TensorFlow Deep Learning CNN / SVM / FNN Generative AI Prompt Engineering Hugging Face LLMs Transfer Learning
💻 PROGRAMMING & WEB
Python C++ HTML5 / CSS3 JavaScript PHP Pandas NumPy Data Analysis
TOOLS & CLOUD
Git & GitHub Docker AWS Azure Vercel Power BI Data Annotation
ALL TECHNOLOGIES
Python C++ T-SQL Apache Spark Apache Kafka Apache Airflow Pandas NumPy Data Analysis SQL Server PostgreSQL Database Design Machine Learning Deep Learning TensorFlow Scikit-learn Prompt Engineering Hugging Face Data Annotation HTML5 CSS3 JavaScript PHP Azure Vercel Power BI Docker Git & GitHub
// 04

PROJECT_LOG

PRJ-001
ACTIVE

Household Power Consumption Pipeline

End-to-end ETL pipeline ingesting, cleaning, and transforming large-scale household energy datasets — streamlined multi-source extraction with advanced SQL and Python analytics.

PythonSQLETLPandasNumPyGit
PRJ-002
COMPLETE
⚕️

Pharmacy Database Management System

Fully normalized relational database for pharmacy inventory and transaction management. Authored 20+ optimized T-SQL queries, reducing query response time significantly.

SQL ServerT-SQLERD DesignNormalization
PRJ-003
COMPLETE
🧬

Cancer Classification: ML & Deep Learning

Benchmarked 4 classification models (SVM, FNN, CNN, Transfer Learning) on real-world cancer data. Engineered preprocessing pipelines with advanced imbalance handling and evaluation.

TensorFlowScikit-learnCNNSVMTransfer Learning
PRJ-004
COMPLETE
🏨

Hotel Database Management System

Developed a fully normalized relational database to manage hotel reservations, guest information, room availability, and billing services with 20+ optimized T-SQL queries.

SQL ServerT-SQLERD DesignNormalization
PRJ-005
COMPLETE
💬

Amazon Reviews Sentiment Analysis

Developed ML models (SVM, FNN, CNN) to classify reviews as Positive, Negative, or Neutral. Applied text preprocessing, TF-IDF vectorization, and comprehensive model evaluation.

SVMNeural NetworksTF-IDFScikit-learn
PRJ-006
COMPLETE
🏠

Boston Housing Price Prediction

Built ML models to predict median housing prices. Applied correlation analysis, feature normalization, and trained Linear Regression and Random Forest Regressor models.

Linear RegressionRandom ForestScikit-learnSeaborn
PRJ-007
COMPLETE
🍄

Mushroom Classification

Implemented an ML solution to classify mushrooms as edible or poisonous using label encoding, Decision Tree, and Random Forest with feature importance visualizations.

Binary ClassificationLabel EncodingDecision TreeRandom Forest
PRJ-008
COMPLETE
🚢

Titanic Survival Prediction

Predictive models for Titanic passenger survival. Applied missing data cleaning, categorical encoding, and explored survival correlations with Logistic Regression and Random Forest.

Binary ClassificationCategorical EncodingLogistic RegressionRandom Forest
PRJ-009
COMPLETE
🌧️

Weather Data Analysis & Rain Prediction

Analyzed historical Australian weather data, performed feature engineering, and trained Logistic Regression, Random Forest, and Gradient Boosting for binary rainfall prediction.

Feature EngineeringScikit-learnPandasMatplotlibSeaborn
PRJ-010
COMPLETE
🤖

LLM Applications Development

Built applications leveraging Large Language Models and advanced prompt engineering to automate analytical tasks, using NVIDIA GPU-accelerated infrastructure for model inference.

Prompt EngineeringLLMsPythonAI
// 05

MISSION_LOG

PROGRAM

Data Engineering Trainee

Digital Egypt Pioneers Initiative (DEPI) – MCIT 2025 – Present

Architect and maintain scalable data pipelines using ETL/ELT design patterns, Apache Spark, and Apache Airflow. Collaborate within a 6-person agile squad to engineer real-time data ingestion frameworks using advanced Python and optimized SQL. Deploy cloud-based data storage solutions, managing the full project lifecycle from requirements to stakeholder reporting.

TRAINING

AI for Business Trainee

ITIDA & NTI · 120-Hour Intensive Applied AI Program Jul – Aug 2025 · Score: 91% (Distinction)

Achieved a top-tier score of 91% by mastering applied AI concepts for enterprise contexts. Automated complex financial and operational reporting workflows using Python. Designed data-driven strategic recommendations from real-world business case studies, integrating quantitative insights with executive-level storytelling.

INTERNSHIP

Corporate Summer Intern — Green Leap Program

Commercial International Bank (CIB) Jul 2025

Participated in a structured corporate internship focusing on core banking operations, enterprise IT infrastructure, and digital transformation strategy. Analyzed CIB's data governance frameworks and technology adoption roadmaps, engaging cross-functional teams to understand business-IT alignment in compliance-sensitive environments.

TRAINING

Generative AI Developer (Applied Training)

ITI in Collaboration with NVIDIA 2025

Architected and deployed a production-grade text-to-image Generative AI application leveraging Hugging Face open-source models. Applied advanced prompt engineering and model fine-tuning techniques with NVIDIA GPU-accelerated development tools for enterprise-scale deployment.

TRAINING

Web Development & Database Trainee

Information Technology Institute (ITI) Jul 2024

Designed and implemented relational database schemas using PostgreSQL to support full-stack web application backends. Built responsive front-end interfaces using HTML5 and CSS3, integrating backend logic via Python to deliver complete end-to-end web solutions.

CERTIFICATIONS

NVIDIA - Building LLM Applications
🔍 VIEW
NVIDIA

Building LLM Applications with Prompt Engineering

Sep 2025
ITI - Transact-SQL
🔍 VIEW
ITI

Transact-SQL Queries using SQL Server

Sep 2025
DEPI Coaching Phase
🔍 VIEW
DEPI

Completing the tasks in the coaching phase

Apr 2026
CIB Training
🔍 VIEW
CIB

Core banking principles & digital transformation

Sep 2025
ITI - Python & Web Dev
🔍 VIEW
ITI

Python and Web Development

May 2024
NTI - ML & DL
🔍 VIEW
NTI

Building Machine Learning and Deep Learning Models

July 2025
NVIDIA - AI for All
🔍 VIEW
NVIDIA

AI for All: From Basics to GenAI Practice

Aug 2025
NVIDIA - Deep Learning
🔍 VIEW
NVIDIA

Getting Started with Deep Learning

Aug 2025
ITI - Deep Learning
🔍 VIEW
ITI

Deep Learning and LLM

Aug 2025
// 06

ESTABLISH_CONTACT

Ready to collaborate on high-impact data and AI projects. Open to opportunities worldwide.

CURRENTLY AVAILABLE FOR OPPORTUNITIES
SYSTEM STATUS
RESPONSE TIME < 24 HOURS
TIMEZONE UTC +2 (EET)
OPEN TO Full-time / Internship
STATUS ● OPEN TO WORK