GuirassyFode/README.md

Hi there, I'm Fode Guirassy 👋

🚀 Data & AI Engineer | Azure DP-203 In Progress | Building Scalable Data Pipelines

LinkedIn GitHub followers


👨‍💻 About Me

I'm a Data & AI Engineer passionate about designing and building end-to-end data solutions, from raw ingestion to AI-powered insights. I specialize in cloud-native data architectures, real-time streaming pipelines, and machine learning integrations using modern data stack technologies.

  • 🏗️ Building production-grade ETL/ELT pipelines on Azure, AWS & GCP
  • 🤖 Developing AI/ML-powered data workflows & RAG (Retrieval-Augmented Generation) systems
  • ⚡ Engineering real-time streaming solutions with Apache Spark, Kafka & Flink
  • 🗄️ Designing dimensional data models (Star/Snowflake schema) for analytics
  • 📊 Optimizing data platforms for scalability, reliability, and performance

🛠️ Tech Stack

☁️ Cloud Platforms

Azure AWS GCP

🔧 Data Engineering

Apache Spark Apache Kafka Apache Airflow dbt

💾 Databases & Storage

PostgreSQL MySQL Cassandra Snowflake

🤖 AI / ML

Python LangChain OpenAI Scikit-learn

🐳 DevOps & Infrastructure

Docker Kubernetes Git


🎯 Certifications & Learning

  • πŸ“š Microsoft Azure Data Engineer Associate (DP-203) β€” In Progress

📌 Featured Projects

| Project | Description | Tech Stack |
| --- | --- | --- |
| 🔥 Apache Spark Portfolio | End-to-end Spark data engineering solutions with local vs. global sort optimizations | PySpark, Scala |
| ☁️ Azure Data Engineer (DP-203) | Azure-based ETL/ELT pipelines (preparation for DP-203 certification) | Azure Data Factory, Synapse, ADLS |
| 🤖 AI Chat RAG Workflow | Retrieval-Augmented Generation pipeline for intelligent document Q&A | Python, LangChain, OpenAI |
| 📰 News Trend Data Pipeline | Real-time news trend ingestion and analytics pipeline | Python, Airflow, Kafka |
| 🗄️ Dimensional Modeling - NBA | Star schema dimensional model for NBA analytics | SQL, PostgreSQL |
| ☸️ Kubernetes Data Engineer | Containerized data pipeline deployment with Kubernetes | Kubernetes, Docker, Python |
| 📊 SQL Deep Dive | Advanced SQL techniques: window functions, CTEs, optimization | SQL, Jupyter Notebook |
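
As a rough illustration of the window-function and CTE techniques listed for SQL Deep Dive, here is a minimal sketch using Python's built-in `sqlite3` module. The `game_points` table, its columns, the sample rows, and the `top_scorer_per_team` helper are all invented for this example and are not taken from the repository. Window functions need SQLite 3.25 or newer.

```python
import sqlite3


def top_scorer_per_team(rows):
    """Return (team, player, points) for the highest scorer on each team."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema, used only for this illustration.
    conn.execute(
        "CREATE TABLE game_points (team TEXT, player TEXT, points INTEGER)"
    )
    conn.executemany("INSERT INTO game_points VALUES (?, ?, ?)", rows)
    query = """
        WITH ranked AS (              -- CTE: names an intermediate result
            SELECT team, player, points,
                   RANK() OVER (      -- window function: rank within each team
                       PARTITION BY team ORDER BY points DESC
                   ) AS rnk
            FROM game_points
        )
        SELECT team, player, points
        FROM ranked
        WHERE rnk = 1
        ORDER BY team
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


# Hypothetical sample data.
SAMPLE_ROWS = [
    ("A", "p1", 30),
    ("A", "p2", 22),
    ("B", "p3", 18),
    ("B", "p4", 25),
]

print(top_scorer_per_team(SAMPLE_ROWS))
```

The CTE keeps the ranking step separate from the final filter, which is needed here because a window function's result cannot be referenced in the `WHERE` clause of the same `SELECT`.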

📈 GitHub Stats

GuirassyFode's GitHub Stats Top Languages


📫 Let's Connect

I'm always open to discussing data engineering, AI/ML projects, cloud architecture, or opportunities in consulting and technology.

LinkedIn Email


⭐ "Turning raw data into actionable intelligence β€” one pipeline at a time."

Pinned

  1. azure-dp-203-data-engineer-azure Public

    Azure DP-203 Data Engineer certification prep: Azure Data Factory, Synapse Analytics, ADLS Gen2, Stream Analytics, Databricks & Delta Lake pipelines

    1

  2. kubernetesDataEngineer Public

    Kubernetes-orchestrated data engineering platform: containerized ETL pipelines, Helm charts, pod autoscaling & cloud-native data workflow deployment

    Python 1

  3. SQL-Deep-Dive- Public

    Advanced SQL mastery: window functions, CTEs, recursive queries, query optimization, indexing strategies & analytical patterns for data engineering interviews

    Jupyter Notebook 1

  4. Apache-Spark-Data-Engineering-Portfolio Public

    Production-grade PySpark data engineering solutions: ETL pipelines, sorting optimization, Spark SQL, Azure ADLS integration & dimensional modeling

  5. my-ai-chat-rag-workflow Public

    RAG-powered AI chat workflow using LangChain & OpenAI for intelligent document Q&A: a retrieval-augmented generation pipeline with vector embeddings

  6. News_trend_data_pipeline Public

    End-to-end containerized data pipeline for real-time news trend ingestion, transformation, data quality checks & alerting using Docker and Apache Airflow
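
The Spark portfolio above mentions sorting optimization, and its project description contrasts local and global sorts. The difference can be sketched in plain Python, with lists standing in for partitions. This is a conceptual illustration only, not code from the repository: the `local_sort` and `global_sort` helpers, the sample data, and the fixed range boundaries are assumptions made for the example (Spark chooses its own range boundaries). In PySpark the two operations correspond roughly to `DataFrame.sortWithinPartitions` and `DataFrame.orderBy`.

```python
from bisect import bisect_right


def local_sort(partitions):
    """Sort each partition independently; no data moves between partitions."""
    return [sorted(part) for part in partitions]


def global_sort(partitions, boundaries):
    """Range-partition values by `boundaries` (the shuffle), then sort locally.

    Values <= boundaries[0] go to bucket 0, values <= boundaries[1] go to
    bucket 1, and so on, so reading the buckets in order gives a total order.
    """
    shuffled = [[] for _ in range(len(boundaries) + 1)]
    for part in partitions:
        for value in part:
            shuffled[bisect_right(boundaries, value)].append(value)
    return local_sort(shuffled)


# Hypothetical data: three unsorted partitions.
PARTITIONS = [[9, 1, 5], [4, 8, 2], [7, 3, 6]]

print(local_sort(PARTITIONS))                      # ordered within each partition only
print(global_sort(PARTITIONS, boundaries=[3, 6]))  # ordered across partitions as well
```

A local sort avoids the shuffle and is enough when only per-partition order matters, for example before writing sorted files per partition. A global sort pays for the data movement to get a total order.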