Skip to content
View MahdiNavaei's full-sized avatar

Block or report MahdiNavaei

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MahdiNavaei/README.md

Mahdi Navaei

AI Engineer • Agentic AI Architect • Data Scientist

Profile Views LinkedIn Email Kaggle

8+ years building production-grade AI systems | Tehran, Iran

Specializing in Agentic AI, LLMs, RAG Systems, and Enterprise ML Pipelines


🎯 What I Do

I architect and deploy production-grade intelligent systems that drive real business impact. My focus areas:

Domain Expertise
Agentic AI Autonomous agents, multi-step reasoning, task orchestration, tool integration
LLMs & NLP Fine-tuning, RAG architectures, NL2SQL, conversational AI, prompt engineering
Enterprise RAG Hybrid retrieval, re-ranking pipelines, knowledge management systems
Production ML End-to-end pipelines, real-time inference, MLOps, scalable deployments

🚀 Flagship Project 1

🤖 Production-Grade Agentic AI Framework

Vision + LLM + Event Sourcing • Local LLMs • LangGraph • HITL Safety • Autonomous Task Execution

Tests Python LangGraph License Docker

ARIA is not a prompt-chain demo or a single-purpose script — it's a full agentic AI system built for real-world automation: observe UIs with vision, plan with LLMs, act safely with human oversight, and learn from outcomes. Designed to run on local LLMs and consumer GPUs (8GB VRAM), with native English & Persian support for privacy-sensitive and resource-constrained environments.

Cognitive architecture — perception, reasoning, execution, and memory are separated and observable:

┌─────────────────────────────────────────────────────────────────────────┐
│                         ARIA — Cognitive Core                            │
├─────────────────────────────────────────────────────────────────────────┤
│  👁️ Eye (VLM/OCR)   →  Observe real interfaces • Screenshot • UIRef    │
│  🧠 Brain (LLM)      →  Plan, execute, observe • LangGraph • HITL gates  │
│  ✋ Hand (Actions)   →  Browser • Desktop • Playwright • PyAutoGUI       │
│  💾 Memory           →  Working + Episodic + Semantic (Redis • Qdrant)   │
│  📡 Event Bus        →  Kafka/Redpanda • Full audit trail & replay       │
│  📚 Learning         →  Extract skills & policies from successful runs   │
└─────────────────────────────────────────────────────────────────────────┘

Why ARIA stands out:

Pillar What it means for you
Vision-First VLM-powered UI understanding with multi-locator fallback — no brittle selectors only
Event-Sourced Every step persisted; full audit trail and replay for debugging and compliance
Human-in-the-Loop Safety gates for sensitive actions (login, CAPTCHA, payment) — production-safe by design
Local & Bilingual Run entirely on your hardware; native Farsi STT (Whisper) and embeddings
Production-Ready FastAPI + WebSocket API, Streamlit dashboard, Docker Compose, 81 tests

Tech stack: LangGraph • Ollama / OpenAI • Qwen-VL • Playwright • Redpanda (Kafka) • Redis • Qdrant • Mem0

The Job Apply automation (LinkedIn, Indeed) is the first production plugin — the platform is built for more.

🔗 Explore ARIA → • 📖 Docs, ADRs, and MODELS.md inside the repo


🚀 Flagship Project 2

📄 Governance-Safe Financial Document AI

Bilingual (EN/FA) • Quality Gates • Human-in-the-Loop Review • Replayable Lifecycle • Audit Endpoints

Release Python Next.js License FastAPI

InvoiceMind — Evidence-first invoice processing

InvoiceMind is not an OCR benchmark or a generic prompt demo — it's a production-oriented platform for invoice extraction, human review, and governance-safe automation. Built for teams where traceability and control matter more than blind automation. Most invoice AI fails in production because decisions are hard to trust, explain, and control; InvoiceMind tackles that gap head-on.

End-to-end flow — from ingestion to final export, with explicit gates and audit at every step:

┌─────────────────────────────────────────────────────────────────────────┐
│                    InvoiceMind — Pipeline & Lifecycle                    │
├─────────────────────────────────────────────────────────────────────────┤
│  📥 Ingestion → Validation → OCR/Layout → LLM Extraction → Postprocess  │
│  📊 Routing (quality gates) → Review / Quarantine → Export + Audit        │
├─────────────────────────────────────────────────────────────────────────┤
│  Run lifecycle: RECEIVED → VALIDATED → EXTRACTED → GATED →               │
│                 AUTO_APPROVED | NEEDS_REVIEW → FINALIZED                 │
│  Control: cancel • replay • quarantine (reason-coded) • audit/verify      │
└─────────────────────────────────────────────────────────────────────────┘

Why InvoiceMind stands out:

Pillar What it means for you
Evidence-first Policy and gate-based routing instead of confidence-only automation
Decision traceability Every auto-approve or escalate tied to gate results and reason codes
Replayable & auditable Full run lifecycle, cancel/replay, and audit endpoints for compliance and post-incident analysis
Local-first Privacy-first inference; versioned config bundles and model registry (models.yaml)
Safe defaults Quarantine and human review over aggressive auto-posting; NIST AI RMF & OWASP LLM–aligned

Tech stack: Python 3.11+ • FastAPI • Next.js 16 • React 19 • TypeScript • SQLAlchemy • Alembic • SQLite • AGPL-3.0

ADR-001 (local-first), ADR-002 (evidence-first), ADR-003 (policy-driven gates) — design documented in the repo.

🔗 Explore InvoiceMind → • 📖 Docs, run.bat one-click startup, API surface in README


💼 Featured Projects

🛡️ DriveShield — Real-Time Collision Risk Intelligence

End-to-end collision prediction platform using Nexar's BADAS-Open model.

  • State-of-the-Art Prediction: Real-time risk analysis with vision models
  • 100% Offline: Runs locally without external API calls
  • Production-Ready: FastAPI backend + React TypeScript frontend

Tech: Python • FastAPI • React • TypeScript • PyTorch • Computer Vision

🔗 View Repository →

🔄 Hybrid Retail Recommender System

Production-ready hybrid recommender combining collaborative filtering & content-based ML.

  • Results: 140% precision improvement, 175% recall improvement
  • Scale: Tested on 38K+ user dataset
  • Bilingual: English/Persian UI with RTL support

Tech: Python • FastAPI • React • TypeScript • scikit-learn • Docker

🔗 View Repository →

🌊 FlowCast — Surge Pricing & ETA Optimization Engine

Enterprise-grade intelligent pricing and ETA prediction for ride-hailing platforms.

  • ETA Accuracy: +20% improvement over baseline
  • Revenue: +10-25% efficiency per trip
  • Price Stability: 30-40% volatility reduction

Tech: Python • FastAPI • React • GeoPandas • Time-Series Forecasting

🔗 View Repository →

💊 Pharmaceutical Supply Chain Agentic AI

Four-agent system for supply chain optimization using LangGraph orchestration.

  • Logistics Costs: 40% reduction
  • Stockouts: 67% reduction
  • Forecast Accuracy: 95%+ (MAPE < 5%)

Tech: Python • FastAPI • LangGraph • Next.js • MongoDB • GPT-4o-mini

🔗 View Repository →

📚 More Projects
Project Description Tech
Blood Cell Cancer Detection CNN-based classifier with 99%+ accuracy TensorFlow • Keras • Medical Imaging
Books Recommendation System Production recommender, 8% sales increase Collaborative Filtering • scikit-learn
Stock Price Collection Automated data pipeline for finance ML Web Scraping • Database Design
CIFAR-10 Classification CNN image classifier, 90%+ accuracy TensorFlow • Keras • CNN

🛠️ Tech Stack

AI & LLM

OpenAI LangChain LangGraph Ollama RAG Transformers

ML & Data Science

Python PyTorch TensorFlow scikit-learn Pandas

Production & DevOps

FastAPI Docker Kubernetes PostgreSQL Redis

Frontend

React TypeScript Next.js


🏆 Achievements

Achievement Description
🥈 2nd Place — Tehran Provincial AI Competition (2022)
🎓 Member — Iran's National Elites Foundation
📜 Kaggle Notebooks Master
📄 Published Researcher — Health Science Reports (Wiley), ICVPR, AMLAI

📊 GitHub Stats

GitHub Stats GitHub Streak

📚 Publications


💼 Experience

Role Company Period
Senior AI/ML Engineer Daria Hamrah Paytakht Jul 2024 – Present
Senior AI/ML Engineer Educational Industries Research & Innovation Co Nov 2023 – Jul 2024
Data Science Team Lead Diar-e Kohan CO. Sep 2020 – May 2022
Data Scientist Diar-e Kohan CO. Sep 2018 – Sep 2020

🎯 Open To

  • 🚀 Building Agentic AI systems and LLM applications at innovative companies
  • 💼 Production-grade AI systems that solve real business problems
  • 🌍 Collaborating with international teams on cutting-edge AI/ML projects
  • 🤝 Remote positions, contract work, or full-time opportunities worldwide

Let's Connect

LinkedIn GitHub Kaggle Email


⭐ If you find my work interesting, please consider giving my repositories a star!

Building the future of AI, one system at a time.

Pinned Loading

  1. DriveShield DriveShield Public

    Real-time dashcam collision risk prediction with BADAS-Open, FastAPI backend, and a bilingual React dashboard.

    Python

  2. ParaBoostForest-Hybrid-Parallel-Boosting-for-Imbalanced-Learning ParaBoostForest-Hybrid-Parallel-Boosting-for-Imbalanced-Learning Public

    Reproducible credit-card fraud benchmark on Kaggle with Optuna tuning, PR-based thresholding, and publication-ready plots; compares ParaBoostForest, RF, and XGBoost.

    Python

  3. aria aria Public

    🤖 Production-grade Agentic AI Framework | Vision + LLM + Event Sourcing | Local LLMs | LangGraph | HITL Safety | Autonomous Task Execution

    Python

  4. InvoiceMind InvoiceMind Public

    Local-LLM, Evidence-first, Audit-ready Invoice Processing

    Python 4