[SYS.001] ORBITAL PROFILE

Mehdi Somrani

Data Engineer | Pipeline Reliability | BI Operations

Building resilient data platforms for high-volume decision systems with a focus on observability, speed, and operational clarity.

Python | SQL | Hadoop

[SYS.002]

MISSION LOG

TIMESTAMP: Aug 2023 - Present

Orange Tunisie

Data & BI Engineer

  • Perform data segmentation with SQL and PySpark for reporting on more than 10 million daily and hourly records.
  • Develop Customer Value Management scenarios for more than 3 million customers across 100+ campaigns.
  • Automate recurring ETL and reporting scripts, reducing ad hoc tasks and saving ~7 hours of hardware usage per week.
  • Build forecasting models achieving over 70% accuracy and perform trend analysis.
  • Mentor interns, all of whom scored over 85% on their end-of-studies projects, and assist with onboarding into the data team.
  • Reduced query runtime by replacing ~40% of classic conditions with built-in functions on Hadoop tables and views.
  • Optimized data pipeline for ~14 tables, improving query and processing efficiency by ~60%.
  • Contributed to more than 100 marketing campaigns with data-driven customer segmentation.

TIMESTAMP: Feb 2023 - May 2023

Orange Tunisie

Big Data Intern

  • Set up a multi-machine cluster (1 master, 2 slaves) on Ubuntu to support distributed processing.
  • Deployed Apache Kafka for real-time ingestion and PySpark for processing 100k+ hourly records.
  • Configured Elasticsearch and Kibana for hourly and weekly analytics dashboards.
  • Automated data flows using Apache NiFi and Shell scripting to create a reliable DataOps pipeline.
  • Delivered an end-to-end pipeline for KPI monitoring and forecasting with >85% accuracy.
  • Implemented a distributed Hadoop environment supporting large-scale data with 100k+ hourly records.
  • Enabled real-time analytics and automated ETL, replacing ~15 minutes of manual processing per update with fully automated hourly updates.
  • Built a reliable pipeline for KPI tracking, improving team efficiency and reporting accuracy.

TIMESTAMP: 2021 - Paused

Apoo Project | Personal Initiative

Graphic Designer & Community Manager

  • Managed social media accounts, scheduled posts, and engaged with followers to maintain an active community.
  • Designed illustrations, stickers, and visual assets to communicate educational messages across platforms.
  • Coordinated campaigns and collaborations with WHO for COVID-19 awareness and Croissant Rouge for a blood donation event.
  • Built a recognizable personal brand through consistent illustration, storytelling, and content management.
  • Collaborated with WHO to create COVID-19 awareness campaigns, using characters to educate and engage the public.
  • Partnered with Croissant Rouge for a blood donation event, creating stickers given to participants.

TIMESTAMP: Dec 2020 - Jun 2021

Freelance

Graphic Designer & Community Manager

  • Designed marketing visuals including social media posts, banners, and promotional content.
  • Created graphical identities and logos tailored to client branding requirements.
  • Managed social media accounts, scheduling posts and interacting with followers to increase engagement.
  • Implemented content strategies to improve brand presence and attract new clients.
  • Increased social media engagement by over 30% through consistent and visually appealing content.
  • Developed distinctive visual identities and logos that strengthened client branding.

[SYS.003]

SYSTEM STATUS BOARD

DATA & BIG DATA

ONLINE
Python | SQL | Data Engineering | Machine Learning | Business Intelligence | PySpark | Hadoop | Apache Kafka | Apache Airflow | Apache Impala | Elastic Stack (ELK) | Data Warehousing | SQL Server | Microsoft Excel | Spreadsheet | Power BI | Linux

WEB & DEVELOPMENT

ONLINE
Git | GitHub | GitLab | HTML | CSS | JavaScript | FastAPI | Flutter | MySQL | Transact-SQL (T-SQL)

DESIGN & MULTIMEDIA

ONLINE
Adobe Illustrator | Adobe Photoshop | Figma | Canva

METHODOLOGIES & SOFT SKILLS

ONLINE
Problem Solving | Communication | Decision-Making | Target Marketing

LANGUAGES

ONLINE
Arabic (Native) | English (C2) | French (Intermediate)

[SYS.004]

MISSION FILES

ID: data-guard | data

Data-Guard: Resilient Sales Pipeline

Engineered a defensive end-to-end pipeline to process and validate Tunisian retail sales events.

  • Hybrid Lambda-Medallion architecture using Spark Structured Streaming and Kafka
  • Silver-Guard validation engine enforcing 7 business guardrails and quarantine logic
  • DuckDB and Parquet serving layer for sub-second Power BI reporting
  • Fully containerized ecosystem with Docker for reproducible deployments

Python | Kafka | Spark | DuckDB | Power BI | Docker

ID: realtime-gaming-pipeline | data

Real-Time Data Streaming Pipeline

Streamed hundreds of live game events per minute and processed player scores and logins instantly.

  • Kafka ingestion for real-time event capture
  • Spark processing for low-latency enrichment
  • ELK Stack (Elasticsearch, Kibana) for live metrics dashboards
  • Dockerized infrastructure for easy deployment

Python | Kafka | Spark | Elasticsearch | Kibana | Docker

ID: ai-job-matching | data

AI Job-Matching Pipeline

Developed a real-time ingestion pipeline for semantic resume-to-job matching.

  • all-MiniLM-L6-v2 embeddings for semantic similarity
  • JobSpy adaptation for localized Tunisian job indexing
  • SQLite persistence for search history, duplicate detection, and observability logging

Python | SentenceTransformers | Streamlit | SQLite | PyMuPDF | Machine Learning

ID: airflow-weather | data

Airflow Weather Pipeline

Fetched weather data from the OpenWeatherMap API and trained regression models.

  • Automated ETL with Apache Airflow
  • JSON to CSV transformation
  • LinearRegression, DecisionTree, and RandomForest models achieving ~79% accuracy

Python | Airflow | Pandas | scikit-learn | Docker

ID: gitlab-cicd | data

GitLab CI/CD Pipeline

Created a CI/CD pipeline for Python data processing.

  • GitLab project, runner, and SSH authentication setup
  • Automated build, test, and deployment
  • Kubernetes (MicroK8s) orchestration

Python | GitLab CI/CD | Kubernetes | MicroK8s

ID: mongodb-project | data

MongoDB Data Engineering

Imported ~200k JSON documents and performed advanced analytics.

  • MQL queries and aggregation pipelines
  • Data modeling for retrieval optimization
  • Large-scale document processing

MongoDB | Python | MQL

[SYS.005]

CLEARANCE RECORD

Nov 2024 - Nov 2025

DEGREE / PROGRAM

Data Engineering Certification

Executive Education MINES Paris - PSL (with Datascientest and Orange Tunisie)

Professional certificate focused on Python, SQL/NoSQL, PySpark, Kafka, Elasticsearch, Hadoop, Hive, Airflow, DevOps, CI/CD, monitoring, and modern data pipeline engineering.

Oct 2023 - Jun 2025

DEGREE / PROGRAM

Master's in Business Analytics & Data Science

Virtual University of Tunis (UVT)

Advanced training in data analysis, machine learning, NLP, big data frameworks, cloud infrastructure, and data security, with practical work in Python, R, SQL, scikit-learn, and Spark.

2020 - 2023

DEGREE / PROGRAM

Bachelor's in Computer Science, Big Data & Data Analysis

Institut Supérieur des Arts Multimédia de la Manouba (ISAMM)

Comprehensive program in computer science and data analysis covering algorithms, statistics, cloud computing, machine learning, and project delivery through hands-on team projects.

Aug 2024

CERTIFICATION

Associate Data Engineer

DataCamp

Certificate code: DEA0015452912142.

Sep 2024

CERTIFICATION

EFSET English Test

EFSET

Certificate code: ASPMQT.

Dec 2025

CERTIFICATION

Power BI Data Analyst Associate (PL-300)

THE TEAM

[SYS.006]

VOLUNTEERING LOG

TIMESTAMP: Sep 2023 - Feb 2024

GDG Sfax

Organizer

  • Planned and coordinated tech workshops and meetups to promote local developer engagement.
  • Facilitated knowledge-sharing sessions on emerging technologies and best practices.
  • Collaborated with team members to ensure smooth event execution and community growth.

TIMESTAMP: Jul 2023 - Apr 2024

ML Act Community

Lead Designer

  • Designed visual content and materials for community events, enhancing engagement.
  • Supported ML/DL learning initiatives by creating accessible and clear resources.
  • Collaborated with members to organize workshops and networking opportunities.

TIMESTAMP: Oct 2021 - Jul 2023

Data Science Club TBS

Graphic Designer

  • Produced graphics and visual materials for events like DATACAMP 2077 and Hackathon: INTO THE DATAVERSE.
  • Supported event logistics and coordination, ensuring a seamless participant experience.
  • Collaborated with members to design engaging content for workshops and competitions.

TIMESTAMP: Oct 2022 - Sep 2023

Robotique ISAMM

Media Specialist

  • Created digital content to promote club activities and projects.
  • Supported event planning and documentation to improve community visibility.
  • Collaborated with team members on outreach and technical demonstrations.

TIMESTAMP: Jan 2021 - Jul 2021

Enactus ISAMM

Software Programmer

  • Led a Flutter development team to build a mobile application integrated with Firebase.
  • Managed project timelines, delegated tasks, and ensured quality deliverables.
  • Enhanced user experience and data management through well-structured app development.

TIMESTAMP: Oct 2020 - Jun 2021

Jeunes Ingénieurs ISAMM

Graphic Designer

  • Designed promotional materials and graphics for events and workshops.
  • Assisted in event planning and content creation to boost community engagement.