Massive Data Processing

CPE Lyon

Reference Material

Big data technologies and methodologies: distributed computing frameworks, data mining algorithms, neural networks, linked open data, and ethical considerations in large-scale data analysis. Tools covered include Hadoop, Hive, and Spark.

Lectures

Practical Work

GitHub Repository

TDM Repository

Hands-on exercises covering big data processing, distributed computing, data mining, neural networks, and semantic web technologies, including work with Hadoop, Hive, Spark, and machine learning implementations.
