Back to Iv Year
📘 AM605PC
Data Mining Lab
Access study materials and notes for this subject
DATA MINING LAB MANUAL
PDF Document
Preview
Download
Syllabus Overview
List of Experiments
Experiment 1: Data Processing Techniques
a) Data Cleaning
b) Data Transformation – Normalization
c) Data Integration
Experiment 2: Data Partitioning Techniques
Implement Horizontal Partitioning
Implement Vertical Partitioning
Implement Round Robin Partitioning
Implement Hash-based Partitioning
Experiment 3: Data Warehouse Schemas
Design Star Schema
Design Snowflake Schema
Design Fact Constellation Schema
Experiment 4: Data Cube and OLAP Operations
Construct Data Cube
Perform OLAP Operations: Roll-up, Drill-down, Slice, Dice, Pivot
Experiment 5: ETL Operations
Perform Data Extraction
Apply Transformations
Load Processed Data into Target Warehouse (using Pentaho/Python)
Experiment 6: Attribute-Oriented Induction
Implement Attribute-Oriented Induction Algorithm for Concept Description
Experiment 7: Apriori Algorithm
Implement Apriori Algorithm for Frequent Itemset Mining
Generate Association Rules from Frequent Itemsets
Experiment 8: FP-Growth Algorithm
Construct FP-Tree
Mine Frequent Patterns using FP-Growth Algorithm
Experiment 9: Decision Tree Induction
Implement Decision Tree Algorithm (e.g., ID3 or C4.5)
Visualize the Generated Tree
Experiment 10: Information Gain Calculation
Calculate Entropy and Information Gain for Attributes
Use Gain to Select Root Node in Decision Tree
Experiment 11: Naive Bayes Classification
Implement Naive Bayes Classifier
Classify Test Instances using Probability Estimation
Experiment 12: K-Nearest Neighbour (K-NN)
Implement K-NN Algorithm for Classification
Experiment with Different Distance Metrics (Euclidean, Manhattan)
Experiment 13: K-Means Clustering
Implement K-Means Algorithm
Visualize Clusters and Analyze Convergence
Experiment 14: BIRCH Clustering
Implement BIRCH Algorithm for Large Dataset Clustering
Understand CF Tree Construction
Experiment 15: PAM (Partitioning Around Medoids)
Implement PAM Algorithm (K-Medoids)
Compare with K-Means in Terms of Robustness to Outliers
Experiment 16: DBSCAN Clustering
Implement DBSCAN Algorithm
Identify Core, Border, and Noise Points
Analyze Clustering with Different Epsilon and MinPts Values
Data Mining Lab Notes