Week | Topics | Study Materials | Materials |
1 |
Introduction to Data Mining
• Concepts, importance, application areas. Introduction to R and Python.
|
|
|
2 |
Data Preprocessing
• Data cleaning, transformation, normalization. Applications with R and Python.
|
|
|
3 |
Supervised Learning: Regression Analysis
• Linear and logistic regression. Applications with R and Python.
|
|
|
4 |
Supervised Learning: Classification
• Decision trees, support vector machines, random forests. Applications with R and Python.
|
|
|
5 |
Unsupervised Learning: Clustering
• K-means, hierarchical clustering, DBSCAN. Applications with R and Python.
|
|
|
6 |
Unsupervised Learning: Association Rules
• Apriori, FP-Growth algorithms. Applications with R and Python.
|
|
|
7 |
Anomaly Detection
• Outlier detection, isolation forests and local outlier factor. Applications with R and Python.
|
|
|
8 |
Midterm exam
|
|
|
9 |
Model Selection and Validation
• Model evaluation metrics, cross-validation. Applications with R and Python.
|
|
|
10 |
Feature Engineering and Reduction
• Feature selection, dimensionality reduction techniques. Applications with R and Python.
|
|
|
11 |
Data Visualization and Reporting
• Data exploration and visualization techniques. Applications with R and Python.
|
|
|
12 |
Real World Data Mining Applications
• Sectoral application examples: e-commerce, social media analysis, customer segmentation. Real life examples with R and Python
|
|
|
13 |
Working on Course Projects
• Students work on group projects, analyzing data sets and developing models.
|
|
|
14 |
Project Presentations and General Review of the Course
• Presentation of the projects that students worked on throughout the semester. General review of the lesson and going over important points.
|
|
|