Eksploracja i analiza danych  - Data Mining and Data Analysis

Przedmiot dla studentów Informatyki, specjalności SE (Politechnika Poznanska)

This course is taught in English language for Master Course of Software Engineering (Poznan University of Technology).

We plan to move these course materials to the e-learning SE system - so please verify the content of e-moodle cs put , in particular with respect to instructions for laboratories classes, small projects and homeworks (contat me or lab teaching assistent).

Prowadzący / Lecturer:

dr hab. inż. Jerzy Stefanowski , prof. nadzw.

Class / Lab Teaching Assistant: not assigned (All lab instructions were prepared togther with dr inż. Magdalena Deckert)

Lectures are planned each Monday, CW-BT lecture room BT 126 godz, 13.30-15.00;

Lectures slides for the current academic year (based on 2019 version):

  1. Introduction to KDD Process. -> this year joined with the preprocessing New lecture.
  2. Pre-processing for data mining .
  3. Discovery of classification trees.
  4. Discovery of rules.
  5. On evaluation of classifiers.
  6. Naive Bayes, K-NN and other classifiers. The newest lecture Naive Bayes, K-NN version 2019; The additional lecture on Statistical Classifiers is put inside ALGODEC lecture link - see Dodatkowe wyklady.
  7. See also the additional lecture on imbalanced data - this is an  updated PAN summer school lecture
  8. A new lecture (2019 ed) on ensembles (bagging, boosting, random forest, stacking). You may check additional lectures on multiple classifiers is put insude ALGODEC lecture links  and a special short lecture on this topic special for you on ensembles + imbalance learning.
  9. Cluster algorithms.
  10. Association Rules; We added an extra lecture on rule evaluation - see additional handouts.
  11. Sequence Data Mining
  12. Prediction models - regression; Two lectures.
  13. Basics of Artificial Neural Networks A lecture on MLP, RBF and Kohonen networks.
  14. Non-parametric tests and qualitative data analyis: Selected non-parametric, ranked tests + Analysis of Questionnaires and Qualitative Data
  15. Data Visualisation and Visual Data Mining
  16. Summary of KDD and Buisness Advanced Data Processing Systems
  17.  

Some other notes - Tutorials on Data Mining - ALGODEC COST project
 

Laboratory notes -- will be provided by our Lab Instructor. See the moodle system for cs put poznan.

Selected initial lab instructions ( M.Deckert's contribution to most of them is acknowledged):

 Topics and Reamrks for the Final Exam 2016 / 7 - partly for earlier editions

Literatura - Course Books and References

Ostatnia aktualizacja -- Last revision: May 28, 2019 - J.Stefanowski