Eksploracja i analiza danych - Data Mining
and Data Analysis
Przedmiot dla studentów Informatyki, specjalności SE
(Politechnika Poznanska)
This course is taught in English language for Master Course of
Software Engineering (Poznan University of Technology).
We plan to move these course materials to the e-learning SE system - so
please verify the content of e-moodle cs put , in particular with respect to instructions
for laboratories classes, small projects and homeworks (contat me or lab
teaching assistent).
Prowadzący / Lecturer:
dr hab. inż.
Jerzy Stefanowski , prof. nadzw.
Class / Lab Teaching Assistant: not assigned
(All lab instructions were prepared togther with dr inż. Magdalena Deckert)
Lectures are planned each Monday, CW-BT lecture room BT 126 godz, 13.30-15.00;
Lectures slides for the current academic year (based on 2019 version):
- Introduction to KDD Process. -> this year joined with the preprocessing New lecture.
- Pre-processing for data mining .
- Discovery of classification trees.
- Discovery of rules.
- On evaluation of classifiers.
- Naive Bayes, K-NN and other classifiers.
The newest lecture Naive Bayes, K-NN version 2019;
The additional lecture on Statistical Classifiers is put inside ALGODEC lecture link - see Dodatkowe wyklady.
- See also the additional lecture on imbalanced data - this is an
updated
PAN summer school lecture
- A new lecture (2019 ed) on ensembles (bagging, boosting, random forest, stacking).
You may check additional lectures on multiple classifiers is put insude ALGODEC lecture links
and a special short lecture
on this topic special for you on ensembles + imbalance learning.
- Cluster algorithms.
- Association Rules; We added
an extra lecture on rule evaluation - see additional handouts.
- Sequence Data Mining
- Prediction models - regression; Two lectures.
- Basics of Artificial Neural Networks A lecture on MLP, RBF and Kohonen networks.
- Non-parametric tests and qualitative data analyis: Selected non-parametric, ranked tests + Analysis of Questionnaires
and Qualitative Data
- Data Visualisation and Visual Data Mining
- Summary of KDD and Buisness Advanced Data Processing Systems
Some other notes - Tutorials on Data Mining - ALGODEC COST project
Laboratory notes -- will be provided by our Lab Instructor. See the moodle system for cs put poznan.
Selected initial lab instructions ( M.Deckert's contribution to most of them is acknowledged):
Topics and Reamrks for the Final Exam 2016 / 7 - partly for earlier editions
Literatura - Course Books and References
- Data Mining: Concepts and Techniques, Jiawei Han,
Micheline Kamber, Jian Pei, Morgan Kaufmann, 2005 (2 rozszerzone wydanie).
- Odkrywanie wiedzy z danych, Larose D. ( polskie tłumaczenie),
PWN, Warszawa, 2006 = Also available in an original US version.
- Eksploracja danych, Hand D., Mannila H., Smyth P. (polskie tłumaczenie) = Also available in an original Enslish version.
-
Data Mining and Analysis: Fundamental Concepts and Algorithms, Mohammed J. Zaki, Wagner Meira Jr [printed version by Cambridge Press, see also the authors' web page of the book]
- Inne pozycje wg. informacji prowadzacego / Ask the lecturer for additional books and see the first lecture slides
Ostatnia aktualizacja -- Last revision: May 28, 2019 - J.Stefanowski