dr inż. Krzysztof Dembczyński (kdembczynski cs put poznan pl)
28-10-2017 | The first lecture :) |
The aim of the course: To get to know how to design and construct data warehouses for efficient data processing.
The scope of the course:We will learn about:
29-10-2017 | Introduction [pdf] |
29-10-2017 | Evolution of database systems [pdf] |
18-11-2017 | Dimensional modeling [pdf] |
18-11-2017 | ETL and OLAP systems [pdf] |
26-11-2017 | MapReduce in Spark [pdf] |
14-01-2017 | Processing of very large data [pdf] |
18-11-2017 | Dimensional modeling [pdf] [report-1.pdf] [report-1.tex] |
25-11-2017 | Data transformation [pdf] [unique_tracks.zip] [triplets_sample_20p.zip] [report-2.pdf] [report-2.tex] |
16-12-2017 | MapReduce in Spark [pdf] [all-shakespeare.zip] [matrix M] [vector x] [vector v] [unique_tracks-csv.zip] |
21-01-2018 | MapReduce in Spark: Matrix multiplication [pdf] [data for matrix multiplication] |
Test : | 75 points | (min. 50%) |
Labs : | 25 points | (min. 50%) |
Regular tasks and exercises : | 4x25 points | (min. 50%) |
90% | 5.0 |
80% | 4.5 |
70% | 4.0 |
60% | 3.5 |
50% | 3.0 |
Z. Królikowski, Hurtownie danych: logiczne i fizyczne struktury danych, Wydawnictwo Politechniki Poznańskiej 2007
R. Kimball, M. Ross, The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling, John Wiley & Sons 2002
A. Rajaraman, J. D. Ullman, Mining of Massive Datasets, Cambridge University Press, 2011, http://www.mmds.org.
H. Garcia-Molina, J. D. Ullman, J. Widom, Systemy baz danych. Kompletny podręcznik. Wydanie II Helion, 2011.