News

03-03-2017 The new semester has begun :)

The aim and the scope of the course

The aim of the course: To get to know the latest technologies and algorithms for processing massive datasets for intelligent decision support systems.

The scope of the course: We will learn how to organize, store, access, and process massive datasets::

Information about the Course

Time and Place

Schedule of Lectures

03-03-2017 Processing of massive data sets [pdf]
10-03-2017 Evolution of database systems [pdf]
17-03-2017 Dimensional modeling [pdf]
24-03-2017 ETL and OLAP systems [pdf]
07-04-2017 Processing of very large data [pdf]
21-04-2017 Approximate query processing [pdf]
12-05-2017 Multi-dimensional index structures [pdf]
26-05-2017 Finding similar items I [pdf]
02-06-2017 Finding similar items II [pdf]
09-06-2017 Data partitioning and MapReduce [pdf]

Schedule of Labs

03-03-2017 Bonferroni's principle [pdf]
10-03-2017 Solving problems by simulations [pdf]
24-03-2017 Dimensional modeling [pdf]
31-03-2017 Data transformation [pdf] [unique_tracks.zip] [triplets_sample_20p.zip] [report.pdf] [report.tex]
21-04-2017 Data transformation in bash [pdf]
27-04-2017 Bloom filters [pdf] [code]
22-05-2017 Nearest neighbor search [pdf] [msdc-facts.zip] [result.txt]
02-06-2017 Approximate nearest neighbor search [pdf]

Evaluation

Lecture:
Test : 75% (min. 50%)
Labs : 25% (min. 50%)
Labs:
Regular exercises and homeworks : 100% (min. 50%)

Bonus points for all: up to 10 points.

Scale

90% 5.0
80% 4.5
70% 4.0
60% 3.5
50% 3.0

Bibliography

R. Kimball, M. Ross, The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling, John Wiley & Sons, 2002

Z. Królikowski, Hurtownie danych: logiczne i fizyczne struktury danych, Wydawnictwo Politechniki Poznańskiej, 2007

A. Rajaraman, J. D. Ullman, Mining of Massive Datasets, Cambridge University Press, 2011, http://www.mmds.org.

H. Garcia-Molina, J. D. Ullman, J. Widom, Database Systems: The Complete Book. Second Edition. Pearson Prentice Hall, 2009.

J.Lin, Ch. Dyer, Data-Intensive Text Processing with MapReduce. Morgan and Claypool Publishers, 2010, http://lintool.github.com/MapReduceAlgorithms/.

Ch. Lam, Hadoop in Action, Manning Publications Co., 2011.