Summary of discussion

After the session on Tuesday (15th of September) we discussed about the future and the challenges of scheduling for parallel computing. The following issues were raised: The above summary was written by M.Drozdowski
Minutes of what has been said are given here (by J.BerliƄska).
If any details of the discussion have been omitted, or you would like to add a comment, then please write an email to Maciej Drozdowski.

Things to ponder

A questionnaire was distributed between the participants of the discussion. The questions asked, and the answers are given below.
  1. What are the biggest challenges of scheduling for parallel computing (SPC)?
    Answers:
    - Architecture dependency.
    - Availability of advanced reservations.
    - "Optimal/improved/good" schedules (optimality is impossible because of NP-hardness).
    - New hardware architectures including parallel many-core systems, GPGPU, etc.
    - A lack of application and operating system dynamic scheduling techniques to deal with hierarchical and heterogeneous computing.
    - Energy-aware scheduling algorithms.
    - Multicriteria scheduling methods.
    - Good [scheduling] algorithms results despite poor knowledge of processing times.
    - Complexity of real problems (in modeling not computational complexity).
    - Decomposition of real problems into parts simple to represent and solve.

  2. What are the practical scheduling problems that should be solved?
    Answers:
    - Self-tuning algorithms which provide close-to-optimal results (provably) for a wide range of applications.
    - New scheduling modules implemented in operating and HPC queuing systems taking into account new application requirements and new hardware architectures and their constraints.
    - Scheduling of applications on many-core and hybrid systems.
    - Scheduling in large scale heterogeneous environments taking into account data management.
    - Optimization of the costs (e.g. energy, cost of ownership) on platforms with big numbers of machines.
    - Diversity of models, lack of uniform approaches.

  3. What are the obstacles in applying theoretical scheduling results in practice?
    Answers:
    - Availability of libraries/reference implementations.
    - Unavailability of advanced reservations.
    - Not enough/reliable information about resources/services.
    - A lack of simulations and emulations based on reference benchmarks and real workloads.
    - Difficulty with the access to sources of practical applications and systems.
    - Lack of fine-grain parallelization of majority of applications.
    - Missing good estimations of job execution times.
    - Often optimization time [is] too long - decision is needed immediately.
    - Lack of input data for scheduling algorithms, lack of instrumentation to implement the schedules.

  4. What is the most obsolete subject in SPC?
    Answers:
    - Focus on theoretical models for relatively small problem instances.
    - Load balancing in grids and clusters.
    - Taking into account low level communication hardware, or communication network structure.

  5. What is the most promising subject in SPC?
    Answers:
    - Multiobjective scheduling of workflows.
    - Hierarchical and many-level scheduling in parallel and distributed systems.
    - Application-level dynamic scheduling for parallel threads and processes.
    - Probably fine-grain scheduling of applications on new architectures (man-cores, accelerators, hybrid systems) due to large area of applications (very high impact).

  6. What question should we have asked?
    Answers:
    - What to do "not to reinvent the wheel" and to implement more applicable modules and scheduling approaches in practice.




Last modified :