Performance Profile: a Multi-Criteria Performance Evaluation Method for Test-Based Problems

by Wojciech Jaśkowski, Paweł Liskowski, Marcin Szubert, Krzysztof Krawiec
Abstract:
In test-based problems, solutions produced by search algorithms are typically assessed using average outcomes of interactions with multiple tests. This aggregation leads to information loss, which can render different solutions apparently indifferent and hinder comparison of search algorithms. In this paper we introduce the performance profile, a generic, domain-independent, multi-criteria performance evaluation method that mitigates this problem by characterizing the performance of a solution by a vector of outcomes of interactions with tests of various difficulty. To demonstrate the usefulness of this gauge, we employ it to analyze the behavior of Othello and Iterated Prisoner’s Dilemma players produced by five (co)evolutionary algorithms as well as players known from previous publications. Performance profiles reveal interesting differences between the players, which escape the attention of the scalar performance measure of the expected utility. In particular, they allow us to observe that evolution with random sampling produces players coping well against the mediocre opponents, while the coevolutionary and temporal difference learning strategies play better against the high-grade opponents. We postulate that performance profiles improve our understanding of characteristics of search algorithms applied to arbitrary test-based problems, and can prospectively help design better methods for interactive domains.
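The core idea of the abstract, replacing a single averaged score with a vector of outcomes grouped by test difficulty, can be illustrated with a minimal sketch. It is not the procedure from the paper: the function name `performance_profile`, the equal-width binning, the number of bins, and the assumption that each test already carries a difficulty value in [0, 1] are all hypothetical choices made for illustration.

```python
from statistics import mean


def performance_profile(outcomes, difficulties, n_bins=4):
    """Average outcome of one solution within each difficulty bin.

    outcomes[i]     -- result against test i (e.g. 1 = win, 0.5 = draw, 0 = loss)
    difficulties[i] -- difficulty of test i in [0, 1]; how difficulty is
                       measured is an assumption left to the caller
    Returns a list of n_bins averages, with None for bins that hold no tests.
    """
    if len(outcomes) != len(difficulties):
        raise ValueError("outcomes and difficulties must have equal length")
    bins = [[] for _ in range(n_bins)]
    for outcome, difficulty in zip(outcomes, difficulties):
        # Clamp the index so that difficulty == 1.0 lands in the last bin.
        index = min(int(difficulty * n_bins), n_bins - 1)
        bins[index].append(outcome)
    return [mean(b) if b else None for b in bins]


# Hypothetical data: eight opponents ordered from easy to hard.
difficulties = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
player_a = [1, 1, 1, 1, 0, 0, 0, 0]  # beats only the weaker opponents
player_b = [1, 0, 1, 0, 1, 0, 1, 0]  # wins half the games at every level

profile_a = performance_profile(player_a, difficulties)
profile_b = performance_profile(player_b, difficulties)
```

Both hypothetical players have the same average outcome of 0.5, so a scalar measure cannot tell them apart, while their profiles differ: player A scores 1 in the two easiest bins and 0 in the two hardest, and player B scores 0.5 in every bin.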
Reference:
Wojciech Jaśkowski, Paweł Liskowski, Marcin Szubert, Krzysztof Krawiec. Performance Profile: a Multi-Criteria Performance Evaluation Method for Test-Based Problems. International Journal of Applied Mathematics and Computer Science, volume 26, number 1, pages 215-229, 2016.
Bibtex Entry:
@Article{Jaskowski2015profiles,
  author =    {Wojciech Jaśkowski and Paweł Liskowski and Marcin Szubert and Krzysztof Krawiec},
  title =     {Performance Profile: a Multi-Criteria Performance Evaluation Method for Test-Based Problems},
  journal =   {International Journal of Applied Mathematics and Computer Science},
  year =      {2016},
  volume =    {26},
  number =    {1},
  pages =     {215--229},
  abstract =  {In test-based problems, solutions produced by search algorithms are typically assessed using average outcomes of interactions with multiple tests. This aggregation leads to information loss, which can render different solutions apparently indifferent and hinder comparison of search algorithms. In this paper we introduce the performance profile, a generic, domain-independent, multi-criteria performance evaluation method that mitigates this problem by characterizing the performance of a solution by a vector of outcomes of interactions with tests of various difficulty. To demonstrate the usefulness of this gauge, we employ it to analyze the behavior of Othello and Iterated Prisoner’s Dilemma players produced by five (co)evolutionary algorithms as well as players known from previous publications. Performance profiles reveal interesting differences between the players, which escape the attention of the scalar performance measure of the expected utility. In particular, they allow us to observe that evolution with random sampling produces players coping well against the mediocre opponents, while the coevolutionary and temporal difference learning strategies play better against the high-grade opponents. We postulate that performance profiles improve our understanding of characteristics of search algorithms applied to arbitrary test-based problems, and can prospectively help design better methods for interactive domains.},
  doi =       {10.1515/amcs-2016-0015},
  keywords =  {coevolutionary algorithms, evolution strategies, Othello, Reversi, games, multi-objective analysis},
  owner =     {Wojciech},
  timestamp = {2015.07.14},
  url = {http://www.cs.put.poznan.pl/wjaskowski/pub/papers/Jaskowski2015Profile.pdf}
}
