Estimation of Reward and Decision Making for Trust-Adaptive Agents in Normative Environments

  • Jan Kantert
  • Yvonne Bernard
  • Lukas Klejnowski
  • Christian Müller-Schloer
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8350)

Abstract

In an open trusted Desktop Grid system with a normative environment incentives and sanctions may change during runtime. Every agent in the system computes work for other agents and also submits jobs to other agents. It has to decide for which agents it wants to work and to which agent it wants to give its jobs. We introduced a trust metric to isolate misbehaving agents. After getting a job processed by another agent it will get a reward. When processing a job for another agent it will get a positive trust-rating, but no direct reward. To come to a decision when accepting or rejecting jobs we need to be able to estimate the reward. Since the environment may change at runtime and to overcome delayed reward issues we use a neural network to estimate the reward based on the environment and trust level.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Tomforde, S., Hähner, J., Müller-Schloer, C.: The multi-level observer/controller framework for learning and self-optimising systems. Int. J. Data Mining and Bioinformatics (2012)Google Scholar
  2. 2.
    Bernard, Y., Klejnowski, L., Hähner, J., Müller-Schloer, C.: Towards trust in desktop grid systems. In: IEEE International Symposium on Cluster Computing and the Grid, pp. 637–642 (2010)Google Scholar
  3. 3.
    Castelfranchi, C., Falcone, R.: Trust Theory: A Socio-Cognitive and Computational Model, 1st edn. Wiley Publishing (2010)Google Scholar
  4. 4.
    Cakar, E.: Population-based runtime optimisation in static and dynamic environments. Ph.D. dissertation, Leibniz Universität Hannover (2011), http://edok01.tib.uni-hannover.de/edoks/e01dh11/668667427.pdf
  5. 5.
    Bernard, Y., Kantert, J., Klejnowski, L., Schreiber, N., Müller-Schloer, C.: Application of learning to trust-adaptive agents. In: Workshop on Social Concepts in Self-Adaptive and Self-Organising Systems, Workshop Proceedings of the Seventh IEEE International Conference on Self-Adaptive and Self-Organizing Systems (SASOW), Philadelphia, USA, September 9-13 (2013)Google Scholar
  6. 6.
    Choi, S., Buyya, R., Kim, H., Byun, E.: A Taxonomy of Desktop Grids and its Mapping to State of the Art Systems. Grid Computing and Distributed Systems Laboratory, The University of Melbourne, Tech. Rep. (2008)Google Scholar
  7. 7.
    Bernard, Y., Klejnowski, L., Cakar, E., Hahner, J., Müller-Schloer, C.: Efficiency and robustness using trusted communities in a trusted desktop grid. In: 2011 Fifth IEEE Conference on Self-Adaptive and Self-Organizing Systems Workshops, SASOW (2011)Google Scholar
  8. 8.
    Cakar, E., Müller-Schloer, C.: Self-organising interaction patterns of homogeneous and heterogeneous multi-agent populations. In: Third IEEE International Conference on Self-Adaptive and Self-Organizing Systems, SASO 2009, pp. 165–174 (2009)Google Scholar
  9. 9.
    Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. SIGKDD Explor. Newsl. 11(1), 10–18 (2009), http://doi.acm.org/10.1145/1656274.1656278 CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Jan Kantert
    • 1
  • Yvonne Bernard
    • 1
  • Lukas Klejnowski
    • 1
  • Christian Müller-Schloer
    • 1
  1. 1.Institute of Systems EngineeringLeibniz Universität HannoverGermany

Personalised recommendations