Distributed and Parallel Databases, Volume 24, Issue 1–3, pp 1–43

Scalable and topology-aware reconciliation on P2P networks

  • Vidal Martins
  • Esther Pacitti
  • Manal El Dick
  • Ricardo Jimenez-Peris


Collaborative applications are characterized by high levels of data sharing. Optimistic replication has been suggested as a mechanism to enable highly concurrent access to the shared data while providing full application-defined consistency guarantees. A growing number of emerging cooperative applications are well suited to Peer-to-Peer (P2P) networks. However, deploying such applications in P2P networks requires a mechanism that handles their high levels of data sharing in a dynamic, scalable and available way. Previous work on optimistic replication has mainly concentrated on centralized systems. Centralized approaches are inappropriate for a P2P setting because of their limited availability and their vulnerability to failures and network partitions. In this paper, we focus on the design of a reconciliation algorithm intended for large-scale cooperative applications, such as P2P Wiki. The main contribution of this paper is a distributed reconciliation algorithm designed for P2P networks (P2P-reconciler). Other important contributions are: a basic cost model for computing communication costs in a DHT overlay network; a strategy for computing the cost of each reconciliation step taking into account the cost model; and an algorithm that dynamically selects the best nodes for each reconciliation step. Furthermore, since P2P networks are built independently of the underlying topology, which may cause high latencies and large overheads that degrade performance, we also propose a topology-aware variant of our P2P-reconciler algorithm and show the significant gains obtained by using it. Our P2P-reconciler solution enables high levels of concurrency thanks to semantic reconciliation and yields high availability and excellent scalability, with acceptable performance and limited overhead.


Data replication, Semantic reconciliation, Eventual consistency, Peer-to-peer





Copyright information

© Springer Science+Business Media, LLC 2008

Authors and Affiliations

  • Vidal Martins (1)
  • Esther Pacitti (2)
  • Manal El Dick (2)
  • Ricardo Jimenez-Peris (3)

  1. PPGIA/PUCPR, Pontifical Catholic University of Paraná, Curitiba, Brazil
  2. ATLAS Team, INRIA and LINA, University of Nantes, Nantes Cedex 3, France
  3. Distributed Systems Lab, Facultad Informatica, Universidad Politécnica de Madrid, Madrid, Spain
