Learning Feature Weights from Positive Cases

  • Sidath Gunawardena
  • Rosina O. Weber
  • Julia Stoyanovich
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7969)

Abstract

The availability of new data sources presents both opportunities and challenges for the use of Case-based Reasoning to solve novel problems. In this paper, we describe the research challenges we faced when trying to reuse experiences of successful academic collaborations available online in descriptions of funded grant proposals. The goal is to recommend the characteristics of two collaborators to complement an academic seeking a multidisciplinary team; the three form a collaboration that resembles a configuration that has been successful in securing funding. While seeking a suitable measure for computing similarity between cases, we were confronted with two challenges: a problem context with insufficient domain knowledge and data that consists exclusively of successful collaborations, that is, it contains only positive instances. We present our strategy to overcome these challenges, which is a clustering-based approach to learn feature weights. Our approach identifies poorly aligned cases, i.e., ones that violate the assumption that similar problems have similar solutions. We use the poorly aligned cases as negatives in a feedback algorithm to learn feature weights. The result of this work is an integration of methods that makes CBR useful to yet another context and in conditions it has not been used before.

Keywords

Case Alignment Case Cohesion Density Clustering Multidisciplinary Collaboration Recommender Systems Single Class Learning Subspace Clustering 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Aggarwal, C.C., Wolf, J.L., Yu, P.S., Procopiuc, C., Park, J.S.: Fast Algorithms for Projected Clustering. ACM SIGMOD Record 28(2), 61–72 (1999)CrossRefGoogle Scholar
  2. 2.
    Aha, D.W.: Feature weighting for lazy learning algorithms. In: Liu, H., Motoda, H. (eds.) Feature Extraction, Construction and Selection: A Data Mining Perspective, pp. 13–32. Kluwer, Norwell (1998)CrossRefGoogle Scholar
  3. 3.
    Adomavicius, G., Tuzhilin, A.: Toward the Next Generation of Recommender Systems: A Survey of the State-Of-The-Art and Possible Extensions. IEEE Transactions on Knowledge and Data Engineering 17(6), 734–749 (2005)CrossRefGoogle Scholar
  4. 4.
    Burke, R.: Hybrid Recommender Systems: Survey and Experiments. User Modeling and User-adapted Interaction 12(4), 331–370 (2002)MATHCrossRefGoogle Scholar
  5. 5.
    Calvo, B., López-Bigas, N., Furney, S.J., Larrañaga, P., Lozano, J.A.: A Partially Supervised Classification Approach to Dominant and Recessive Human Disease Gene Prediction. Computer Methods and Programs in Biomedicine 85(3), 229–237 (2007)CrossRefGoogle Scholar
  6. 6.
    Chakraborti, S., Cerviño Beresi, U., Wiratunga, N., Massie, S., Lothian, R., Watt, S.: Visualizing and Evaluating Complexity of Textual Case Bases. Advances in Case-Based Reasoning, 104–119 (2008)Google Scholar
  7. 7.
    Delany, S.J.: The Good, the Bad and the Incorrectly Classified: Profiling Cases for Case-Base Editing. In: McGinty, L., Wilson, D.C. (eds.) ICCBR 2009. LNCS, vol. 5650, pp. 135–149. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  8. 8.
    Dietterich, T.G.: Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms. Neural Computation 10(7), 1895–1923 (1998)CrossRefGoogle Scholar
  9. 9.
    Ester, M., Kriegel, H.-P., Sander, J., Xu, X.: A Density-Based Algorithm for Discovering Clusters In Large Spatial Databases with Noise. In: Simoudis, E., Han, J., Fayyad, U.M. (eds.) Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 226–231. AAAI Press, Menlo Alto (1996)Google Scholar
  10. 10.
    Gunawardena, S., Weber, R.O.: Blueprints for Success Guidelines for Building Multidisciplinary Collaboration Teams. In: Filipe, J., Fred, A.L.N. (eds.) ICAART 2012 Proceedings of the 4th Intl. Conference on Agents and Artificial Intelligence, pp. 387–399. SciTePress (2012)Google Scholar
  11. 11.
    Gunawardena, S., Weber, R.O.: Reasoning with Organizational Case Bases in the Absence Negative Exemplars. In: ICCBR 2012: 2nd Workshop on Process-Oriented Case-Based Reasoning, pp. 35–44 (2012)Google Scholar
  12. 12.
    Gunawardena, S., Weber, R.O.: Applying CBR principles to Reason without Negative Exemplars. In: FLAIRS 2013 (in press, 2013)Google Scholar
  13. 13.
    Gunawardena, S., Weber, R.O., Agosto, D.E.: Finding that Special Someone: Interdisciplinary Collaboration in an Academic Context. Journal of Education for Library and Information Science 51(4), 210–221 (2010)Google Scholar
  14. 14.
    Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. ACM SIGKDD Explorations Newsletter 11(1), 10–18 (2009)CrossRefGoogle Scholar
  15. 15.
    Kriegel, H.P., Kröger, P., Zimek, A.: Clustering High-Dimensional Data: A Survey on Subspace Clustering, Pattern-Based Clustering, and Correlation Clustering. ACM Transactions on Knowledge Discovery from Data (TKDD) 3(1), 1–58 (2009)CrossRefGoogle Scholar
  16. 16.
    Lamontagne, L.: Textual CBR Authoring Using Case Cohesion. In: Proceedings of the 2006 Workshop on Textual CBR, pp. 33–43 (2006)Google Scholar
  17. 17.
    Leake, D.B. (ed.): Case-Based Reasoning: Experiences, Lessons, and Future Directions. AAAI Press/MIT Press, Menlo Park, CA (1996)Google Scholar
  18. 18.
    Liu, B., Lee, W.S., Yu, P., Li, X.: Partially Supervised Classification of Text Documents. In: Proceedings of the Nineteenth International Conference on Machine Learning (2002)Google Scholar
  19. 19.
    Liu, B., Dai, Y., Li, X., Lee, W.S., Yu, P.S.: Building text classifiers using positive and unlabeled examples. In: Third IEEE International Conference on Data Mining, pp. 179–186. IEEE (2003)Google Scholar
  20. 20.
    Massie, S., Craw, S., Wiratunga, N.: When Similar Problems Don’t Have Similar Solutions. In: Weber, R.O., Richter, M.M. (eds.) ICCBR 2007. LNCS (LNAI), vol. 4626, pp. 92–106. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  21. 21.
    Massie, S., Wiratunga, N., Craw, S., Donati, A., Vicari, E.: From anomaly reports to cases. In: Weber, R.O., Richter, M.M. (eds.) ICCBR 2007. LNCS (LNAI), vol. 4626, pp. 359–373. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  22. 22.
    Müller, E., Günnemann, S., Assent, I., Seidl, T.: Evaluating clustering in subspace projections of high dimensional data. Proceedings of the VLDB Endowment 2(1), 1270–1281Google Scholar
  23. 23.
    Plaza, E.: Semantics and experience in the future web. Advances in Case-Based Reasoning, 44–58 (2008)Google Scholar
  24. 24.
    Quijano-Sánchez, L., Bridge, D., Díaz-Agudo, B., Recio-García, J.A.: A Case-Based Solution to the Cold-Start Problem in Group Recommenders. In: Agudo, B.D., Watson, I. (eds.) ICCBR 2012. LNCS, vol. 7466, pp. 342–356. Springer, Heidelberg (2012)CrossRefGoogle Scholar
  25. 25.
    Richter, M.M., Weber, R.O.: Case-based reasoning: a textbook. Springer, Berlin (in press, 2013)Google Scholar
  26. 26.
    Smyth, B., McKenna, E.: Footprint-based retrieval. In: Althoff, K.-D., Bergmann, R., Branting, L.K. (eds.) ICCBR 1999. LNCS (LNAI), vol. 1650, pp. 343–357. Springer, Heidelberg (1999)CrossRefGoogle Scholar
  27. 27.
    Yu, H., Han, J., Chang, K.C.-C.: PEBL: Web Page Classification Without Negative Examples. IEEE Trans. Knowledge and Data Engineering 16(1), 70–81 (2004)CrossRefGoogle Scholar
  28. 28.
    Zhou, X.F., Shi, Z.L., Zhao, H.C.: Reexamination of CBR hypothesis. In: Bichindaritz, I., Montani, S. (eds.) ICCBR 2010. LNCS, vol. 6176, pp. 332–345. Springer, Heidelberg (2010)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Sidath Gunawardena
    • 1
  • Rosina O. Weber
    • 1
  • Julia Stoyanovich
    • 1
  1. 1.The iSchoolDrexel UniversityPhiladelphiaUSA

Personalised recommendations