Skip to main content

Using Machine Learning to Support Resource Quality Assessment: An Adaptive Attribute-Based Approach for Health Information Portals

  • Conference paper
Database Systems for Adanced Applications (DASFAA 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6637))

Included in the following conference series:

Abstract

Labor-intensity of resource quality assessment is a bottleneck for content management in metadata-driven health information portals. This research proposes an adaptive attribute-based approach to assist informed judgments when assessing the quality of online information resources. It employs intelligent learning techniques to predict values of resource quality attributes based on previous value judgments encoded in resource metadata descriptions. The proposed approach is implemented as an intelligent quality attribute learning component of a portal’s content management system. This paper introduces the required machine learning procedures for the implementation of the component. Its prediction performance was evaluated via a series of machine learning experiments, which demonstrated the feasibility and the potential usefulness of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Benigeri, M., Pluye, P.: Shortcomings of Health Information on the Internet. Health Promotion International 18, 381–386 (2003)

    Article  Google Scholar 

  2. Evans, J., Manaszewicz, R., Xie, J.: The Role of Domain Expertise in Smart, User-Sensitive, Health Information Portal. In: the 42nd Hawaii International Conference on System Sciences, HICSS-42 (2009)

    Google Scholar 

  3. Xie, J.: Sustaining Quality Assessment Processes in User-Centred Health Information Portals. In: The 15th Americas Conference on Information Systems, AMCIS 2009 (2009)

    Google Scholar 

  4. Stvilia, B., Gasser, L., Twidale, M.B., Smith, L.C.: A Framework for Information Quality Assessment. Journal of the American Society for Information Science and Technology (JASIST) 58, 1720–1733 (2007)

    Article  Google Scholar 

  5. Wang, R.Y., Strong, D.M.: Beyond Accuracy: What Data Quality Means to Data Consumers. Journal of Management Information Systems 12, 5–33 (1996)

    Article  Google Scholar 

  6. Griffiths, K.M., Tang, T.T., Hawking, D., Christensen, H.: Automated Assessment of the Quality of Depression Websites. Journal of Medical Internet Research 7, e59 (2005)

    Google Scholar 

  7. Sessions, V., Valtorta, M.: Towards a Method for Data Accuracy Assessment Utilizing a Bayesian Network Learning Algorithm. Journal of Data and Information Quality 1, 1–34 (2009)

    Article  Google Scholar 

  8. Burstein, F., Fisher, J., McKemmish, S., Manaszewicz, R., Malhotra, P.: User Centred Quality Health Information Provision: Benefits and Challenges. In: The 38th Annual Hawaii International Conference on System Sciences, HICSS 2005 (2005)

    Google Scholar 

  9. McKemmish, S., Manaszewicz, R., Burstein, F., Fisher, J.: Consumer Empowerment through Metadata-Based Quality Reporting: The Breast Cancer Knowledge Online Portal. Journal of the American Society for Information Science and Technology (JASIST) 60, 1792–1807 (2009)

    Article  Google Scholar 

  10. Wang, R.Y., Reddy, M.P., Kon, H.B.: Toward Quality Data: An Attribute-Based Approach. Decision Support Systems 13, 349–372 (1995)

    Article  Google Scholar 

  11. Anderson, J., McKemmish, S., Manaszewicz, R.: Quality Criteria Models Used to Evaluate Health Websites. In: The 10th Asia Pacific Special Health and Law Librarians Conference, pp. 337–354 (2003)

    Google Scholar 

  12. Williamson, K., Manaszewicz, R.: Breast Cancer Information Needs and Seeking: Towards an Intelligent, User Sensitive Portal to Breast Cancer Knowledge Online. The New Review of Information Behaviour Research 3, 203–219 (2003)

    Google Scholar 

  13. Eysenbach, G., Diepgen, T.L.: Towards Quality Management of Medical Information on the Internet: Evaluation, Labelling, and Filtering of Information. British Medical Journal (BMJ) 317, 1496–1502 (1998)

    Article  Google Scholar 

  14. Wang, Y., Liu, Z.: Automatic Detecting Indicators for Quality of Health Information on the Web. International Journal of Medical Informatics 76, 575–582 (2007)

    Article  Google Scholar 

  15. McKemmish, S., Manaszewicz, R., Cheah, C.: Bckonline Metadata Schema Version 1.0 (2004), http://www.sims.monash.edu.au/research/eirg/BCKO_MetadataSchema_Version16.doc

  16. Griffiths, K.M., Christensen, H.: Website Quality Indicators for Consumers. Journal of Medical Internet Research 7, e55 (2005)

    Google Scholar 

  17. Price, S.L., Hersh, W.R.: Filtering Web Pages for Quality Indicators: An Empirical Approach to Finding High Quality Consumer Health Information on the World Wide Web. In: AMIA 1999 Annual Symposium, pp. 911–915 (1999)

    Google Scholar 

  18. Katerattanakul, P., Siau, K.: Measuring Information Quality of Web Sites: Development of an Instrument. In: The 20th International Conference on Information Systems, pp. 279–285 (1999)

    Google Scholar 

  19. Zhu, J., Gauch, S.: Incorporating Quality Metrics in Centralized/Distributed Information Retrieval on the World Wide Web. In: The 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 288–295. ACM Press, New York (2000)

    Google Scholar 

  20. Conrad, J.G., Leidner, J.L., Schilder, F.: Professional Credibility: Authority on the Web. In: The 2nd ACM Workshop on Information Credibility on the Web. ACM, New York (2008)

    Google Scholar 

  21. Freeman, K.S., Spyridakis, J.H.: An Examination of Factors That Affect the Credibility of Online Health Information. Technical Communication 51, 239–263 (2004)

    Google Scholar 

  22. Aladwani, A.M., Palvia, P.C.: Developing and Validating an Instrument for Measuring User-Perceived Web Quality. Information and Management 39, 467–476 (2002)

    Article  Google Scholar 

  23. Hatala, M., Richards, G.: Value-Added Metatagging: Ontology and Rule Based Methods for Smarter Metadata. In: Rules and Rule Markup Languages for the Semantic Web: Second International Workshop, pp. 65–80. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  24. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Technologies. Elsevier, Amsterdam (2005)

    MATH  Google Scholar 

  25. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The Weka Data Mining Software: An Update. SIGKDD Explorations 11, 10–18 (2009)

    Article  Google Scholar 

  26. Evangelista, P.F., Embrechts, M.J., Szymanski, B.K.: Taming the Curse of Dimensionality in Kernels and Novelty Detection. In: Abraham, A., de Baets, B., Köppen, M., Nickolay, B. (eds.) Applied Soft Computing Technologies: The Challenge of Complexity, pp. 425–438. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  27. Akbani, R., Kwek, S., Japkowicz, N.: Applying Support Vector Machines to Imbalanced Datasets. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 39–50. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xie, J., Burstein, F. (2011). Using Machine Learning to Support Resource Quality Assessment: An Adaptive Attribute-Based Approach for Health Information Portals. In: Xu, J., Yu, G., Zhou, S., Unland, R. (eds) Database Systems for Adanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20244-5_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20244-5_50

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20243-8

  • Online ISBN: 978-3-642-20244-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics