User-Constrained Clustering in Online Requirements Forums

  • Conference paper
Requirements Engineering: Foundation for Software Quality (REFSQ 2015)

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 9013)

Abstract

[Context & motivation:] Software development projects involving geographically dispersed stakeholders often use web-based discussion forums to gather feature requests. Our previous study showed that users tend to create redundant threads as well as large, unfocused mega-threads. [Question/problem:] In this paper we propose a novel solution for integrating user feedback into the process of dynamically and iteratively clustering features into discussion threads. [Principal ideas/results:] We integrate feedback in the form of stick-together and move-apart advice, plus user-defined tags, into our consensus-based clustering process. [Contribution:] Experimental results demonstrate that our approach is able to deliver high-quality and stable clusters to facilitate forum-based requirements elicitation.
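To make the idea of stick-together and move-apart advice concrete, the following is a minimal sketch and not the authors' implementation. It assumes a simple evidence-accumulation style of consensus: several base partitions vote on whether each pair of feature requests belongs together, and user advice then overrides the votes for the pairs it names. The function names (`co_association`, `apply_advice`, `threshold_clusters`), the 0.5 threshold, and the hard-override rule are all hypothetical choices made for illustration. User-defined tags are not modelled here.

```python
from itertools import combinations


def co_association(partitions, n_items):
    """Fraction of base partitions in which each pair of items shares a cluster."""
    matrix = [[0.0] * n_items for _ in range(n_items)]
    for labels in partitions:
        for i, j in combinations(range(n_items), 2):
            if labels[i] == labels[j]:
                matrix[i][j] += 1.0
                matrix[j][i] += 1.0
    count = len(partitions)
    for i in range(n_items):
        for j in range(n_items):
            matrix[i][j] = 1.0 if i == j else matrix[i][j] / count
    return matrix


def apply_advice(matrix, stick_together, move_apart):
    """Override pairwise scores with user advice (assumed hard-override rule)."""
    adjusted = [row[:] for row in matrix]
    for i, j in stick_together:
        adjusted[i][j] = adjusted[j][i] = 1.0
    for i, j in move_apart:
        adjusted[i][j] = adjusted[j][i] = 0.0
    return adjusted


def threshold_clusters(matrix, threshold=0.5):
    """Connected components over pairs scoring above the threshold.

    Caveat: a move-apart pair can still end up together if both items
    link strongly to a common third item; this sketch does not repair that.
    """
    n = len(matrix)
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in combinations(range(n), 2):
        if matrix[i][j] > threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Three hypothetical base clusterings of four feature requests (items 0..3).
partitions = [[0, 0, 1, 1], [0, 0, 0, 1], [0, 1, 1, 1]]
votes = co_association(partitions, 4)
before = threshold_clusters(votes)
after = threshold_clusters(
    apply_advice(votes, stick_together=[(0, 1)], move_apart=[(1, 2)])
)
```

In this toy input the chained majority votes merge all four items into one mega-thread, and a single move-apart hint on the pair (1, 2) is enough to split it into two threads. A production system would more likely treat advice as soft evidence and re-cluster iteratively, which this sketch does not attempt.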

Corresponding author

Correspondence to Jane Cleland-Huang.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Duan, C., Dumitru, H., Cleland-Huang, J., Mobasher, B. (2015). User-Constrained Clustering in Online Requirements Forums. In: Fricker, S., Schneider, K. (eds) Requirements Engineering: Foundation for Software Quality. REFSQ 2015. Lecture Notes in Computer Science, vol 9013. Springer, Cham. https://doi.org/10.1007/978-3-319-16101-3_21

  • DOI: https://doi.org/10.1007/978-3-319-16101-3_21

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16100-6

  • Online ISBN: 978-3-319-16101-3

  • eBook Packages: Computer Science, Computer Science (R0)
