Contextual keyword extraction by building sentences with crowdsourcing

Hong, Soon Gill; Shin, Sungho; Yi, Mun Yong

doi:10.1007/s11042-012-1338-z

Published: 03 January 2013

Volume 68, pages 401–412, (2014)

Multimedia Tools and Applications

Soon Gill Hong¹,
Sungho Shin² &
Mun Yong Yi¹

Abstract

Automatic keyword extraction from documents has long been used and has proven useful in various areas. Crowdsourced tagging of multimedia resources has emerged and looks promising to a certain extent. Automatic approaches for unstructured data, automatic keyword extraction, and crowdsourced tagging are efficient, but they all suffer from a lack of contextual understanding. In this paper, we propose a new model for extracting key contextual terms from unstructured data, especially documents, with crowdsourcing. The model consists of four sequential processes: (1) term selection by frequency, (2) sentence building, (3) revised term selection reflecting the newly built sentences, and (4) sentence voting. Online workers read only a fraction of a document and participated in the sentence building and sentence voting processes, and key sentences were generated as a result. We compared the generated sentences to the keywords entered by the author and to the sentences generated by offline workers who read the whole document. The results support the idea that the sentence building process can help select terms with more contextual meaning, closing the gap between keywords from automated approaches and the contextual understanding required by humans.
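The four processes named in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the abstract does not give the selection rule, the revision rule, or the voting scheme, so the stopword list, the "count how many worker sentences use a term" revision rule, and the simple plurality vote below are all assumptions, as are the function names. Step (2), sentence building, is done by human workers, so it appears here only as hard-coded example input.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption, not from the paper).
STOPWORDS = {"a", "an", "the", "to", "of", "and", "in"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def select_terms(fragment, k):
    """Step 1: pick the k most frequent terms in a document fragment.

    Ties are broken alphabetically so the result is deterministic.
    """
    counts = Counter(tokenize(fragment))
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def revise_terms(sentences, k):
    """Step 3 (assumed rule): rank terms by how many worker sentences use them."""
    usage = Counter()
    for sentence in sentences:
        usage.update(set(tokenize(sentence)))  # count each term once per sentence
    ranked = sorted(usage.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def rank_sentences(sentences, ballots):
    """Step 4 (assumed scheme): order sentences by plurality vote.

    Each ballot is the index of the sentence a worker voted for.
    """
    tally = Counter(ballots)
    order = sorted(range(len(sentences)), key=lambda i: (-tally[i], i))
    return [sentences[i] for i in order]


# Example input: a made-up fragment and made-up worker sentences (step 2).
fragment = "Crowd workers tag images. Crowd workers build sentences. Tags lack context."
worker_sentences = [
    "Crowd workers build sentences to add context.",
    "Workers add context to tags.",
]

initial_terms = select_terms(fragment, 3)
revised_terms = revise_terms(worker_sentences, 3)
key_sentences = rank_sentences(worker_sentences, ballots=[0, 0, 1])
```

In this toy run the revised terms include "context", which the frequency-only selection missed. That is the kind of shift toward contextual terms the abstract describes, although the real model's rules may differ substantially from the ones assumed here.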

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MEST) (No. 2011‐0029185).

Author information

Authors and Affiliations

Department of Knowledge Service Engineering, KAIST, Daejeon, Republic of Korea
Soon Gill Hong & Mun Yong Yi
Korea Institute of Science and Technology Information, Daejeon, Republic of Korea
Sungho Shin

Corresponding author

Correspondence to Mun Yong Yi.

About this article

Cite this article

Hong, S.G., Shin, S. & Yi, M.Y. Contextual keyword extraction by building sentences with crowdsourcing. Multimed Tools Appl 68, 401–412 (2014). https://doi.org/10.1007/s11042-012-1338-z

Issue Date: January 2014
DOI: https://doi.org/10.1007/s11042-012-1338-z
