Dependency-Based Semantic Parsing for Concept-Level Text Analysis

Poria, Soujanya; Agarwal, Basant; Gelbukh, Alexander; Hussain, Amir; Howard, Newton

doi:10.1007/978-3-642-54906-9_10

Soujanya Poria^17,20,22,
Basant Agarwal¹⁸,
Alexander Gelbukh¹⁹,
Amir Hussain²⁰ &
…
Newton Howard²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8403))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

2277 Accesses
37 Citations

Abstract

Concept-level text analysis is superior to word-level analysis as it preserves the semantics associated with multi-word expressions. It offers a better understanding of text and helps to significantly increase the accuracy of many text mining tasks. Concept extraction from text is a key step in concept-level text analysis. In this paper, we propose a ConceptNet-based semantic parser that deconstructs natural language text into concepts based on the dependency relation between clauses. Our approach is domain-independent and is able to extract concepts from heterogeneous text. Through this parsing technique, 92.21% accuracy was obtained on a dataset of 3,204 concepts. We also show experimental results on three different text analysis tasks, on which the proposed framework outperformed state-of-the-art parsing techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agirre, E., Ansa, O., Hovy, E., Martínez, D.: Enriching very large ontologies using the www. arXiv preprint cs/0010026 (2000)
Google Scholar
Cambria, E., Gastaldo, P., Bisio, F., Zunino, R.: An ELM-based model for affective analogical reasoning. Neurocomputing (2014)
Google Scholar
Cambria, E., Hussain, A., Havasi, C., Eckl, C., Munro, J.: Towards crowd validation of the uk national health service. In: WebSci, Raleigh (2010)
Google Scholar
Cambria, E., White, B.: Jumping NLP curves: A review of natural language processing research. IEEE Computational Intelligence Magazine 9(2) (2014)
Google Scholar
Cao, C., Feng, Q., Gao, Y., Gu, F., Si, J., Sui, Y., Tian, W., Wang, H., Wang, L., Zeng, Q., et al.: Progress in the development of national knowledge infrastructure. Journal of Computer Science and Technology 17(5), 523–534 (2002)
Article MATH Google Scholar
Çelebi, A., Özgür, A.: N-gram parsing for jointly training a discriminative constituency parser. Polibits 48, 5–12 (2013)
Google Scholar
Chen, W.L., Zhu, J.B., Yao, T.S., Zhang, Y.X.: Automatic learning field words by bootstrapping. In: Proc. of the JSCL, vol. 72. Tsinghua University Press, Beijing (2003)
Google Scholar
De Marneffe, M.-C., Manning, C.D.: The stanford typed dependencies representation. In: Coling 2008: Proceedings of the Workshop on Cross-Framework and Cross-Domain Parser Evaluation, pp. 1–8. Association for Computational Linguistics (2008)
Google Scholar
Du, B., Tian, H.F., Wang, L., Lu, R.Z.: Design of domain-specific term extractor based on multi-strategy. Computer Engineering 31(14), 159–160 (2005)
Google Scholar
Gelfand, B., Wulfekuler, M., Punch, W.F.: Automated concept extraction from plain text. In: AAAI 1998 Workshop on Text Categorization, pp. 13–17 (1998)
Google Scholar
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, pp. 1–12 (2009)
Google Scholar
Harlambous, Y., Klyuev, V.: Thematically reinforced explicit semantic analysis. International Journal of Computational Linguistics and Applications 4(1), 79–94 (2013)
Google Scholar
Havasi, C., Speer, R., Alonso, J.: Conceptnet 3: A flexible, multilingual semantic network for common sense knowledge. In: Recent Advances in Natural Language Processing, pp. 27–29 (2007)
Google Scholar
Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th Conference on Computational Linguistics, vol. 2, pp. 539–545. Association for Computational Linguistics (1992)
Google Scholar
Howard, N., Cambria, E.: Intention awareness: Improving upon situation awareness in human-centric environments. Human-centric Computing and Information Sciences 3(9) (2013)
Google Scholar
Huang, G.-B., Zhu, Q.-Y., Siew, C.-K.: Extreme learning machine: Theory and applications. Neurocomputing 70(1), 489–501 (2006)
Article Google Scholar
Liu, L., Cao, C., Wang, H.: Acquiring hyponymy relations from large chinese corpus. WSEAS Transactions on Business and Economics 2(4), 211 (2005)
Google Scholar
Liu, L., Cao, C.-G., Wang, H.-T., Chen, W.: A method of hyponym acquisition based on “isa” pattern. Journal of Computer Science, 146–151 (2006)
Google Scholar
Mairesse, F., Walker, M.A., Mehl, M.R., Moore, R.K.: Using linguistic cues for the automatic recognition of personality in conversation and text. J. Artif. Intell. Res. (JAIR) 30, 457–500 (2007)
Google Scholar
Nakata, K., Voss, A., Juhnke, M., Kreifelts, T.: Collaborative concept extraction from documents. In: Proceedings of the 2nd Int. Conf. on Practical Aspects of Knowledge Management (PAKM 1998). Citeseer (1998)
Google Scholar
Paice, C.D.: Another stemmer. ACM SIGIR Forum 24(3), 56–61 (1990)
Article Google Scholar
Poria, S., Cambria, E., Hussain, A., Bin, G.-H.: Big multimodal data analysis. Elsevier Neural Networks (2014)
Google Scholar
Poria, S., Gelbukh, A., Agarwal, B., Cambria, E., Howard, N.: Common sense knowledge based personality recognition from text. In: Castro, F., Gelbukh, A., González, M. (eds.) MICAI 2013, Part II. LNCS, vol. 8266, pp. 484–496. Springer, Heidelberg (2013)
Chapter Google Scholar
Poria, S., Gelbukh, A., Cambria, E., Das, D., Bandyopadhyay, S.: Enriching senticnet polarity scores through semi-supervised fuzzy clustering. In: 2012 IEEE 12th International Conference on Data Mining Workshops (ICDMW), pp. 709–716. IEEE (2012)
Google Scholar
Poria, S., Gelbukh, A., Cambria, E., Yang, P., Hussain, A., Durrani, T.: Merging senticnet and wordnet-affect emotion lists for sentiment analysis. In: 2012 IEEE 11th International Conference on Signal Processing (ICSP), vol. 2, pp. 1251–1255. IEEE (2012)
Google Scholar
Poria, S., Gelbukh, A., Hussain, A., Howard, N., Das, D., Bandyopadhyay, S.: Enhanced senticnet with affective labels for concept-based opinion mining. IEEE Intelligent Systems 28(2), 31–38 (2013)
Article Google Scholar
Poria, S., Winterstein, G., Cambria, E., Gelbukh, A., Hussain, A., Bin, G.-H.: Dependency-based rules for concept-level sentiment analysis. Elsevier Knowledge-Based Systems (2014)
Google Scholar
Rajagopal, D., Cambria, E., Olsher, D., Kwok, K.: A graph-based approach to commonsense concept extraction and semantic similarity detection. In: Proceedings of the 22nd International Conference on World Wide Web Companion, pp. 565–570. International World Wide Web Conferences Steering Committee (2013)
Google Scholar
Ramirez, P.M., Mattmann, C.A.: Ace: Improving search engines via automatic concept extraction. In: Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration (IRI 2004), pp. 229–234. IEEE (2004)
Google Scholar
Sidorov, G.: Non-continuous syntactic n-grams. Polibits 48, 67–75 (2013)
Google Scholar
Sidorov, G.: Syntactic dependency based n-grams in rule based automatic English as second language grammar correction. International Journal of Computational Linguistics and Applications 4(2), 169–188 (2013)
Google Scholar
Sidorov, G., Velasquez, F., Stamatatos, E., Gelbukh, A., Chanona-Hernández, L.: Syntactic n-grams as machine learning features for natural language processing. Expert Systems with Applications 41(3), 853–860 (2014)
Article Google Scholar
Socher, R., Perelygin, A., Wu, J.Y., Chuang, J., Manning, C.D., Ng, A.Y., Potts, C.: Recursive deep models for semantic compositionality over a sentiment treebank. In: Conference on Empirical Methods in Natural Language Processing (EMNLP). EMNLP (2013)
Google Scholar
Velardi, P., Fabriani, P., Missikoff, M.: Using text processing techniques to automatically enrich a domain ontology. In: Proceedings of the International Conference on Formal Ontology in Information Systems, vol. 2001, pp. 270–284. ACM (2001)
Google Scholar
Yuntao, Z., Ling, G., Yongcheng, W., Zhonghang, Y.: An effective concept extraction method for improving text classification performance. Geo-Spatial Information Science 6(4), 66–72 (2003)
Article Google Scholar
Zheng, J.H., Lu, J.L.: Study of an improved keywords distillation method. Computer Engineering 31(18), 194–196 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electrical & Electronic Engineering, Nanyang Technological University, Singapore
Soujanya Poria
Department of Computer Engineering, Malaviya National Institute of Technology, India
Basant Agarwal
Centro de Investigación en Computación, Instituto Politécnico Nacional, Mexico
Alexander Gelbukh
Department of Computing Science and Mathematics, University of Stirling, UK
Soujanya Poria & Amir Hussain
MIT Media Laberotory, MIT, USA
Newton Howard
The Brain Sciences Foundation, Cambridge, MA, 02139, USA
Soujanya Poria

Authors

Soujanya Poria
View author publications
You can also search for this author in PubMed Google Scholar
Basant Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Gelbukh
View author publications
You can also search for this author in PubMed Google Scholar
Amir Hussain
View author publications
You can also search for this author in PubMed Google Scholar
Newton Howard
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Computing Research, National Polytechnic Institute, Av. Juan Dios Bátiz, Col. Nueva Industrial Vallejo, 07738, Mexico D.F., Mexico
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Poria, S., Agarwal, B., Gelbukh, A., Hussain, A., Howard, N. (2014). Dependency-Based Semantic Parsing for Concept-Level Text Analysis. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54906-9_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-54906-9_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54905-2
Online ISBN: 978-3-642-54906-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics