Information extraction from multimedia web documents: an open-source platform and testbed

Dupplaw, David Paul; Matthews, Michael; Johansson, Richard; Boato, Giulia; Costanzo, Andrea; Fontani, Marco; Minack, Enrico; Demidova, Elena; Blanco, Roi; Griffiths, Thomas; Lewis, Paul; Hare, Jonathon; Moschitti, Alessandro

doi:10.1007/s13735-014-0051-2

Regular Paper
Published: 21 March 2014

Volume 3, pages 97–111, (2014)

International Journal of Multimedia Information Retrieval

David Paul Dupplaw¹,
Michael Matthews²,
Richard Johansson³,
Giulia Boato³,
Andrea Costanzo⁴,
Marco Fontani⁴,
Enrico Minack⁵,
Elena Demidova⁵,
Roi Blanco²,
Thomas Griffiths⁶,
Paul Lewis¹,
Jonathon Hare¹ &
Alessandro Moschitti³

Abstract

The LivingKnowledge project aimed to enhance the current state of the art in search, retrieval and knowledge management on the web by advancing the use of sentiment and opinion analysis within multimedia applications. To achieve this aim, a diverse set of novel and complementary analysis techniques has been integrated into a single but extensible software platform on which such applications can be built. The platform combines state-of-the-art techniques for extracting facts, opinions and sentiment from multimedia documents and, unlike earlier platforms, it exploits both visual and textual techniques to support multimedia information retrieval. Foreseeing the usefulness of this software in the wider community, the platform has been made generally available as an open-source project. This paper describes the platform design, gives an overview of the analysis algorithms integrated into the system and describes two applications that utilise the system for multimedia information retrieval.

Notes

http://diversityengine.org.
http://solr.apache.org/.
The Diversity Engine documentation contains speed estimates for each analysis tool.
http://hadoop.apache.org.
https://sites.google.com/site/hcirworkshop/hcir-2010/challenge.
http://opennlp.sourceforge.net.
http://www.timeml.org/site/tarsqi/.
http://www.timeml.org/site/index.html.
Now available as open source through the OpenIMAJ project at http://openimaj.org/.
http://trec.nist.gov/data/web10.html.
http://fbmya01.barcelonamedia.org:8080/future/.

Acknowledgments

This work was supported by the European Union under the Seventh Framework project LivingKnowledge (IST-FP7-231126). We would also like to thank all our partners who contributed to the LivingKnowledge project on which this work has been built.

Author information

Authors and Affiliations

University of Southampton, Southampton, UK
David Paul Dupplaw, Paul Lewis & Jonathon Hare
Barcelona Media, Barcelona, Spain
Michael Matthews & Roi Blanco
University of Trento, Trento, Italy
Richard Johansson, Giulia Boato & Alessandro Moschitti
CNIT, Florence, Italy
Andrea Costanzo & Marco Fontani
Leibniz Universität Hannover, Hannover, Germany
Enrico Minack & Elena Demidova
SORA, Vienna, Austria
Thomas Griffiths

Corresponding author

Correspondence to David Paul Dupplaw.

About this article

Cite this article

Dupplaw, D.P., Matthews, M., Johansson, R. et al. Information extraction from multimedia web documents: an open-source platform and testbed. Int J Multimed Info Retr 3, 97–111 (2014). https://doi.org/10.1007/s13735-014-0051-2

Received: 15 August 2013
Revised: 11 January 2014
Accepted: 20 February 2014
Published: 21 March 2014
Issue Date: June 2014
DOI: https://doi.org/10.1007/s13735-014-0051-2
