Text-to-Video: Story Illustration from Online Photo Collections

Schwarz, Katharina; Rojtberg, Pavel; Caspar, Joachim; Gurevych, Iryna; Goesele, Michael; Lensch, Hendrik P. A.

doi:10.1007/978-3-642-15384-6_43

Text-to-Video: Story Illustration from Online Photo Collections

Katharina Schwarz²³,
Pavel Rojtberg²⁴,
Joachim Caspar²⁴,
Iryna Gurevych²⁴,
Michael Goesele²⁴ &
…
Hendrik P. A. Lensch²³

Conference paper

1588 Accesses
6 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6279))

Abstract

We present a first system to semi-automatically create a visual representation for a given, short text. We first parse the input text, decompose it into suitable units, and construct meaningful search terms. Using these search terms we retrieve a set of candidate images from online photo collections. We then select the final images in a user-assisted process and automatically create a storyboard or photomatic animation. We demonstrate promising initial results on several types of texts.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chen, T., Cheng, M.-M., Tan, P., Shamir, A., Hu, S.-M.: Sketch2Photo: Internet image montage. ACM Trans. Graph. 28(5) (2009)
Google Scholar
Cowie, J., Lehnert, W.: Information extraction. ACM Commun. 39(1), 80–91 (1996)
Article Google Scholar
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A framework and graphical development environment for robust NLP tools and applications. In: Proc. 40th Anniv. Meeting of the Assoc. for Comp. Ling. (2002)
Google Scholar
de Marneffe, M.-C., MacCartney, B., Manning, C.D.: Generating typed dependency parses from phrase structure parses. In: LREC 2006 (2006)
Google Scholar
Guy, M., Tonkin, E.: Folksonomies, tidying up tags? D-Lib Magazine 1 (2006)
Google Scholar
Hays, J., Efros, A.A.: Scene completion using millions of photographs. ACM Trans. Graph. (2007)
Google Scholar
Johnson, M.K., Dale, K., Avidan, S., Pfister, H., Freeman, W.T., Matusik, W.: CG2Real: Improving the realism of computer generated images using a large collection of photographs. Technical Report MIT-CSAIL-TR-2009-034 (2009)
Google Scholar
Lalonde, J.-F., Hoiem, D., Efros, A.A., Rother, C., Winn, J., Criminisi, A.: Photo clip art. ACM Trans. Graph (2007)
Google Scholar
Mehtre, B.M., Kankanhalli, M.S., Narasimhalu, A.D., Man, G.C.: Color matching for image retrieval. Pattern Recogn. Lett. 16(3), 325–331 (1995)
Article Google Scholar
Neumann, L., Neumann, A.: Color style transfer techniques using hue, lightness and saturation histogram matching. In: Comp. Aesthetics in Graphics, Visualization and Imaging, pp. 111–122 (2005)
Google Scholar
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Google Scholar
Snavely, N., Simon, I., Goesele, M., Szeliski, R., Seitz, S.M.: Scene reconstruction and visualization from community photo collections. Proc. of the IEEE, Special Issue on Internet Vision (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Ulm University,
Katharina Schwarz & Hendrik P. A. Lensch
TU Darmstadt,
Pavel Rojtberg, Joachim Caspar, Iryna Gurevych & Michael Goesele

Authors

Katharina Schwarz
View author publications
You can also search for this author in PubMed Google Scholar
Pavel Rojtberg
View author publications
You can also search for this author in PubMed Google Scholar
Joachim Caspar
View author publications
You can also search for this author in PubMed Google Scholar
Iryna Gurevych
View author publications
You can also search for this author in PubMed Google Scholar
Michael Goesele
View author publications
You can also search for this author in PubMed Google Scholar
Hendrik P. A. Lensch
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering, Cardiff University, The Parade, CF24 3AA, Cardiff, UK
Rossitza Setchi
Dept. of Computer Science and Software Engineering, University of Portsmouth, BUckingham Building, Lion Terrace, PO1 3HE, Portsmouth, UK
Ivan Jordanov
KES International, 145-157 St. John Street, EC1V 4PY, London, UK
Robert J. Howlett
School of Electrical and Information Engineering, University of South Australia, Adelaide, Mawson Lakes Campus, 5095, SA, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schwarz, K., Rojtberg, P., Caspar, J., Gurevych, I., Goesele, M., Lensch, H.P.A. (2010). Text-to-Video: Story Illustration from Online Photo Collections. In: Setchi, R., Jordanov, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2010. Lecture Notes in Computer Science(), vol 6279. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15384-6_43

Download citation

DOI: https://doi.org/10.1007/978-3-642-15384-6_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15383-9
Online ISBN: 978-3-642-15384-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics