Skip to main content

Text-to-Video: Story Illustration from Online Photo Collections

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6279))

Abstract

We present a first system to semi-automatically create a visual representation for a given, short text. We first parse the input text, decompose it into suitable units, and construct meaningful search terms. Using these search terms we retrieve a set of candidate images from online photo collections. We then select the final images in a user-assisted process and automatically create a storyboard or photomatic animation. We demonstrate promising initial results on several types of texts.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chen, T., Cheng, M.-M., Tan, P., Shamir, A., Hu, S.-M.: Sketch2Photo: Internet image montage. ACM Trans. Graph. 28(5) (2009)

    Google Scholar 

  2. Cowie, J., Lehnert, W.: Information extraction. ACM Commun. 39(1), 80–91 (1996)

    Article  Google Scholar 

  3. Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A framework and graphical development environment for robust NLP tools and applications. In: Proc. 40th Anniv. Meeting of the Assoc. for Comp. Ling. (2002)

    Google Scholar 

  4. de Marneffe, M.-C., MacCartney, B., Manning, C.D.: Generating typed dependency parses from phrase structure parses. In: LREC 2006 (2006)

    Google Scholar 

  5. Guy, M., Tonkin, E.: Folksonomies, tidying up tags? D-Lib Magazine 1 (2006)

    Google Scholar 

  6. Hays, J., Efros, A.A.: Scene completion using millions of photographs. ACM Trans. Graph. (2007)

    Google Scholar 

  7. Johnson, M.K., Dale, K., Avidan, S., Pfister, H., Freeman, W.T., Matusik, W.: CG2Real: Improving the realism of computer generated images using a large collection of photographs. Technical Report MIT-CSAIL-TR-2009-034 (2009)

    Google Scholar 

  8. Lalonde, J.-F., Hoiem, D., Efros, A.A., Rother, C., Winn, J., Criminisi, A.: Photo clip art. ACM Trans. Graph (2007)

    Google Scholar 

  9. Mehtre, B.M., Kankanhalli, M.S., Narasimhalu, A.D., Man, G.C.: Color matching for image retrieval. Pattern Recogn. Lett. 16(3), 325–331 (1995)

    Article  Google Scholar 

  10. Neumann, L., Neumann, A.: Color style transfer techniques using hue, lightness and saturation histogram matching. In: Comp. Aesthetics in Graphics, Visualization and Imaging, pp. 111–122 (2005)

    Google Scholar 

  11. Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)

    Google Scholar 

  12. Snavely, N., Simon, I., Goesele, M., Szeliski, R., Seitz, S.M.: Scene reconstruction and visualization from community photo collections. Proc. of the IEEE, Special Issue on Internet Vision (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schwarz, K., Rojtberg, P., Caspar, J., Gurevych, I., Goesele, M., Lensch, H.P.A. (2010). Text-to-Video: Story Illustration from Online Photo Collections. In: Setchi, R., Jordanov, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2010. Lecture Notes in Computer Science(), vol 6279. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15384-6_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15384-6_43

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15383-9

  • Online ISBN: 978-3-642-15384-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics