Research Frontiers

Nie, Liqiang; Liu, Meng; Song, Xuemeng

doi:10.1007/978-3-031-02255-5_7

Liqiang Nie²,
Meng Liu² &
Xuemeng Song²

Part of the book series: Synthesis Lectures on Image, Video, and Multimedia Processing ((SLIVMP))

59 Accesses

Abstract

In this book, we investigate some application-motivated problems, namely the research problems of micro-video understanding. To solve these problems, we design some general principles, methodologies, and optimizations by jointly learning from multiple correlated modalities of the given micro-videos, including the textual, visual, acoustic, and social ones. They are empirically validated on multiple real-world datasets. In particular, we first introduce the proliferation of micro-video services and identify three practical tasks of micro-video understanding: popularity prediction, venue category estimation, and micro-video routing. Based upon these tasks, we analyze the unique research challenges of micro-videos that are distinct from traditional long videos, such as information sparseness, hierarchical structure, low-quality, multimodal sequential data, as well as lack of benchmark datasets. To address these problems, we present a series of multimodal learning methods, consisting of multimodal transductive learning, multimodal cooperative learning, multimodal transductive learning and multimodal sequential learning. These theoretical methods are verified over three datasets we constructed. To facilitate other researchers, we have released the codes, parameter settings, as well as the three datasets. We have to emphasize that learning from multiple modalities of the given micro-videos is still a young and highly promising research field. There are many unexplored but fruitful future directions and challenging research issues. We illustrate a few of them here.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

Shandong University, Jinan, China
Liqiang Nie, Meng Liu & Xuemeng Song

Authors

Liqiang Nie
View author publications
You can also search for this author in PubMed Google Scholar
Meng Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xuemeng Song
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Nie, L., Liu, M., Song, X. (2019). Research Frontiers. In: Multimodal Learning toward Micro-Video Understanding. Synthesis Lectures on Image, Video, and Multimedia Processing. Springer, Cham. https://doi.org/10.1007/978-3-031-02255-5_7

Download citation

DOI: https://doi.org/10.1007/978-3-031-02255-5_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-01127-6
Online ISBN: 978-3-031-02255-5
eBook Packages: Synthesis Collection of Technology (R0)eBColl Synthesis Collection 9

Publish with us

Policies and ethics