MirBot: A Multimodal Interactive Image Retrieval System
- Cite this paper as:
- Pertusa A., Gallego AJ., Bernabeu M. (2013) MirBot: A Multimodal Interactive Image Retrieval System. In: Sanches J.M., Micó L., Cardoso J.S. (eds) Pattern Recognition and Image Analysis. IbPRIA 2013. Lecture Notes in Computer Science, vol 7887. Springer, Berlin, Heidelberg
This study presents a multimodal interactive image retrieval system for smartphones (MirBot). The application is designed as a collaborative game where users can categorize photographs according to the WordNet hierarchy. After taking a picture, the region of interest of the target can be selected, and the image information is sent with a set of metadata to a server in order to classify the object. The user can validate the category proposed by the system to improve future queries. The result is a labeled database with a structure similar to ImageNet, but with contents selected by the users, fully marked with regions of interest, and with novel metadata that can be useful to constrain the search space in a future work. The MirBot app is freely available on the Apple app store.
KeywordsImage retrieval multimodality interactive labeling
Unable to display preview. Download preview PDF.