Multimedia Tools and Applications

, Volume 7, Issue 1–2, pp 83–101 | Cite as

Brahma: Browsing and Retrieval Architecture for Hierarchical Multimedia Annotation

  • Asit Dan
  • Dinkar Sitaram
  • Junehwa Song


Traditional browsing of large multimedia documents (e.g., video, audio) is primarily sequential. In the absence of an index structure browsing and searching for relevant information in a long video, audio or other multimedia document becomes difficult. Manual annotation can be used to mark various segments of such documents. Different segments can be combined to create new annotated segments, thus creating hierarchical annotation structures. Given the lack of structure in media data, it is natural for different users to have different views on the same media data. Therefore, different users can create different annotation structures. Users may also share some or all of each other's annotation structures. The annotation structure can be browsed or used to playback as a composed video consisting of different segments. Finally, the annotation structures can be manipulated dynamically by different users to alter views on a document. BRAHMA is a multimedia environment for browsing and retrieval of multimedia documents based on such hierarchical annotation structures.

media annotation retrieval architecture hierarchically annotated media video browsing and viewgroup management 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    J. Allen, “Maintaining knowledge about temporal intervals,” CACM, Vol. 26, No. 11, 1983.Google Scholar
  2. 2.
    K.S. Candan, B. Prabhakaran, and V.S. Subrahmanian, “Retrieval schedules based on resource availability and flexible presentation specifications,” Technical Report, University of Maryland, 1996.Google Scholar
  3. 3.
    A. Dan and D. Sitaram, “Generalized interval caching policy for mixed interactive and long video environments,” Multimedia Computing and Networking, Jan. 1996.Google Scholar
  4. 4.
    A. Dan, D. Sitaram, and P. Shahabuddin, “Dynamic batching policies for an on-demand video server,” ACM Multimedia Systems, 1996.Google Scholar
  5. 5.
    A. Hampapur and H. Harashima, “Production model based digital video segmentation,” Multimedia Tools and Applications, Vol. 1, pp. 9–46, March 1995.Google Scholar
  6. 6.
    R. Hjelsvold and R. Midstraum, “Modelling and querying video data,” in Proceedings of VLDB Conference, 1994, pp. 686–694.Google Scholar
  7. 7.
    M. Kim and J. Song, “Multimedia documents with elastic time,” in ACM Multimedia Conference'95, 1995.Google Scholar
  8. 8.
    M. Kim and J. Song, “Hypeatrstories: Combining time, space, and asynchrony in multimedia documents,” Technical Report, IBM, RC 19277, 1995.Google Scholar
  9. 9.
    T.D.C. Little and A. Ghafoor, “Multimedia synchronization protocols for broadband integrated services,” IEEE JSAC, Vol. 9, No. 9, pp. 1368–1382, Dec. 1991.Google Scholar
  10. 10.
    S. Marcus and V.S. Subrahmanian, “Foundations of multimedia database systems,” Journal of ACM, Vol. 43, No. 3, pp. 474–523, May 1996.CrossRefGoogle Scholar
  11. 11.
    A. Nagasaka and Y. Tanaka, “Automatic video indexing and full-motion search for object appearance,” in IFIP TC2/WG2.6 Second Working Conference on Visual Database Systems, Sept. 1991, pp. 113–127.Google Scholar
  12. 12.
    R.T. Ng and P. Shum, “Optimal clip ordering for news on-demand queries,” in 2nd International Workhop on Multimedia Information Systems, Sept. 1996, pp. 1–5.Google Scholar
  13. 13.
    K. Otsuji, Y. Tonomura, and Y. Ohba, “Video browsing using brightness data,” Visual Communications and Image Processing, pp. 980–989, 1991.Google Scholar
  14. 14.
    R.A. Rowe, J.S. Boreczky, and C.A. Eads, “Indexes for user access to large video databases,” in Storage and Retrieval for Image and Video Databases II, Symposium on Elec. Imaging Sci. and Tech., Feb. 1994.Google Scholar
  15. 15.
    J. Song, A. Dan, and D. Sitaram, “Jinsil: A system for presentation of composite multimedia objects in a distributed environment,” IBM T.J. Watson Research Center, Computer Science Research Report RC 20476, June 1996.Google Scholar
  16. 16.
    R. Weiss, A. Duda, and D.K. Gifford, “Composition and search with a video algebra,” IEEE Multimedia, pp. 12–25, Spring 1995.Google Scholar
  17. 17.
    B.L. Yeo, “Efficient processing of compressed images and video,” Ph.D. Thesis, Princeton University, 1996.Google Scholar
  18. 18.
    H. Zhang, A. Kankanhalli, and S.W. Smoliar, “Automatic partitioning of full motion video,” Multimedia Systems Journal, Vol. 1, pp. 10–28, July 1999Google Scholar

Copyright information

© Kluwer Academic Publishers 1998

Authors and Affiliations

  • Asit Dan
    • 1
  • Dinkar Sitaram
    • 2
  • Junehwa Song
    • 3
  1. 1.IBM Research DivisionT.J. Watson Research CenterHawthorne
  2. 2.IBM Research DivisionT.J. Watson Research CenterHawthorne
  3. 3.University of MarylandCollege Park

Personalised recommendations