Information Reuse and Integration in Academia and Industry

pp 187-200


ReadFast: Structural Information Retrieval from Biomedical Big Text by Natural Language Processing

  • Michael GubanovAffiliated withUniversity of Washington Email author 
  • , Linda ShapiroAffiliated withUniversity of Washington
  • , Anna PyaytAffiliated withStanford University

* Final gross prices may vary according to local VAT.

Get Access


While the problem to find needed information on the Web is being solved by the major search engines, access to the information in Big text, large-scale text datasets, and documents (Biomedical literature, e-books, conference proceedings, etc.) is still very rudimentary (Lin and Cohen (2010) A very fast method for clustering big text datasets. In: ECAI, Lisbon). Thus, keyword-search is often the only way to find the needle in the haystack. There is abundance of relevant research results in the Semantic Web research community that offers more robust access interfaces compared to keyword-search. Here we describe a new information retrieval engine that offers advanced user experience combining keyword-search with navigation over an automatically inferred hierarchical document index. The internal representation of the browsing index as a collection of UFOs (Gubanov et al. (2009) Ibm ufo repository. In: VLDB, Lyon; Gubanov et al. (2011) Learning unified famous objects (ufo) to bootstrap information integration. In: IEEE IRI, Las Vegas) yields more relevant search results and improves user experience.