Semi-automatic Content Extraction from Specifications

  • Krishnaprasad Thirunarayan
  • Aaron Berkovich
  • Dan Sokol
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2553)


Specifications are critical to companies involved in complex manufacturing. The constant reading, reviewing, and analysis of materials and process specifications is extremely labor-intensive, qualityimpacting, and time-consuming. A conceptual design for a tool that provides computer-assistance in the interpretation of specification requirements has been created and a strategy for semantic-markup, which is the overlaying of abstract syntax (“the essence”) on the text, has been developed. The solution is based on the techniques for Information Extraction and the XML technology, and it captures the specification content within a semantic ontology. The working prototype of the tool being built will serve as the foundation for potential full-scale commercialization.


Information Extraction Original Spec Literal Translation Integrate Development Environment Concurrent Engineer 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Sokol, D.Z.: Concurrent Engineering in the Materials Industry: Case Study in the Application of Information Technology, Fourth Annual Conference on Management of Technology, 1994.Google Scholar
  2. 2.
    Sokol, D.Z., Rowe, J.: Integrating STEP and SGML for Concurrent Engineering, CALS 95 International Expo.Google Scholar
  3. 3.
    Sokol, D.Z.: Concurrent Engineering Design System for High Technology Material Suppliers, NSF Phase II Final Report, 1997.Google Scholar
  4. 4.
    Soderland S. G.: Learning Information Extraction Rules for Semi-structured and Free Text, Machine Learning, Vol. 34, No. 1-3 (1999) 233–272.zbMATHCrossRefGoogle Scholar
  5. 7.
    Hobbs J., Appelt D., Bear J., Israel D., Kameyama M., Stickel M., and Tyson M.: FASTUS: Extracting Information from Natural-Language Text, 1996. (
  6. 8.
    Grishman R.: The NYU System for MUC-6 or Where’s the Syntax?, Procs. of MUC-6 (1995).Google Scholar
  7. 9.
    Lehnert W.G., Cardie C., Fisher D., McCarthy J., Riloff E., and Soderland S.: Evaluating an Information Extraction System, Journal of Integrated Computer-Aided Engineering, 1(6) (1994) 453–472.Google Scholar
  8. 10.
    Riloff, E.: Automatically Constructing a Dictionary for Information Extraction Tasks, Proceedings of the Eleventh Annual Conference on Artificial Intelligence (1994) 811–816.Google Scholar
  9. 11.
    Fujii, A., and Ishikawa, T.: Cross-Language Information Retrieval for Technical Documents (1996).Google Scholar
  10. 12.
    Grishman R.: Information Extraction: Techniques and Challenges, Information Extraction (International Summer School SCIE-97), ed. Maria Teresa Pazienza, Springer-Verlag, 1997.Google Scholar
  11. 13.
    Soderland S.G.: CRYSTAL: Learning Domain-specific Text Analysis Rules, CIIR Technical Report # 43, University of Massachusetts at Amherst.Google Scholar
  12. 14.
    Du Charme B.: XSLT Quickly, Manning Publications Co. (2001).Google Scholar
  13. 15.
    Tidwell D.: XSLT, O’Reilly (2001).Google Scholar
  14. 16.
    Harold E. R.: XML Bible, Hungry Minds Inc. (1999).Google Scholar
  15. 17.
    Dietel H. M.: et al, XML: How to Program, Prentice Hall Inc. (2000).Google Scholar
  16. 18.
    Porter M. F.: An Algorithm for Suffix Stripping, Program, Vol. 14, No. 3, (1990), 130–137.Google Scholar
  17. 19.
    McCarthy J.: A Trainable Approach to Coreference Resolution for Information Extraction, PhD Thesis. Dept. of Computer Science Technical Report # 78, University of Massachusetts, Amherst.Google Scholar
  18. 20.
    van Harmelen F. and Fensel D.: Practical Knowledge Representation for the Web. Practical Knowledge Representation for the Web. In Proceedings of the Workshop on Intelligent Information Integration (III99), (1999) IJCAI-99.Google Scholar
  19. 21.
    Muslea, I.: Extraction Patterns for Information Extraction Tasks: A Survey, In Proceedings of AAAI-99 Workshop on Machine Learning for Information Extraction, (1999) AAAI-99.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Krishnaprasad Thirunarayan
    • 1
  • Aaron Berkovich
    • 2
  • Dan Sokol
    • 2
  1. 1.Department of Computer Science and EngineeringWright State UniversityDaytonUSA
  2. 2.Cohesia CorporationDaytonUSA

Personalised recommendations