Advertisement

Operational Engines

  • Douglas W. Oard
  • Carl Madson
  • Joseph Olive
  • John McCary
  • Caitlin Christianson
Chapter

Abstract

The goal of the GALE Program is to empower the warfighter through the use of language technologies. In order to accomplish this goal, the technologies must be integrated into operational engines that meet the needs of both warfighters and those who support them. That integration imperative leads directly to the two key parts of this chapter: how to perform that integration, and assessing how well we have met operational needs.

Keywords

Machine Translation Automatic Speech Recognition Word Error Rate Prosodic Feature Sentence Boundary 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Notes

Acknowledgements

Since 2003, David Herrington of DoD/CTTSO/TSWG has been providing valuable resources and guidance. We also would like to acknowledge and thank all of the users - the early adopters- as well as the researchers and developers on the team who have made this DARPA/TSWG leveraging a success story. We would also like to thank Charles Wayne for his early contributions to the pre-GALE efforts.

The authors thank James Allan, Bob Armstrong, Shilpa Arora, Manuel Bardea, Oliver Bender, Eric Brown, Jaroslaw Cwiklik, Bret Ehlert, Gopi Flaherty, Martin Franz, Andre Gauthier, Isaac Harris, Abe Ittycheriah, Nanda Kambhatla, Francis Kubala, Daben Liu, Evgeny Matusov, Kathy McKeown, Justin Merrill, Uma Murthy, Udhyakumar Nallasamy, Eric Nyberg, Leiming Qian, Jerome Quinn, Ganesh Ramaswamy, Eric Riebling, George Saon, Barry Schiffman, Jean Senellart, Sergey Sigelman, Olivier Siohan, Sara Rosenthal, David Svoboda, Alan Watkins, William Wong, and Jian-Ming Xu for their contributions to IOD.

The authors thank Shilpa Arora, Eric Brown, Luis Chavez, Yurdaer Dogonata, David Ferrucci, Tong-Haing Fin, Lei Fong, Greg Hanneman, Kimberly Kettner, Nanda Khambatla, Alon Lavie, Humberto Lezama, Justin Merrill, James Rankin, Hideki Shima, and Django Wexler for their contributions to the UCR and/or UCC.

References

  1. Ahn, J., P. Brusilovsky, D. He, J. Grady, and Q. Li. 2008. Personalized Web Exploration with Task Models. Proceedings of the International World Wide Web Conference. Beijing, China.Google Scholar
  2. Allan, J., ed. 2002. Topic Detection and Tracking: Event-based Information Organization. Kluwer Academic Publishers.Google Scholar
  3. Allan, J. 2003. HARD Track Overview in TREC 2003 High Accuracy Retrieval from Documents. Proceedings of the 12th Text Retrieval Conference. Gaithersburg, MD.Google Scholar
  4. Bemish, N. 2008. Broadcast Monitoring Tools Open Doors to Other Cultures, Information. Communiqué - A Defense Intelligence Agency Publication 21(2).Google Scholar
  5. Billa, J., M. Noamany, A. Srivastava, D. Liu, R. Stone, J. Xu, J. Makhoul, and F. Kubala. 2002. Audio Indexing of Arabic Broadcast News. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing pp. 15-18. Orlando, FL.Google Scholar
  6. Chandra, A., D. Kozen, and L. Stockmeyer. 1981. Alternation. Journal of the Association for Computing Machinery 28(1)pp. 114-133.MathSciNetzbMATHGoogle Scholar
  7. Colbath, S., F. Kubala, D. Liu, and A. Srivastava. 2000. Spoken Documents: Creating Searchable Archives from Continuous Audio. Proceedings of the 33rd Hawaii International Conference on System Sciences vol.3. Maui, HI.Google Scholar
  8. Collins, M. 2002. Discriminative Training Methods For Hidden Markov Models: Theory And Experiments With Perceptron Algorithms. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) 2002 pp.1–8. Philadelphia, PA.Google Scholar
  9. Ferrucci, D. and A. Lally. 2004. UIMA: An Architectural Approach to Unstructured Information Processing in the Corporate Research Environment. Natural Language Engineering 10(3-4): pp. 327–348.CrossRefGoogle Scholar
  10. Fiscus, J. G. 1997. A Post-Processing System to Yield Reduced Word Error Rates: Recognizer Output Voting Error Reduction (ROVER). Proceedings of the Institute of Electrical and Electronics Engineers Automatic Speech Recognition and Understanding Workshop pp. 347–354. Santa Barbara, CA.Google Scholar
  11. Gilmore, G. J. 2007. Modern Missions Rely on Languages, Cultural Awareness, Official Says. American Forces Press Service, April 27.Google Scholar
  12. Gonzalo, J, and D. W. Oard. 2004. iCLEF 2004 Track Overview: Pilot Experiments in Interactive Cross-Language Question Answering. Proceedings of the Multilingual Information Access for Text, Speech, and Images, Fifth Workshop of the Cross-Language Evaluation Forum pp. 310-322. Revised Selected Papers Series, Springer-Verlag, LNCS (3491) pp. 310-322. Bath, UK.Google Scholar
  13. He, D., and D. Demner-Fushman. 2003. HARD Experiment at Maryland: from Need Negotiation to Automated HARD Process. Proceedings of the 12 th Text Retrieval Conference pp. 707-714. Gaithersburg, MD.Google Scholar
  14. He, D., P. Brusilovsky, J. Ahn., J. Grady, R. Farzan, Y. Peng, Y. Yang, and M. Rogati. 2008. An Evaluation of Adaptive Filtering in the Context of Realistic Task-Based Information Exploration. Information Processing and Management 44(2): pp. 511-533.CrossRefGoogle Scholar
  15. He, D., P. Brusiloviksy, J. Grady, Q. Li, and J-W. Ahn. 2007. How Up-to-date Should It Be? The Value of Instant Profiling and Adaptation in Information Filtering. Proceedings of the International Conference on Web Intelligence pp.699-705. Silicon Valley, CA.Google Scholar
  16. He, D., P. Brusilovsky, J-W. Ahn, J. Grady, R. Farzan, Y. Peng, Y. Yang, and M. Rogati. 2008. An Evaluation of Adaptive Filtering in the Context of Realistic Task-Based Information Exploration. Information Processing and Management 44(2): pp. 511-533.CrossRefGoogle Scholar
  17. Huang, J., and G. Zweig. 2002. Maximum Entropy Model for Punctuation Annotation from Speech. Proceedings of the International Conference on Speech and Language Processing( ICSLP) pp. 917-920.Google Scholar
  18. Jayaraman, S., and A. Lavie. 2005. Multi-engine Machine Translation Guided by Explicit Word Matching. Proceedings of the 10th Annual Conference of the European Association for Machine Translation pp.143–152. Budapest, Hungary.Google Scholar
  19. Jones, D. and W. Shen. 2006. Two New Experiments for ILR-Based MT Evaluation. Proceedings of the Association for Machine Translation in the Americas. Boston, MA.Google Scholar
  20. Kubala, F., S. Colbath, D. Liu, A. Srivastava, and J. Makhoul. 2000. Integrated Technologies for Indexing Spoken Language. Communications of the Association for Computing Machinery 43: pp. 48–56.Google Scholar
  21. Kubala, F., S. Colbath, D. Liu, and J. Makhoul. 1999. Rough'n'Ready: a Meeting Recorder and Browser. Association for Computing Machinery Computing Surveys 31(2es).Google Scholar
  22. Liu, D., and F. Kubala. 1999. Fast Speaker Change Detection for Broadcast News Transcription and Indexing. Proceedings of the 6 th European Conference on Speech Communication and Technology (EuroSpeech) 3: pp. 1031-1034.Google Scholar
  23. Liu, D., D. Kiecza, A. Srivastava, and F. Kubala. 2005. Online Speaker Adaptation and Tracking for Real-Time Speech Recognition. Proceedings of the International Speech Communication Association Conference (Interspeech) pp. 281-284.Google Scholar
  24. Liu, D., J. Ma, D. Xu, A.Srivastava, and F. Kubala. 2002. Real-Time Rich-Content Transcription Of Chinese Broadcast News. Proceedings of the International Conference on Speech and Language Processing (ICSLP) pp. 1981-1984.Google Scholar
  25. Liu, Y., E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, B. Peskin, and M. Harper. 2004. The ICSI-SRIUW Metadata Extraction System. Proceedings of the International Conference on Spoken Language Processing pp. 577-580. Jeju Island, Korea.Google Scholar
  26. Makhoul, J., F. Kubala, F., T. Leek, D. Liu, L. Nguyen, R. Schwartz, and A. Srivastava. 2000. Speech and Language Technologies For Audio Indexing. Proceedings of the IEEE 88: pp. 1338–1353.Google Scholar
  27. Makhoul, J., F. Kubala, R. Schwartz, and R. Weischedel. 1999. Performance Measures for Information Extraction. Proceedings of the DARPA Broadcast News Workshop pp. 249-252. Herndon, VA.Google Scholar
  28. Marchionini, G., and B. Shneiderman. 1988. Finding Facts Vs. Browsing Knowledge in Hypertext Systems. IEEE Computer 21(1): pp. 70-79.Google Scholar
  29. Matusov, E., A. Mauser, and H. Ney. 2006. Automatic Sentence Segmentation and Punctuation Prediction for Spoken Language Translation. Proceedings of the International Workshop on Spoken Language Translation (IWSLT) pp. 158-165. Kyoto, Japan.Google Scholar
  30. Murthy, U., J. Pitrelli, G. Ramaswamy, M. Franz, and B. Lewis. 2008. A Methodology and Tool Suite for Evaluation of Accuracy of Interoperating Statistical Natural Language Processing Engines. Proceedings of The International Speech Communication Association Conference (Interspeech) pp. 2066–2069. Brisbane, Australia.Google Scholar
  31. Nyberg, E., E. Riebling, R. C. Wang, and R. Frederking. 2008. Integrating a Natural Language Message Pre-processor with UIMA. Proceedings of LREC Workshop on Enhanced Operability for Large HLT Systems. Marrakech, Morocco.Google Scholar
  32. Pirolli, P., and S. Card. 2005. The Sensemaking Process and Leverage Points for Analyst Technology as Identified through Cognitive Task Analysis. Proceedings of the International Conference on Intelligence Analysis. McLean, VA.Google Scholar
  33. Pitrelli, J., B. Lewis, E. Epstein, D. Kiecza, M. Franz, J. Quinn, G. Ramaswamy, A. Srivastava, and P. Virga. 2008a. Aggregating Distributed STT, MT, and Information Extraction Engines: The GALE Interoperability Demo System. Proceedings of The International Speech Communication Association Conference (INTERSPEECH) pp. 2743–2746. Brisbane, Australia.Google Scholar
  34. Pitrelli, J., B. Lewis, E. Epstein, J. Quinn, and G. Ramaswamy. 2008b. A Data Format Enabling Interoperation of Speech Recognition, Translation, and Information Extraction Engines: The GALE Type System. Proceedings of The International Speech Communication Association Conference (Interspeech) pp. 1654–1657. Brisbane, Australia.Google Scholar
  35. Rosti, A., N. Ayan, B. Xiang, S. Matsoukas, R. Schwartz, and B. Dorr. 2007. Combining Outputs from Multiple Machine Translation Systems. Proceedings of The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) pp. 228–235. Rochester, NY.Google Scholar
  36. Shepard, S., S. Colbath, K. Egan, and F. Kubala. 2002. Oasis Translator's Aide. Proceedings of the Second international Conference on Human Language Technology Research pp. 389-390. San Diego, CA.Google Scholar
  37. Srivastava, A., and F. Kubala. 2003. Sentence Boundary Detection in Arabic Speech. Proceedings of the European Conference on Speech Communication and Technology (EuroSpeech) pp. 949-952. Geneva, Switzerland.Google Scholar
  38. Stolcke, A. and E. Shriberg. 1996. Automatic Linguistic Segmentation of Conversational Speech. Proceedings of the International Conference on Spoken Language Processing pp. 1005–1008. Philadelphia, PA.Google Scholar
  39. Stolcke, A., E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche, M., G. Tur, and Y. Lu. 1998. Automatic Detection of Sentence Boundaries And Disfluencies Based On Recognized Words. Proceedings of International Conference on Spoken Language Processing (ICLP) pp. 2247-2250. Sydney, Australia.Google Scholar
  40. White, R. W., G. Muresan, and G. Marchionini. 2006. Evaluating Exploratory Search Systems. Proceedings of the Association for Computing Machinery's Special Interest Group on Information Retrieval Conference. Seattle, WA.Google Scholar
  41. Yang, Y., A. Lad., N. Lao, A. Harpale, B. Kisiel, M. Rogati. 2007. Utility-based Information Distillation Over Temporally Sequenced Documents. Proceedings of the International Association for Computing Machinery Special Interest Group on Information Retrieval Conference on Research and Development in Information Retrieval. pp. 31-38. Amsterdam, Netherlands.Google Scholar
  42. Zhang, P., E. Plettenberg, J. Klavans, D. Oard, and D. Soergel. 2007. Task-based Interaction with an Integrated Multilingual, Multimedia Information System: A Formative Evaluation. Proceedings of the Joint Conference on Digital Libraries. Vancouver, Canada.Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  • Douglas W. Oard
    • 1
  • Carl Madson
    • 2
  • Joseph Olive
    • 3
  • John McCary
    • 3
  • Caitlin Christianson
    • 3
  1. 1.University of MarylandCollege ParkUSA
  2. 2.SRI InternationalMenlo ParkUSA
  3. 3.Defense Advanced Research Projects AgencyArlingtonUSA

Personalised recommendations