Advances in Knowledge Discovery and Data Mining

Volume 5476 of the series Lecture Notes in Computer Science pp 541-547

A Hybrid Approach to Improve Bilingual Multiword Expression Extraction

  • Jianyong DuanAffiliated withCollege of Information Engineering
  • , Mei ZhangAffiliated withCollege of Art Design, North China University of Technology
  • , Lijing TongAffiliated withCollege of Information Engineering
  • , Feng GuoAffiliated withCollege of Information Engineering

* Final gross prices may vary according to local VAT.

Get Access


We propose a hybrid approach for bilingual multiword expression extraction. There are two phases in the extraction process. In the first phase, lots of candidates are extracted from the corpus by statistic methods. The algorithm of multiple sequence alignment is sensitive to the flexible multiword. In the second phase, error-driven rules and patterns are extracted from corpus. These trained rules are used to filter the candidates. Some related experiments are designed for achieving the best performance because there are lots of parameters in this system. Experimental results showed our approach gains good performance.