NERC-fr: Supervised Named Entity Recognition for French

  • Andoni Azpeitia
  • Montse Cuadros
  • Seán Gaines
  • German Rigau
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8655)


Currently there are only few available language resources for French. Additionally there is a lack of available language models for for tasks such as Named Entity Recognition and Classification (NERC) which makes difficult building natural language processing systems for this language. This paper presents a new publicly available supervised Apache OpenNLP NERC model that has been trained and tested under a maximum entropy approach. This new model achieves state of the art results for French when compared with another systems. Finally we have also extended Apache OpenNLP libraries to support part-of-speech feature extraction component which has been used for our experiments.


Computational Linguistics Entity Recognition Automatic Speech Recognition System Lexical Resource Sentence Boundary 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Abeillé, A., Clément, L., Toussenel, F.: Building a treebank for french. In: Treebanks, pp. 165–187. Springer (2003)Google Scholar
  2. 2.
    Appelt, D.E., Hobbs, J.R., Bear, J., Israel, D., Kameyama, M., Martin, D., Myers, K., Tyson, M.: Sri international fastus system: Muc-6 test results and analysis. In: Proceedings of the 6th Conference on Message Understanding, pp. 237–248. Association for Computational Linguistics (1995)Google Scholar
  3. 3.
    Bikel, D.M., Miller, S., Schwartz, R., Weischedel, R.: Nymble: A high performance learning name-finder. In: Proceedings of the 5th Conference on Applied Natural Language Processing, ANLP, Washington DC (1997)Google Scholar
  4. 4.
    Borthwick, A.: A maximum entropy approach to named entity recognition. Ph.D. thesis, New York University (1999)Google Scholar
  5. 5.
    Budi, I., Bressan, S.: Association rules mining for name entity recognition (2003)Google Scholar
  6. 6.
    Ekbal, A., Bandyopadhyay, S.: Named entity recognition using support vector machine: A language independent approach. International Journal of Computer Systems Science & Engineering 4(2) (2008)Google Scholar
  7. 7.
    Favre, B., Béchet, F., Nocéra, P.: Robust named entity extraction from large spoken archives. In: Proceedings of HLT-EMNLP, pp. 491–498. Association for Computational Linguistics (2005)Google Scholar
  8. 8.
    Mikheev, A., Moens, M., Grover, C.: Named entity recognition without gazetteers. In: Proceedings of the 9th EACL, pp. 1–8 (1999)Google Scholar
  9. 9.
    Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)CrossRefGoogle Scholar
  10. 10.
    Nothman, J., Ringland, N., Radford, W., Murphy, T., Curran, J.R.: Learning multilingual named entity recognition from wikipedia. Artificial Intelligence 194, 151–175 (2013)CrossRefzbMATHMathSciNetGoogle Scholar
  11. 11.
    Petasis, G., Vichot, F., Wolinski, F., Paliouras, G., Karkaletsis, V., Spyropoulos, C.D.: Using machine learning to maintain rule-based named-entity recognition and classification systems. In: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, pp. 426–433. Association for Computational Linguistics (2001)Google Scholar
  12. 12.
    Poibeau, T.: The multilingual named entity recognition framework. In: Proceedings of the tenth conference on European Chapter of the Association for Computational Linguistics, vol. 2, pp. 155–158. Association for Computational Linguistics (2003)Google Scholar
  13. 13.
    Richman, A.E., Schone, P.: Mining wiki resources for multilingual named entity recognition. In: ACL, pp. 1–9 (2008)Google Scholar
  14. 14.
    Sekine, S.: Nyu: Description of the japanese NE system used for met-2. In: Proc. Message Understanding Conference (1998)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Andoni Azpeitia
    • 1
  • Montse Cuadros
    • 1
  • Seán Gaines
    • 1
  • German Rigau
    • 2
  1. 1.HSLT, IP Department - Vicomtech-IK4Donostia-San SebastiánSpain
  2. 2.IXA NLP Group - UPV/EHUDonostia-San SebastiánSpain

Personalised recommendations