Skip to main content

Table 3 Statistics of named entities

From: Accelerating the annotation of sparse named entities by dynamic sentence selection

  # Entities Sentences (%)
CoNLL: LOC 7,140 5,127 (36.5%)
CoNLL: MISC 3,438 2,698 (19.2%)
CoNLL: ORG 6,321 4,587 (32.7%)
CoNLL: PER 6,600 4,373 (31.1%)
GENIA: DNA 2,017 5,251 (28.3%)
GENIA: RNA 225 810 (4.4%)
GENIA: cell_line 835 2,880 (15.5%)
GENIA: cell_type 1,104 5,212 (28.1%)
GENIA: protein 5,272 13,040 (70.3%)