Article: Collating Texts Using Progressive Multiple Alignment
- Cite this article as:
- Spencer, M. & Howe, C. Computers and the Humanities (2004) 38: 253. doi:10.1007/s10579-004-8682-1
To reconstruct a stemma or do any other kind of statistical analysis of a text tradition, one needs accurate data on the variants occurring at each location in each witness. These data are usually obtained from computer collation programs. Existing programs either collate every witness against a base text or divide all texts up into segments as long as the longest variant phrase at each point. These methods do not give ideal data for stemma reconstruction. We describe a better collation algorithm (progressive multiple alignment) that collates all witnesses word by word without a base text, adding groups of witnesses one at a time, starting with the most closely related pair.
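To make the idea concrete, the following is a minimal sketch of word-level progressive alignment, not the authors' implementation. Several things in it are assumptions made for illustration: the function names `align` and `collate`, the unit gap and mismatch costs, the use of pairwise alignment cost as the measure of relatedness, and the simplification of adding one witness at a time (nearest to any witness already aligned) instead of merging groups of witnesses. It also assumes at least two witnesses, none of them empty.

```python
from itertools import combinations

GAP = None  # marks a location where a witness has no word


def align(a, b, gap=1.0):
    """Needleman-Wunsch over two alignments.

    Each alignment is a list of columns; a column is a tuple holding one
    word (or GAP) per witness. Returns (cost, merged columns).
    Costs are illustrative assumptions, not values from the article.
    """
    n, m = len(a), len(b)
    wa, wb = len(a[0]), len(b[0])  # number of witnesses on each side

    def cost(ca, cb):
        # Fraction of cross-witness word pairs that disagree.
        return sum(x != y for x in ca for y in cb) / (wa * wb)

    d = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = d[i - 1][0] + gap
    for j in range(1, m + 1):
        d[0][j] = d[0][j - 1] + gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d[i][j] = min(d[i - 1][j - 1] + cost(a[i - 1], b[j - 1]),
                          d[i - 1][j] + gap,
                          d[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover the columns.
    cols, i, j = [], n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and d[i][j] == d[i - 1][j - 1] + cost(a[i - 1], b[j - 1]):
            cols.append(a[i - 1] + b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and d[i][j] == d[i - 1][j] + gap:
            cols.append(a[i - 1] + (GAP,) * wb)
            i -= 1
        else:
            cols.append((GAP,) * wa + b[j - 1])
            j -= 1
    cols.reverse()
    return d[n][m], cols


def collate(witnesses):
    """Collate a dict of siglum -> text without a base text.

    Returns (order, columns), where each column lists the words of the
    witnesses in the order they were added.
    """
    sigla = list(witnesses)
    profiles = {s: [(w,) for w in witnesses[s].split()] for s in sigla}

    # Pairwise alignment cost serves as a rough distance between witnesses.
    dist = {}
    for s, t in combinations(sigla, 2):
        dist[s, t] = dist[t, s] = align(profiles[s], profiles[t])[0]

    # Start with the most closely related pair.
    s, t = min(combinations(sigla, 2), key=lambda p: dist[p])
    order = [s, t]
    _, cols = align(profiles[s], profiles[t])

    # Then add the remaining witness nearest to any already aligned one.
    remaining = [x for x in sigla if x not in order]
    while remaining:
        nxt = min(remaining, key=lambda r: min(dist[r, o] for o in order))
        _, cols = align(cols, profiles[nxt])
        order.append(nxt)
        remaining.remove(nxt)
    return order, cols


# Hypothetical witnesses, invented for this example.
WITNESSES = {
    "A": "the quick brown fox jumps",
    "B": "the quick fox jumps",
    "C": "a quick brown fox leaps",
}
order, table = collate(WITNESSES)
print(order)
for column in table:
    print(column)
```

Each printed column is one location in the tradition, with one entry per witness and `None` where a witness omits the word, which is the kind of word-by-word variant data the abstract describes as input for stemma reconstruction. A fuller treatment would merge groups of witnesses along a guide tree and would normalise spelling and punctuation before comparing words.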