The mth longest runs of multivariate random sequences

Article
  • 89 Downloads

Abstract

The distributions of the mth longest runs of multivariate random sequences are considered. For random sequences made up of k kinds of letters, the lengths of the runs are sorted in two ways to give two definitions of run length ordering. In one definition, the lengths of the runs are sorted separately for each letter type. In the second definition, the lengths of all the runs are sorted together. Exact formulas are developed for the distributions of the mth longest runs for both definitions. The derivations are based on a two-step method that is applicable to various other runs-related distributions, such as joint distributions of several letter types and multiple run lengths of a single letter type.

Keywords

Generating function Combinatorial identities Randomness test Distribution-free statistical test Runs length test Biological sequence analysis 

Copyright information

© The Institute of Statistical Mathematics, Tokyo 2016

Authors and Affiliations

  1. 1.Department of Molecular Biophysics and Biochemistry, W.M. Keck Foundation Biotechnology Resource LaboratoryYale UniversityNew HavenUSA

Personalised recommendations