Several polymorphisms of human DNA have been shown to be hypervariable due to the recurrence of a variable number of tandem repeats (VNTRs) in the lengths of allelic restriction fragments. The recurrence of allelic variants in this novel class of polymorphisms seems to comply well with a model of continuous random variables. Based on this assumption, we have compiled some simple algorithms for classification of continuous data and estimation of classes of relative frequencies and have implemented these routines for the management of databases storing hypervariable single locus DNA genetic systems. The algorithms are compiled in BASIC language and can be incorporated in task-oriented computer programs. Three procedures are discussed, based in turn on: (a) using predetermined, arbitrary classes; (b) point estimations of frequencies for single fragments using error measurements associated with the kilobase value assignment; (c) estimates of phenotype frequencies according to error measurements. Error measurements are obtained from a statistic of values pertaining to several restriction fragments (genomic controls) repeatedly tested in different experiments. Problems related to these approaches are discussed.
- Gene Frequency
- Polymorphism, Restriction Fragment Length
- Repetitive Sequences, Nucleic Acid
- Restriction Mapping