Variety Generation involves the selection of sets of character strings, or symbols, which are intended to occur with equal probabilities in bodies of text or sets of text units from a particular source. It is important that the sample used to generate the symbol set be representative of the data with which the set will be used. An assessment is given here of the amount of variation in symbol sets generated from files of titles and author names from BNB MARC data over a five-year period, and a comparison is made with LC MARC. Some of the BNB symbol sets are compared directly, and equifrequency statistics are obtained for the assignment of each symbol set to each file. The differences between the equifrequency statistics are examined by means of an analysis of variance technique.
