# Spearman–Brown prediction formula

The **Spearman–Brown prediction formula**, also known as the **Spearman–Brown prophecy formula**, is a formula relating psychometric reliability to test length and used by psychometricians to predict the reliability of a test after changing the test length.^{[1]} The method was published independently by Spearman (1910) and Brown (1910).^{[2]}^{[3]}

## Calculation

Predicted reliability, , is estimated as:

where *N* is the number of "tests" combined (see below) and is the reliability of the current "test". The formula predicts the reliability of a new test composed by replicating the current test *N* times (or, equivalently, creating a test with *N* parallel forms of the current exam). Thus *N* = 2 implies doubling the exam length by adding items with the same properties as those in the current exam. Values of *N* less than one may be used to predict the effect of shortening a test.

## Forecasting test length

The formula can also be rearranged to predict the number of replications required to achieve a degree of reliability:

This formula is commonly used by psychometricians to predict the
reliability of a test after changing the test length. This relationship is
particularly vital to the split-half and related methods of estimating
reliability (where this method is sometimes known as the "Step Up" formula).^{[4]}

The formula is also helpful in understanding the nonlinear relationship between test reliability and test length. Test length must grow by increasingly larger values as the desired reliability approaches 1.0.

If the longer/shorter test is not parallel to the current test, then the prediction will not be strictly accurate. For example, if a highly reliable test was lengthened by adding many poor items then the achieved reliability will probably be much lower than that predicted by this formula.

For the reliability of a two-item test, the formula is more appropriate than Cronbach's alpha.^{[5]}

Item response theory *item information* provides a much more precise means of predicting changes in the quality of measurement by adding or removing individual items.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}

## Citations

- ↑ {{#invoke:citation/CS1|citation |CitationClass=book }}
- ↑ Stanley, J. (1971). Reliability. In R. L. Thorndike (Ed.),
*Educational Measurement*. Second edition. Washington, DC: American Council on Education - ↑ Wainer, H., & Thissen, D. (2001). True score theory: The traditional method. In H. Wainer and D. Thissen, (Eds.),
*Test Scoring*. Mahwah, NJ:Lawrence Erlbaum - ↑ Stanley, J. (1971). Reliability. In R. L. Thorndike (Ed.),
*Educational Measurement*. Second edition. Washington, DC: American Council on Education - ↑ {{#invoke:Citation/CS1|citation |CitationClass=journal }}

## References

- Spearman, Charles, C. (1910). Correlation calculated from faulty data.
*British Journal of Psychology, 3*, 271–295. - Brown, W. (1910). Some experimental results in the correlation of mental abilities.
*British Journal of Psychology, 3*, 296–322.