L-notation: Difference between revisions
en>ChrisGualtieri m TypoScan Project / General Fixes, typos fixed: eg. → e.g. using AWB |
No edit summary |
||
Line 1: | Line 1: | ||
{{For|loudspeakers|mid-range speaker}} | |||
In [[statistics]], the '''mid-range''' or '''mid-extreme''' of a set of statistical data values is the [[arithmetic mean]] of the maximum and minimum values in a [[data set]], defined as:{{sfn|Dodge|2003}} | |||
:<math>M=\frac{\max x + \min x}{2}.</math> | |||
The mid-range is the midpoint of the [[Range (statistics)|range]]; as such, it is a measure of [[central tendency]]. | |||
The mid-range is rarely used in practical statistical analysis, as it lacks [[#Efficiency|efficiency]] as an estimator for most distributions of interest, because it ignores all intermediate points, and lacks [[#Robustness|robustness]], as outliers change it significantly. Indeed, it is one of the least efficient and least robust statistics. However, it finds some use in special cases: it is the maximally efficient estimator for the center of a uniform distribution, trimmed mid-ranges address robustness, and as an [[L-estimator]], it is simple to understand and compute. | |||
==Comparison with other measures== | |||
===Robustness=== | |||
The midrange is highly sensitive to outliers and ignores all but two data points. It is therefore a very non-[[robust statistic]], having a [[breakdown point]] of 0, meaning that a single observation can change it arbitrarily. Further, it is highly influenced by outliers: increasing the sample maximum or decreasing the sample minimum by ''x'' changes the mid-range by <math>x/2,</math> while it changes the sample mean, which also has breakdown point of 0, by only <math>x/n.</math> It is thus of little use in practical statistics, unless outliers are already handled. | |||
A [[trimmed estimator|trimmed]] midrange is known as a '''{{visible anchor|midsummary}}''' – the ''n''% trimmed midrange is the average of the ''n''% and (100−''n'')% percentiles, and is more robust, having a [[breakdown point]] of ''n''%. In the middle of these is the [[midhinge]], which is the 25% midsummary. The [[median]] can be interpreted as the fully trimmed (50%) mid-range; this accords with the convention that the median of an even number of points is the mean of the two middle points. | |||
These trimmed midranges are also of interest as [[descriptive statistics]] or as [[L-estimator]]s of central location or [[skewness]]: differences of midsummaries, such as midhinge minus the median, give measures of skewness at different points in the tail.{{sfn|Velleman|Hoaglin|1981}} | |||
===Efficiency=== | |||
Despite its drawbacks, in some cases it is useful: the midrange is a highly [[Efficiency (statistics)|efficient]] [[estimator]] of μ, given a small sample of a sufficiently [[platykurtic]] distribution, but it is inefficient for mesokurtic distributions, such as the normal. | |||
For example, for a [[continuous uniform distribution]] with unknown maximum and minimum, the mid-range is the [[UMVU]] estimator for the mean. The [[sample maximum]] and sample minimum, together with sample size, are a sufficient statistic for the population maximum and minimum – the distribution of other samples, conditional on a given maximum and minimum, is just the uniform distribution between the maximum and minimum and thus add no information. See [[German tank problem]] for further discussion. Thus the mid-range, which is an unbiased and sufficient estimator of the population mean, is in fact the UMVU: using the sample mean just adds noise based on the uninformative distribution of points within this range. | |||
Conversely, for the normal distribution, the sample mean is the UMVU estimator of the mean. Thus for platykurtic distributions, which can often be thought of as between a uniform distribution and a normal distribution, the informativeness of the middle sample points versus the extrema values varies from "equal" for normal to "uninformative" for uniform, and for different distributions, one or the other (or some combination thereof) may be most efficient. A robust analog is the [[trimean]], which averages the midhinge (25% trimmed mid-range) and median. | |||
====Small samples==== | |||
For small sample sizes (''n'' from 4 to 20) drawn from a sufficiently platykurtic distribution (negative [[excess kurtosis]], defined as γ<sub>2</sub> = (μ<sub>4</sub>/(μ<sub>2</sub>)²) − 3), the mid-range is an efficient estimator of the mean ''μ''. The following table summarizes empirical data comparing three estimators of the mean for distributions of varied kurtosis; the [[modified mean]] is the [[truncated mean]], where the maximum and minimum are eliminated.<ref>{{Cite thesis |type=Master's |title=An Investigation of Measures of Central Tendency Used in Quality Control |url=http://books.google.co.jp/books/about/An_Investigation_of_Measures_of_Central.html?id=GNtiNwAACAAJ |author= |last=Vinson |first=William Daniel |year=1951 |publisher= University of North Carolina at Chapel Hill |at=Table (4.1), pp. 32–34}}</ref><ref>{{cite book |title=Statistical methods in quality control |first=Dudley Johnstone |last=Cowden |publisher=Prentice-Hall |year=1957 |pages=[http://books.google.co.jp/books?ei=DXFqUf7hFIiiige4lICIAg&id=b-BTAAAAMAAJ&dq=William+D.+Vinson+statistics&q=William+D.+Vinson#search_anchor 67–68]}}</ref> | |||
{| class="wikitable" | |||
! Excess kurtosis (γ<sub>2</sub>) !! Most efficient estimator of ''μ'' | |||
|- | |||
| −1.2 to −0.8 || Midrange | |||
|- | |||
| −0.8 to 2.0 || Mean | |||
|- | |||
| 2.0 to 6.0 || Modified mean | |||
|} | |||
For ''n'' = 1 or 2, the midrange and the mean are equal (and coincide with the median), and are most efficient for all distributions. For ''n'' = 3, the modified mean is the median, and instead the mean is the most efficient measure of central tendency for values of ''γ''<sub>2</sub> from 2.0 to 6.0 as well as from −0.8 to 2.0. | |||
==Sampling properties== | |||
For a sample of size ''n'' from the [[standard normal distribution]], the mid-range ''M'' is unbiased, and has a variance given by:{{sfn|Kendall|Stuart|1969|loc=Example 14.4}} | |||
:<math>\operatorname{var}(M)=\frac{\pi^2}{24 \ln(n)}.</math> | |||
For a sample of size ''n'' from the standard [[Laplace distribution]], the mid-range ''M'' is unbiased, and has a variance given by:{{sfn|Kendall|Stuart|1969|loc=Example 14.5}} | |||
:<math>\operatorname{var}(M)=\frac{\pi^2}{12}</math> | |||
and, in particular, the variance does not decrease to zero as the sample size grows. | |||
For a sample of size ''n'' from a zero-centred [[Uniform distribution (continuous)|uniform distribution]], the mid-range ''M'' is unbiased, ''nM'' has an [[asymptotic distribution]] which is a [[Laplace distribution]].{{sfn|Kendall|Stuart|1969|loc=Example 14.12}} | |||
==Deviation== | |||
While the mean of a set of values minimizes the sum of squares of [[Deviation (statistics)|deviations]] and the [[median]] minimizes the [[average absolute deviation]], the midrange minimizes the [[maximum deviation]] (defined as <math>\max\left|x_i-m\right|</math>): it is a solution to a variational problem. | |||
==See also== | |||
* [[Range (statistics)]] | |||
* [[Midhinge]] | |||
==References== | |||
{{reflist|2}} | |||
{{refbegin}} | |||
* {{cite isbn|0199206139}} | |||
* {{cite isbn|0852641419}} | |||
* {{cite isbn|087150409X}} | |||
{{refend}} | |||
{{DEFAULTSORT:Mid-Range}} | |||
[[Category:Means]] | |||
[[Category:Summary statistics]] |
Revision as of 22:25, 24 June 2013
28 year-old Painting Investments Worker Truman from Regina, usually spends time with pastimes for instance interior design, property developers in new launch ec Singapore and writing. Last month just traveled to City of the Renaissance. In statistics, the mid-range or mid-extreme of a set of statistical data values is the arithmetic mean of the maximum and minimum values in a data set, defined as:Template:Sfn
The mid-range is the midpoint of the range; as such, it is a measure of central tendency.
The mid-range is rarely used in practical statistical analysis, as it lacks efficiency as an estimator for most distributions of interest, because it ignores all intermediate points, and lacks robustness, as outliers change it significantly. Indeed, it is one of the least efficient and least robust statistics. However, it finds some use in special cases: it is the maximally efficient estimator for the center of a uniform distribution, trimmed mid-ranges address robustness, and as an L-estimator, it is simple to understand and compute.
Comparison with other measures
Robustness
The midrange is highly sensitive to outliers and ignores all but two data points. It is therefore a very non-robust statistic, having a breakdown point of 0, meaning that a single observation can change it arbitrarily. Further, it is highly influenced by outliers: increasing the sample maximum or decreasing the sample minimum by x changes the mid-range by while it changes the sample mean, which also has breakdown point of 0, by only It is thus of little use in practical statistics, unless outliers are already handled.
A trimmed midrange is known as a Template:Visible anchor – the n% trimmed midrange is the average of the n% and (100−n)% percentiles, and is more robust, having a breakdown point of n%. In the middle of these is the midhinge, which is the 25% midsummary. The median can be interpreted as the fully trimmed (50%) mid-range; this accords with the convention that the median of an even number of points is the mean of the two middle points.
These trimmed midranges are also of interest as descriptive statistics or as L-estimators of central location or skewness: differences of midsummaries, such as midhinge minus the median, give measures of skewness at different points in the tail.Template:Sfn
Efficiency
Despite its drawbacks, in some cases it is useful: the midrange is a highly efficient estimator of μ, given a small sample of a sufficiently platykurtic distribution, but it is inefficient for mesokurtic distributions, such as the normal.
For example, for a continuous uniform distribution with unknown maximum and minimum, the mid-range is the UMVU estimator for the mean. The sample maximum and sample minimum, together with sample size, are a sufficient statistic for the population maximum and minimum – the distribution of other samples, conditional on a given maximum and minimum, is just the uniform distribution between the maximum and minimum and thus add no information. See German tank problem for further discussion. Thus the mid-range, which is an unbiased and sufficient estimator of the population mean, is in fact the UMVU: using the sample mean just adds noise based on the uninformative distribution of points within this range.
Conversely, for the normal distribution, the sample mean is the UMVU estimator of the mean. Thus for platykurtic distributions, which can often be thought of as between a uniform distribution and a normal distribution, the informativeness of the middle sample points versus the extrema values varies from "equal" for normal to "uninformative" for uniform, and for different distributions, one or the other (or some combination thereof) may be most efficient. A robust analog is the trimean, which averages the midhinge (25% trimmed mid-range) and median.
Small samples
For small sample sizes (n from 4 to 20) drawn from a sufficiently platykurtic distribution (negative excess kurtosis, defined as γ2 = (μ4/(μ2)²) − 3), the mid-range is an efficient estimator of the mean μ. The following table summarizes empirical data comparing three estimators of the mean for distributions of varied kurtosis; the modified mean is the truncated mean, where the maximum and minimum are eliminated.[1][2]
Excess kurtosis (γ2) | Most efficient estimator of μ |
---|---|
−1.2 to −0.8 | Midrange |
−0.8 to 2.0 | Mean |
2.0 to 6.0 | Modified mean |
For n = 1 or 2, the midrange and the mean are equal (and coincide with the median), and are most efficient for all distributions. For n = 3, the modified mean is the median, and instead the mean is the most efficient measure of central tendency for values of γ2 from 2.0 to 6.0 as well as from −0.8 to 2.0.
Sampling properties
For a sample of size n from the standard normal distribution, the mid-range M is unbiased, and has a variance given by:Template:Sfn
For a sample of size n from the standard Laplace distribution, the mid-range M is unbiased, and has a variance given by:Template:Sfn
and, in particular, the variance does not decrease to zero as the sample size grows.
For a sample of size n from a zero-centred uniform distribution, the mid-range M is unbiased, nM has an asymptotic distribution which is a Laplace distribution.Template:Sfn
Deviation
While the mean of a set of values minimizes the sum of squares of deviations and the median minimizes the average absolute deviation, the midrange minimizes the maximum deviation (defined as ): it is a solution to a variational problem.
See also
References
43 year old Petroleum Engineer Harry from Deep River, usually spends time with hobbies and interests like renting movies, property developers in singapore new condominium and vehicle racing. Constantly enjoys going to destinations like Camino Real de Tierra Adentro. Template:Refbegin
- ↑ Template:Cite thesis
- ↑ 20 year-old Real Estate Agent Rusty from Saint-Paul, has hobbies and interests which includes monopoly, property developers in singapore and poker. Will soon undertake a contiki trip that may include going to the Lower Valley of the Omo.
My blog: http://www.primaboinca.com/view_profile.php?userid=5889534