Statistical inference: Difference between revisions

From formulasearchengine
Jump to navigation Jump to search
en>Monkbot
en>SolidPhase
better citation of Bandyopadhyay & Forster
 
(One intermediate revision by one other user not shown)
Line 1: Line 1:
{{refimprove|date=September 2013}}
Marvella is what you can call her but it's not the most female name out there. For years he's been residing in North Dakota and his family enjoys it. I am a meter reader but I plan on altering it. Doing ceramics is what her family members and her enjoy.<br><br>Feel free to surf to my weblog ... [http://tube.ledawy.com/user/EHackney ledawy.com]
In [[statistics]] and [[quantitative research]] methodology, a '''data sample''' is a set of [[data]] collected and/or selected from a [[statistical population]] by a defined procedure.
 
==Kinds of samples==
In other words, the population is very large, making a [[census]] or a complete [[enumeration]] of all the values in the population impractical or impossible. The sample represents a subset of manageable size. Samples are collected and statistics are calculated from the samples so that one can make [[inference]]s or [[extrapolation]]s from the sample to the population. This process of collecting information from a sample is referred to as [[Sampling (statistics)|sampling]]. The data sample may be drawn from a population without replacement, in which case it is a [[subset]] of a [[Statistical population|population]]; or with replacement, in which case it is a multisubset.
 
A '''complete sample''' is a set of objects from a parent population that includes ALL such objects that satisfy a set of well-defined selection criteria. For example, a complete sample of Australian men taller than 2m would consist of a list of '''every''' Australian male taller than 2m. But it wouldn't include German males, or tall Australian females, or people shorter than 2m. So to compile such a complete sample requires a complete list of the parent population, including data on height, gender, and nationality for each member of that parent population. In the case of human populations, such a complete list is unlikely to exist, but such complete samples are often available in other disciplines, such as complete magnitude-limited  samples of astronomical objects.
 
An '''unbiased (representative) sample'''  is a set of objects chosen from a complete sample using a selection process that does not depend on the properties of the objects. For example, an unbiased sample  of Australian men taller than 2m might consist of a randomly sampled subset of 1% of Australian males taller than 2m. But one chosen from the electoral register might not be unbiased since, for example, males aged under 18 will not be on the electoral register. In an astronomical context, an unbiased sample might consist of that fraction of a complete sample for which data are available, provided the data availability is not biased by individual source properties.
 
The best way to avoid a biased or unrepresentative sample is to select a [[random sample]], also known as a probability sample. A random sample is defined as a sample where each individual member of the population has a known, non-zero chance of being selected as part of the sample. Several types of random samples are [[simple random sample]]s, [[Systematic sampling|systematic samples]], [[stratified sampling|stratified random samples]], and [[Cluster sampling|cluster random samples]].
 
A sample that is not random is called a [[Non-random sampling|non-random sample]] or a [[Non-probability sample|non-probability sampling]]. Some examples of nonrandom samples are [[convenience sample]]s, [[judgment sample]]s, [[purposive sample]]s, [[quota sample]]s, [[snowball sampling|snowball samples]], and [[quadrature node]]s in [[quasi-Monte Carlo method]]s.
 
Statistic samples have multiple uses. They can be used in many situations.
 
==Mathematical description of random sample==<!-- This section is linked from [[Sample (probability)]] -->
In mathematical terms, given a [[random variable]] ''X'' with [[Probability distribution|distribution]] ''F'', a random sample of length ''n'' (where ''n'' may be any of 1,2,3,...) is a set of ''n'' [[Statistical independence|independent]], identically distributed ([[Independent and identically-distributed random variables|iid]]) random variables with distribution ''F''.<ref>[[Samuel S. Wilks]], ''Mathematical Statistics'', John Wiley, 1962, Section 8.1 </ref>
 
A sample concretely represents ''n'' experiments in which the same quantity is measured. For example, if ''X'' represents the height of an individual and ''n'' individuals are measured, <math>X_i</math> will be the height of the ''i''-th individual. Note that a sample of random variables (i.e. a set of measurable functions) must not be confused with the realizations of these variables (which are the values that these random variables take, formally called [[random variate]]s). In other words, <math>X_i</math> is a function representing the measurement at the i-th experiment and <math>x_i=X_i(\omega)</math> is the value actually obtained when making the measurement.
 
The concept of a sample thus includes the ''process'' of how the data are obtained (that is, the random variables). This is necessary so that mathematical statements can be made about the sample and [[statistic]]s computed from it, such as the [[sample mean and covariance]].
 
==See also==
*[[Estimation theory]]
*[[Replication (statistics)]]
*[[Sample size determination]]
*[[Sampling (statistics)]]
*[[Survey sampling]]
 
== Notes ==
<references/>
 
==External links==
*[http://www.socialresearchmethods.net/kb/sampstat.php Statistical Terms Made Simple]
 
[[Category:Sampling (statistics)]]
 
[[pl:Próba statystyczna]]

Latest revision as of 10:17, 16 December 2014

Marvella is what you can call her but it's not the most female name out there. For years he's been residing in North Dakota and his family enjoys it. I am a meter reader but I plan on altering it. Doing ceramics is what her family members and her enjoy.

Feel free to surf to my weblog ... ledawy.com