Smooth morphism: Difference between revisions

Revision as of 16:50, 21 July 2013

Discriminant function analysis is a statistical technique that predicts a categorical dependent variable (called a grouping variable) from one or more continuous or binary independent variables (called predictor variables). The original dichotomous discriminant analysis was developed by Sir Ronald Fisher in 1936.^[1] It differs from ANOVA and MANOVA, which are used to predict one (ANOVA) or multiple (MANOVA) continuous dependent variables from one or more categorical independent variables. Discriminant function analysis is useful in determining whether a set of variables is effective in predicting category membership.^[2]

Discriminant analysis is used when groups are known a priori (unlike in cluster analysis). Each case must have a score on one or more quantitative predictor measures, and a score on a group measure.^[3] In simple terms, discriminant function analysis is classification: the act of distributing things into groups, classes or categories of the same type.

Moreover, it is a useful follow-up procedure to a MANOVA, in place of a series of one-way ANOVAs, for ascertaining how the groups differ on the composite of dependent variables. In this case, a significant F test allows classification based on a linear combination of predictor variables. The terminology can be confusing here, because the dependent variables of the MANOVA become the predictor variables of the discriminant analysis, and the independent variables of the MANOVA become the grouping variables.^[2]

Assumptions

The assumptions of discriminant analysis are the same as those for MANOVA. The analysis is quite sensitive to outliers and the size of the smallest group must be larger than the number of predictor variables.^[3]

Multivariate normality: Independent variables are normal for each level of the grouping variable.^[2]^[3]

Homogeneity of variance/covariance (homoscedasticity): The variances and covariances of the predictors are the same across levels of the grouping variable. This can be tested with Box's M statistic.^[2] It has been suggested, however, that linear discriminant analysis be used when covariances are equal, and that quadratic discriminant analysis may be used when covariances are not equal.^[3]

Multicollinearity: Predictive power can decrease with an increased correlation between predictor variables.^[3]

Independence: Participants are assumed to be randomly sampled, and a participant’s score on one variable is assumed to be independent of scores on that variable for all other participants.^[2]^[3]

It has been suggested that discriminant analysis is relatively robust to slight violations of these assumptions,^[4] and it has also been shown that discriminant analysis may still be reliable when using dichotomous variables (where multivariate normality is often violated).^[5]

Discriminant functions

Discriminant analysis works by creating one or more linear combinations of predictors, creating a new latent variable for each function. These functions are called discriminant functions. The number of possible functions is the smaller of $N_{g}-1$, where $N_{g}$ is the number of groups, and $p$, the number of predictors. The first function created maximizes the differences between groups on that function. The second function maximizes differences on that function, but must also be uncorrelated with the first. This continues for subsequent functions, each of which must be uncorrelated with all of the previous functions.

Given group $j$ and a corresponding region $\mathbb {R} _{j}$ of the sample space, the discriminant rule assigns $x$ to group $j$ whenever $x\in \mathbb {R} _{j}$. Discriminant analysis then finds “good” regions $\mathbb {R} _{j}$ that minimize classification error, leading to a high percentage of correctly classified cases in the classification table.^[6]

Each function is given a discriminant score to determine how well it predicts group placement.

Structure Correlation Coefficients: The correlation between each predictor and the discriminant score of each function. This is a whole (not partial) correlation.^[7]
Standardized Coefficients: Each predictor’s unique contribution to each function, therefore this is a partial correlation. Indicates the relative importance of each predictor in predicting group assignment from each function.
Functions at Group Centroids: Mean discriminant scores for each grouping variable are given for each function. The farther apart the means are, the less error there will be in classification.
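
As a rough illustration of the ideas in this section, the following minimal sketch computes a single discriminant function for two groups measured on two predictors, then the discriminant scores and the function's value at each group centroid. The data, the function names and the midpoint cut-off are assumptions made for this example only; they do not come from the cited sources, and the weights are left unstandardized.

```python
# Hypothetical data: two groups, two predictors per case.
group0 = [(1.0, 2.0), (2.0, 1.0), (2.0, 3.0), (3.0, 2.0)]
group1 = [(5.0, 6.0), (6.0, 5.0), (6.0, 7.0), (7.0, 6.0)]


def mean_vector(rows):
    """Mean of each predictor within one group."""
    n = len(rows)
    return [sum(r[k] for r in rows) / n for k in range(len(rows[0]))]


def within_scatter(groups):
    """Pooled within-group sums of squares and cross-products (2 x 2)."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for rows in groups:
        m = mean_vector(rows)
        for r in rows:
            d = [r[0] - m[0], r[1] - m[1]]
            for a in range(2):
                for b in range(2):
                    s[a][b] += d[a] * d[b]
    return s


def fisher_weights(g0, g1):
    """Weights proportional to S_within^-1 (mean1 - mean0)."""
    s = within_scatter([g0, g1])
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    inv = [[s[1][1] / det, -s[0][1] / det],
           [-s[1][0] / det, s[0][0] / det]]
    m0, m1 = mean_vector(g0), mean_vector(g1)
    diff = [m1[0] - m0[0], m1[1] - m0[1]]
    return [inv[0][0] * diff[0] + inv[0][1] * diff[1],
            inv[1][0] * diff[0] + inv[1][1] * diff[1]]


def score(weights, x):
    """Discriminant score: a linear combination of the predictors."""
    return weights[0] * x[0] + weights[1] * x[1]


# With 2 groups and 2 predictors there is min(2 - 1, 2) = 1 function.
n_functions = min(2 - 1, 2)
w = fisher_weights(group0, group1)

# Function at group centroids: the mean discriminant score of each group.
centroid0 = sum(score(w, x) for x in group0) / len(group0)
centroid1 = sum(score(w, x) for x in group1) / len(group1)

# Assumed cut-off for equal group sizes: midpoint between the centroids.
cutoff = (centroid0 + centroid1) / 2.0
new_case = (3.0, 3.0)
predicted = 1 if score(w, new_case) > cutoff else 0
print(w, centroid0, centroid1, predicted)
```

The farther apart the two centroid values are relative to the spread of scores within each group, the fewer cases fall on the wrong side of the cut-off.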

Discrimination rules

Maximum likelihood: Assigns x to the group that maximizes population (group) density.^[6]
Bayes Discriminant Rule: Assigns x to the group that maximizes $\pi _{i}f_{i}(x)$ , where $\pi _{i}$ represents the prior probability of that classification, and $f_{i}(x)$ represents the population density.^[6]
Fisher’s linear discriminant rule: Maximizes the ratio between SS_between and SS_within , and finds a linear combination of the predictors to predict group.^[6]
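
The first two rules can be contrasted with a small sketch. It assumes, purely for illustration, two groups with univariate normal densities and made-up prior probabilities; the group labels and function names are hypothetical. The maximum likelihood rule compares the densities alone, while the Bayes rule weights each density by the prior probability of its group.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def ml_rule(x, densities):
    """Assign x to the group whose density at x is largest."""
    return max(densities, key=lambda g: densities[g](x))


def bayes_rule(x, densities, priors):
    """Assign x to the group maximizing prior * density."""
    return max(densities, key=lambda g: priors[g] * densities[g](x))


# Assumed group densities and priors (illustrative values only).
densities = {
    "A": lambda x: normal_pdf(x, 0.0, 1.0),
    "B": lambda x: normal_pdf(x, 2.0, 1.0),
}
priors = {"A": 0.9, "B": 0.1}

x = 1.2
ml_group = ml_rule(x, densities)
bayes_group = bayes_rule(x, densities, priors)
print(ml_group, bayes_group)
```

In this example the observation lies closer to the mean of group B, so the maximum likelihood rule picks B, but the much larger prior probability of group A tips the Bayes rule towards A. With equal priors the two rules agree.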

Eigenvalues

In discriminant analysis, each function has an eigenvalue (its characteristic root). It indicates how well that function differentiates the groups: the larger the eigenvalue, the better the function differentiates.^[3] This, however, should be interpreted with caution, as eigenvalues have no upper limit.^[2]^[3] The eigenvalue can be viewed as the ratio of SS_between to SS_within, as in an ANOVA in which the dependent variable is the discriminant function and the groups are the levels of the independent variable.^[2] This means that the largest eigenvalue is associated with the first function, the second largest with the second, and so on.
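
The ratio interpretation can be checked by hand. The sketch below treats a set of hypothetical discriminant scores as the dependent variable of a one-way ANOVA and computes the between-group and within-group sums of squares; the scores and names are assumptions for illustration, not results from the cited sources.

```python
def sums_of_squares(score_groups):
    """Return (SS_between, SS_within) for scores split by group."""
    all_scores = [s for group in score_groups for s in group]
    grand_mean = sum(all_scores) / len(all_scores)
    ss_between = 0.0
    ss_within = 0.0
    for group in score_groups:
        group_mean = sum(group) / len(group)
        ss_between += len(group) * (group_mean - grand_mean) ** 2
        ss_within += sum((s - group_mean) ** 2 for s in group)
    return ss_between, ss_within


# Hypothetical discriminant scores for two groups on one function.
scores = [[3.0, 3.0, 5.0, 5.0], [11.0, 11.0, 13.0, 13.0]]

ss_between, ss_within = sums_of_squares(scores)
eigenvalue = ss_between / ss_within
print(ss_between, ss_within, eigenvalue)
```

Because the denominator can be arbitrarily small when groups are tightly clustered, the ratio has no upper bound, which is why the raw eigenvalue is hard to interpret on its own.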

Effect size

Some suggest the use of eigenvalues as effect size measures; however, this is generally not supported.^[2] Instead, the canonical correlation is the preferred measure of effect size. It is similar to the eigenvalue, but is the square root of the ratio of SS_between to SS_total. It is the correlation between groups and the function.^[2] Another popular measure of effect size is the percent of variance accounted for by each function. This is calculated as (λ_x/Σλ_i) × 100, where λ_x is the eigenvalue for the function and Σλ_i is the sum of all eigenvalues. This tells us how strong the prediction is for that particular function compared to the others.^[2] Percent correctly classified can also be analyzed as an effect size. The kappa statistic can describe this while correcting for chance agreement.^[2]
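
A short sketch of these three measures follows. It relies on the identity SS_total = SS_between + SS_within, so that SS_between/SS_total equals λ/(1 + λ). The eigenvalues and the classification table are made up, and reading "kappa" as Cohen's kappa is an assumption of this example rather than something stated in the sources.

```python
import math


def canonical_correlation(eigenvalue):
    """sqrt(SS_between / SS_total), written in terms of the eigenvalue."""
    return math.sqrt(eigenvalue / (1.0 + eigenvalue))


def percent_of_variance(eigenvalues):
    """Share of the summed eigenvalues carried by each function, in %."""
    total = sum(eigenvalues)
    return [100.0 * ev / total for ev in eigenvalues]


def cohens_kappa(table):
    """Chance-corrected agreement for a square classification table
    (rows: actual group, columns: predicted group)."""
    n = sum(sum(row) for row in table)
    observed = sum(table[i][i] for i in range(len(table))) / n
    row_totals = [sum(row) for row in table]
    col_totals = [sum(row[j] for row in table) for j in range(len(table))]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (n * n)
    return (observed - expected) / (1.0 - expected)


# Illustrative values only.
eigenvalues = [3.0, 1.0]
percents = percent_of_variance(eigenvalues)
r_first = canonical_correlation(eigenvalues[0])
kappa = cohens_kappa([[40, 10], [5, 45]])
print(percents, round(r_first, 3), round(kappa, 3))
```

Here 85% of cases are classified correctly, but half of that agreement would be expected by chance given the marginal totals, so kappa is noticeably lower than the raw percentage.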

Variations

Multiple discriminant analysis (MDA): related to MANOVA. Has more than two groups, and uses multiple dummy variables.^[7]
Sequential discriminant analysis: assesses the importance of a set of IVs over and above a set of controls. In this case, the controls are entered first, and then the IVs.^[7]
Stepwise discriminant analysis: Selects the most correlated predictor first, removes that variance in the grouping variable then adds the next most correlated and continues until the change in canonical correlation is not significant. Of course, both forward and backward stepwise procedures may be performed.^[7]

Comparison to logistic regression

Discriminant function analysis is very similar to logistic regression, and both can be used to answer the same research questions.^[2] Logistic regression does not have as many assumptions and restrictions as discriminant analysis. However, when the assumptions of discriminant analysis are met, it is more powerful than logistic regression.^[citation needed] Unlike logistic regression, discriminant analysis can be used with small sample sizes. It has been shown that when sample sizes are equal, and homogeneity of variance/covariance holds, discriminant analysis is more accurate.^[3] With all this being considered, logistic regression is the common choice nowadays, since the assumptions of discriminant analysis are rarely met.^[1]^[3]

References

[1] Cohen et al. (2003). Applied Multiple Regression/Correlation Analysis for the Behavioural Sciences, 3rd ed. Taylor & Francis Group.
[2] Green, S. B., Salkind, N. J. & Akey, T. M. (2008). Using SPSS for Windows and Macintosh: Analyzing and understanding data. New Jersey: Prentice Hall.
[3] BÖKEOĞLU ÇOKLUK, Ö, & BÜYÜKÖZTÜRK, Ş. (2008). Discriminant function analysis: Concept and application. Eğitim araştırmaları dergisi, (33), 73-92.
[4] Lachenbruch, P. A. (1975). Discriminant analysis. NY: Hafner.
[5] Klecka, William R. (1980). Discriminant analysis. Quantitative Applications in the Social Sciences Series, No. 19. Thousand Oaks, CA: Sage Publications.
[6] Hardle, W., Simar, L. (2007). Applied Multivariate Statistical Analysis. Springer Berlin Heidelberg. pp. 289-303.
[7] Garson, G. D. (2008). Discriminant function analysis. http://www2.chass.ncsu.edu/garson/pa765/discrim.htm .

@@ Line 1: / Line 1: @@
+'''Discriminant function analysis''' is a statistical analysis to predict a [[categorical variable|categorical]] [[dependent variable|dependent]] [[Variable (mathematics)#Applied statistics|variable]] (called a grouping variable) by one or more [[continuous variable|continuous]] or [[Binary variable|binary]] [[independent variable|independent]] variables (called predictor variables). The original dichotomous discriminant analysis was developed by Sir Ronald Fisher in 1936.<ref name="cohen">Cohen et al. Applied Multiple Regression/Correlation Analysis for the Behavioural Sciences 3rd ed. (2003). Taylor & Francis Group.</ref> It is different from an [[ANOVA]] or [[MANOVA]], which is used to predict one (ANOVA) or multiple (MANOVA) continuous dependent variables by one or more independent categorical variables. Discriminant function analysis is useful in determining whether a set of variables is effective in predicting category membership.<ref name="green">Green, S.B. Salkind, N. J. & Akey, T. M. (2008). Using SPSS for Windows and Macintosh: Analyzing and understanding data. New Jersey: Prentice Hall.</ref>
+Discriminant analysis is used when groups are known a priori (unlike in [[cluster analysis]]). Each case must have a score on one or more quantitative predictor measures, and a score on a group measure.<ref name="buy">BÖKEOĞLU ÇOKLUK, Ö, & BÜYÜKÖZTÜRK, Ş. (2008). Discriminant function analysis: Concept and application. Eğitim araştırmaları dergisi, (33), 73-92.</ref> In simple terms, discriminant function analysis is classification - the act of distributing things into groups, classes or categories of the same type.
+Moreover, it is a useful follow-up procedure to a MANOVA instead of doing a series of one-way ANOVAs, for ascertaining how the groups differ on the composite of dependent variables. In this case, a significant F test allows classification based on a linear combination of predictor variables. Terminology can get confusing here, as in MANOVA, the dependent variables are the predictor variables, and the independent variables are the grouping variables.<ref name="green"/>
+==Assumptions==
+The assumptions of discriminant analysis are the same as those for MANOVA. The analysis is quite sensitive to outliers and the size of the smallest group must be larger than the number of predictor variables.<ref name="buy"/>
+*[[Multivariate normal distribution|Multivariate normality]]: Independent variables are normal for each level of the grouping variable.<ref name="green"/><ref name="buy"/>
+*Homogeneity of variance/covariance ([[homoscedasticity]]): Variances among group variables are the same across levels of predictors. Can be tested with Box's M statistic.<ref name="green"/>{{page needed|date=April 2012}} It has been suggested, however, that [[linear discriminant analysis]] be used when covariances are equal, and that [[quadratic classifier#Quadratic discriminant analysis|quadratic discriminant analysis]] may be used when covariances are not equal.<ref name="buy"/>
+*[[Multicollinearity]]: Predictive power can decrease with an increased correlation between predictor variables.<ref name="buy"/>
+*[[statistical independence|Independence]]: Participants are assumed to be randomly sampled, and a participant’s score on one variable is assumed to be independent of scores on that variable for all other participants.<ref name="green"/><ref name="buy"/>
+	It has been suggested that discriminant analysis is relatively robust to slight violations of these assumptions,<ref>Lachenbruch, P. A. (1975). ''Discriminant analysis''. NY: Hafner</ref> and it has also been shown that discriminant analysis may still be reliable when using dichotomous variables (where multivariate normality is often violated).<ref>Klecka, William R. (1980). ''Discriminant analysis''. Quantitative Applications in the Social Sciences Series, No. 19. Thousand Oaks, CA: Sage Publications.</ref>
+==Discriminant functions==
+Discriminant analysis works by creating one or more linear combinations of predictors, creating a new [[latent variable]] for each function. These functions are called discriminant functions. The number of functions possible is either ''Ng''-1 where ''Ng'' = number of groups, or ''p'' (the number of predictors), whichever is smaller. The first function created maximizes the differences between groups on that function. The second function maximizes differences on that function, but also must not be correlated with the previous function. This continues with subsequent functions with the requirement that the new function not be correlated with any of the previous functions.
+Given group <math>j</math> and a corresponding region <math>\mathbb{R}_j</math> of the sample space, the discriminant rule assigns <math>x</math> to group <math>j</math> whenever <math>x \in\mathbb{R}_j</math>. Discriminant analysis then finds “good” regions <math>\mathbb{R}_j</math> that minimize classification error, leading to a high percentage of correctly classified cases in the classification table.<ref name="har"/>
+Each function is given a discriminant score to determine how well it predicts group placement.
+*Structure Correlation Coefficients: The correlation between each predictor and the discriminant score of each function. This is a whole{{clarify|date=April 2012}} correlation.<ref name="garson"/>
+*Standardized Coefficients: Each predictor’s unique contribution to each function, therefore this is a [[partial correlation]]. Indicates the relative importance of each predictor in predicting group assignment from each function.
+*Functions at Group Centroids: Mean discriminant scores for each grouping variable are given for each function. The farther apart the means are, the less error there will be in classification.
+==Discrimination rules==
+*[[Maximum likelihood]]: Assigns x to the group that maximizes population (group) density.<ref name="har">Hardle, W., Simar, L. (2007). ''Applied Multivariate Statistical Analysis''. Springer Berlin Heidelberg. pp. 289-303.</ref>
+*Bayes Discriminant Rule: Assigns x to the group that maximizes <math>\pi_i f_i(x)</math>, where ''π<sub>i</sub>'' represents the [[prior probability]] of that classification, and <math>f_i(x)</math> represents the population density.<ref name="har"/>
+*[[Linear Discriminant Analysis|Fisher’s linear discriminant rule]]: Maximizes the ratio between ''SS''<sub>between</sub> and ''SS''<sub>within</sub> , and finds a linear combination of the predictors to predict group.<ref name="har"/>
+==Eigenvalues==
+	An [[eigenvalues and eigenvectors|eigenvalue]] in discriminant analysis is the characteristic root of each function.{{clarify|date=April 2012}} It is an indication of how well that function differentiates the groups, where the larger the eigenvalue, the better the function differentiates.<ref name="buy"/> This however, should be interpreted with caution, as eigenvalues have no upper limit.<ref name="green"/><ref name="buy"/>
+	The eigenvalue can be viewed as a ratio of ''SS''<sub>between</sub> and ''SS''<sub>within</sub> as in ANOVA when the dependent variable is the discriminant function, and the groups are the levels of the IV{{clarify|date=April 2012}}.<ref name="green"/> This means that the largest eigenvalue is associated with the first function, the second largest with the second, etc..
+==Effect size==
+	Some suggest the use of eigenvalues as [[effect size]] measures, however, this is generally not supported.<ref name="green"/> Instead, the [[canonical correlation]] is the preferred measure of effect size. It is similar to the eigenvalue, but is the square root of the ratio of ''SS''<sub>between</sub> and ''SS''<sub>total</sub>. It is the correlation between groups and the function.<ref name="green"/>
+	Another popular measure of effect size is the percent of variance{{clarify|date=April 2012}} for each function.  This is calculated by: (''λ<sub>x</sub>/Σλ<sub>i</sub>'') X 100 where ''λ<sub>x</sub>'' is the eigenvalue for the function and Σ''λ<sub>i</sub>'' is the sum of all eigenvalues. This tells us how strong the prediction is for that particular function compared to the others.<ref name="green"/>
+	Percent correctly classified can also be analyzed as an effect size. The kappa value{{clarify|date=April 2012}} can describe this while correcting for chance agreement.<ref name="green"/>
+==Variations==
+*[[Linear Discriminant Analysis#Multiclass LDA|Multiple discriminant analysis (MDA)]]: related to MANOVA. Has more than two groups, and uses multiple dummy variables.<ref name="garson">Garson, G. D. (2008). Discriminant function analysis. http://www2.chass.ncsu.edu/garson/pa765/discrim.htm .</ref>
+*Sequential discriminant analysis: assesses the importance of a set of IVs over and above a set of controls. In this case, the controls are entered first, and then the IVs.<ref name="garson"/>
+*Stepwise discriminant analysis: Selects the most correlated predictor first, removes that variance in the grouping variable then adds the next most correlated and continues until the change in canonical correlation is not significant. Of course, both forward and backward stepwise procedures may be performed.<ref name="garson"/>
+==Comparison to logistic regression==
+	Discriminant function analysis is very similar to [[logistic regression]], and both can be used to answer the same research questions.<ref name="green"/> Logistic regression does not have as many assumptions and restrictions as discriminant analysis. However, when discriminant analysis’ assumptions are met, it is more powerful than logistic regression.{{citation needed|date=April 2012}} Unlike logistic regression, discriminant analysis can be used with small sample sizes. It has been shown that when sample sizes are equal, and homogeneity of variance/covariance holds, discriminant analysis is more accurate.<ref name="buy"/> With all this being considered, logistic regression is the common choice nowadays, since the assumptions of discriminant analysis are rarely met.<ref name="cohen"/><ref name="buy"/>
+==See also==
+{{wikiversity}}
+*[[Statistical classification]]
+*[[Linear discriminant analysis]]
+*[[Multiple discriminant analysis]]
+==References==
+{{Reflist}}
+==External links==
+* [http://www2.chass.ncsu.edu/garson/pa765/discrim.htm Course notes, Discriminant function analysis by G. David Garson, NC State University]
+* [http://www.psychstat.missouristate.edu/multibook/mlt03m.html Course notes, Discriminant function analysis by David W. Stockburger, Missouri State University]
+* [http://userwww.sfsu.edu/~efc/classes/biol710/discrim/discrim.pdf Discriminant function analysis (DA) by John Poulsen and Aaron French, San Francisco State University]
+[[Category:Multivariate statistics]]
+[[Category:Statistical classification]]
