<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://en.formulasearchengine.com/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=62.227.224.100</id>
	<title>formulasearchengine - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://en.formulasearchengine.com/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=62.227.224.100"/>
	<link rel="alternate" type="text/html" href="https://en.formulasearchengine.com/wiki/Special:Contributions/62.227.224.100"/>
	<updated>2026-05-30T06:46:41Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.0-wmf.28</generator>
	<entry>
		<id>https://en.formulasearchengine.com/index.php?title=Fibonacci_retracement&amp;diff=24929</id>
		<title>Fibonacci retracement</title>
		<link rel="alternate" type="text/html" href="https://en.formulasearchengine.com/index.php?title=Fibonacci_retracement&amp;diff=24929"/>
		<updated>2013-11-03T21:39:08Z</updated>

		<summary type="html">&lt;p&gt;62.227.224.100: /* External links */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{orphan|date=July 2010}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Cluster labeling&#039;&#039;&#039; is closely related to the concept of [[text clustering]]. This process tries to select descriptive labels for the clusters obtained through a clustering algorithm such as Flat Clustering and [[Hierarchical Clustering]]. For example, a cluster of documents that talks about various [[internet protocols]] can be best labeled as &amp;quot;Internet Protocols&amp;quot;. Typically, the labels are obtained by examining the contents of the documents in a cluster. A good label not only summarizes the central concept of a cluster but also uniquely differentiates it from other clusters in the collection.&lt;br /&gt;
&lt;br /&gt;
==Differential Cluster Labeling==&lt;br /&gt;
Differential cluster labeling labels a cluster by comparing the terms in one cluster with the terms occurring in other clusters. The techniques used for [[feature selection]] in [[information retrieval]], such as [[mutual information]] and [[Pearson&#039;s chi-squared test|chi-squared feature selection]], can also be applied to differential cluster labeling.  Terms having very low frequency are not the best in representing the whole cluster and can be omitted in labeling a cluster.  By omitting those rare terms and using a differential test, one can achieve the best results with differential cluster labeling.&amp;lt;ref&amp;gt;Manning, Christopher D., Prabhakar Raghavan, and Hinrich Schutze. &#039;&#039;Introduction to Information Retrieval&#039;&#039;. Cambridge: Cambridge UP, 2008. &#039;&#039;Cluster Labeling&#039;&#039;. Stanford Natural Language Processing Group. Web. 25 Nov. 2009. &amp;lt;http://nlp.stanford.edu/IR-book/html/htmledition/cluster-labeling-1.html&amp;gt;.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Mutual Information===&lt;br /&gt;
&lt;br /&gt;
{{Main|Mutual information}}&lt;br /&gt;
&lt;br /&gt;
In the fields of [[probability theory]] and [[information theory]], mutual information measures the degree of dependence of two [[random variables]].  The mutual information of two variables X and Y is defined as:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;I(X, Y) = \sum_{x\in X}{ \sum_{y\in Y} {p(x, y)log_2\left(\frac{p(x, y)}{p_1(x)p_2(y)}\right)}}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
where &#039;&#039;p(x, y)&#039;&#039; is the [[joint probability|joint probability distribution]] of the two variables, &#039;&#039;p&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt;(x)&#039;&#039; is the probability distribution of X, and &#039;&#039;p&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt;(y)&#039;&#039; is the probability distribution of Y.&lt;br /&gt;
&lt;br /&gt;
In the case of cluster labeling, the variable X is associated with membership in a cluster, and the variable Y is associated with the presence of a term.&amp;lt;ref&amp;gt;Manning, Christopher D., Prabhakar Raghavan, and Hinrich Schutze. &#039;&#039;Introduction to Information Retrieval&#039;&#039;. Cambridge: Cambridge UP, 2008. &#039;&#039;Mutual Information&#039;&#039;. Stanford Natural Language Processing Group. Web. 25 Nov. 2009. &amp;lt;http://nlp.stanford.edu/IR-book/html/htmledition/mutual-information-1.html&amp;gt;.&amp;lt;/ref&amp;gt;  Both variables can have values of 0 or 1, so the equation can be rewritten as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;I(C, T) = \sum_{c\in {0, 1}}{ \sum_{t\in {0, 1}} {p(C = c, T = t)log_2\left(\frac{p(C = c, T = t)}{p(C = c)p(T = t)}\right)}}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
In this case, &#039;&#039;p(C = 1)&#039;&#039; represents the probability that a randomly selected document is a member of a particular cluster, and &#039;&#039;p(C = 0)&#039;&#039; represents the probability that it isn&#039;t.  Similarly, &#039;&#039;p(T = 1)&#039;&#039; represents the probability that a randomly selected document contains a given term, and &#039;&#039;p(T = 0)&#039;&#039; represents the probability that it doesn&#039;t.  The [[joint probability|joint probability distribution function]] &#039;&#039;p(C, T)&#039;&#039; represents the probability that two events occur simultaneously.  For example, &#039;&#039;p(0, 0)&#039;&#039; is the probability that a document isn&#039;t a member of cluster &#039;&#039;c&#039;&#039; and doesn&#039;t contain term &#039;&#039;t&#039;&#039;; &#039;&#039;p(0, 1)&#039;&#039; is the probability that a document isn&#039;t a member of cluster &#039;&#039;c&#039;&#039; and does contain term &#039;&#039;t&#039;&#039;; and so on.&lt;br /&gt;
&lt;br /&gt;
====Example====&lt;br /&gt;
The following example calculates the mutual information between the cluster &amp;quot;commerce&amp;quot; and the term &amp;quot;tariff&amp;quot;:&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!&lt;br /&gt;
! Documents in &amp;quot;commerce&amp;quot;&lt;br /&gt;
! Documents not in &amp;quot;commerce&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
| Documents containing &amp;quot;tariff&amp;quot;&lt;br /&gt;
| 60&lt;br /&gt;
| 10,000&lt;br /&gt;
|-&lt;br /&gt;
| Documents not containing &amp;quot;tariff&amp;quot;&lt;br /&gt;
| 200&lt;br /&gt;
| 500,000&lt;br /&gt;
|}&lt;br /&gt;
Total number of documents = (60 + 200 + 10,000 + 500,000) = 510,260&lt;br /&gt;
&lt;br /&gt;
P (C = 1, T = 1) = 60/510,260&lt;br /&gt;
&lt;br /&gt;
P (C = 1, T = 0) = 200/510,260&lt;br /&gt;
&lt;br /&gt;
P (C = 0, T = 1) = 10,000/510,260&lt;br /&gt;
&lt;br /&gt;
P (C = 0, T = 0) = 500,000/510,260&lt;br /&gt;
&lt;br /&gt;
P (C = 1) = (# of documents in the cluster) / (total number of documents) = (60 + 200) / 510,260 = 260/510,260&lt;br /&gt;
&lt;br /&gt;
P (C = 0) = (# of documents not in the cluster) / (total number of documents) = (10,000 + 500,000) / 510,260 = 510,000/510,260&lt;br /&gt;
&lt;br /&gt;
P (T = 1) = (# of documents containing the term) / (total number of documents) = (60 + 10,000) / 510,260 = 10,060/510,260&lt;br /&gt;
&lt;br /&gt;
P (T = 0) = (# of documents not containing the term) / (total number of documents) = (200 + 500,000) / 510,260 = 500,200/510,260&lt;br /&gt;
&lt;br /&gt;
Plugging these probabilities into the above equation gives us the following MI value:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;I(C, T) = \frac{60}{510,260} log_2\left(\frac{60/510,260}{260/510,260 * 10,060/510,260}\right)&amp;lt;/math&amp;gt;&lt;br /&gt;
&amp;lt;math&amp;gt;+ \frac{200}{510,260} log_2\left(\frac{200/510,260}{260/510,260 * 500,200/510,260}\right)&amp;lt;/math&amp;gt;&lt;br /&gt;
&amp;lt;math&amp;gt;+ \frac{10,000}{510,260} log_2\left(\frac{10,000/510,260}{510,000/510,260 * 10,060/510,260}\right)&amp;lt;/math&amp;gt;&lt;br /&gt;
&amp;lt;math&amp;gt;+ \frac{500,000}{510,260} log_2\left(\frac{500,000/510,260}{510,000/510,260 * 500,200/510,260}\right)&amp;lt;/math&amp;gt;&lt;br /&gt;
&amp;lt;math&amp;gt;= \frac{60}{510,260} log_2\left(\frac{60*510,260}{260 * 10,060}\right) + \frac{200}{510,260} log_2\left(\frac{200*510,260}{260 * 500,200}\right)&amp;lt;/math&amp;gt;&lt;br /&gt;
&amp;lt;math&amp;gt;+ \frac{10,000}{510,260} log_2\left(\frac{10,000*510,260}{510,000 * 10,060}\right) + \frac{500,000}{510,260} log_2\left(\frac{500,000*510,260}{510,000 * 500,200}\right)&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;nbsp;= 0.000417322 - 0.000137100 - 0.000154725 + 0.000155158&lt;br /&gt;
&lt;br /&gt;
&amp;amp;nbsp;= 0.000280655&lt;br /&gt;
&lt;br /&gt;
Therefore, the mutual information between the cluster &amp;quot;commerce&amp;quot; and the term &amp;quot;tariff&amp;quot; is 0.000280655.  We can create a label for the cluster &amp;quot;commerce&amp;quot; by calculating the mutual information of each term in the cluster, and selected the k terms with the highest MI value.&lt;br /&gt;
&lt;br /&gt;
===Chi-Squared Selection===&lt;br /&gt;
{{Main|Pearson&#039;s chi-squared test}}&lt;br /&gt;
The Pearson&#039;s chi-squared test can be used to calculate the probability that the occurrence of an event matches the initial expectations.  In particular, it can be used to determine whether two events, A and B, are [[statistically independent]].  The value of the chi-squared statistic is:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;X^2 = \sum_{a \in A}{\sum_{b \in B}{\frac{(O_{a,b} - E_{a, b})^2}{E_{a, b}}}}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
where &#039;&#039;O&amp;lt;sub&amp;gt;a,b&amp;lt;/sub&amp;gt;&#039;&#039; is the &#039;&#039;observed&#039;&#039; frequency of a and b co-occurring, and &#039;&#039;E&amp;lt;sub&amp;gt;a,b&amp;lt;/sub&amp;gt;&#039;&#039; is the &#039;&#039;expected&#039;&#039; frequency of co-occurrence.&lt;br /&gt;
&lt;br /&gt;
In the case of cluster labeling, the variable A is associated with membership in a cluster, and the variable B is associated with the presence of a term.  Both variables can have values of 0 or 1, so the equation can be rewritten as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;X^2 = \sum_{a \in {0,1}}{\sum_{b \in {0,1}}{\frac{(O_{a,b} - E_{a, b})^2}{E_{a, b}}}}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For example, &#039;&#039;O&amp;lt;sub&amp;gt;1,0&amp;lt;/sub&amp;gt;&#039;&#039; is the observed number of documents that are in a particular cluster but don&#039;t contain a certain term, and &#039;&#039;E&amp;lt;sub&amp;gt;1,0&amp;lt;/sub&amp;gt;&#039;&#039; is the expected number of documents that are in a particular cluster but don&#039;t contain a certain term.&lt;br /&gt;
Our initial assumption is that the two events are independent, so the expected probabilities of co-occurrence can be calculated by multiplying individual probabilities:&amp;lt;ref&amp;gt;Manning, Christopher D., Prabhakar Raghavan, and Hinrich Schutze. &#039;&#039;Introduction to Information Retrieval&#039;&#039;. Cambridge: Cambridge UP, 2008. &#039;&#039;Chi2 Feature Selection&#039;&#039;. Stanford Natural Language Processing Group. Web. 25 Nov. 2009. &amp;lt;http://nlp.stanford.edu/IR-book/html/htmledition/feature-selectionchi2-feature-selection-1.html&amp;gt;.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;E&amp;lt;sub&amp;gt;1,0&amp;lt;/sub&amp;gt; = N * P(C = 1) * P(T = 0)&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
where N is the total number of documents in the collection.&lt;br /&gt;
&lt;br /&gt;
====Example====&lt;br /&gt;
Using the same data for the mutual information example, we can calculate the expected probabilities and plug them into the equation to calculate the chi-squared statistic:&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!&lt;br /&gt;
! Documents in &amp;quot;commerce&amp;quot;&lt;br /&gt;
! Documents not in &amp;quot;commerce&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
| Documents containing &amp;quot;tariff&amp;quot;&lt;br /&gt;
| 60&lt;br /&gt;
| 10,000&lt;br /&gt;
|-&lt;br /&gt;
| Documents not containing &amp;quot;tariff&amp;quot;&lt;br /&gt;
| 200&lt;br /&gt;
| 500,000&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
P (C = 1) = (# of documents in the cluster) / (total number of documents) = (60 + 200) / 510,260 = 260/510,260&lt;br /&gt;
&lt;br /&gt;
P (C = 0) = (# of documents not in the cluster) / (total number of documents) = (10,000 + 500,000) / 510,260 = 510,000/510,260&lt;br /&gt;
&lt;br /&gt;
P (T = 1) = (# of documents containing the term) / (total number of documents) = (60 + 10,000) / 510,260 = 10,060/510,260&lt;br /&gt;
&lt;br /&gt;
P (T = 0) = (# of documents not containing the term) / (total number of documents) = (200 + 500,000) / 510,260 = 500,200/510,260&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;E_{0, 0} = 510,260 * \frac {510,000}{510,260} * \frac {500,200}{510,260}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
= 499,945&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;E_{0,1}= 510,260 * \frac {510,000}{510,260} * \frac {10.060}{510,260}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
= 10,055&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;E_{1,0}= 510,260 * \frac {260}{510,260} * \frac {500,200}{510,260}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
= 254.9&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;E_{1,1}= 510,260 * \frac {260}{510,260} * \frac {10,060}{510,260}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
= 5.13&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;X^2 = \frac{(500,000 - 499,945)^2}{499,945} + \frac{(10,000 - 10,055)^2}{10,055} + \frac{(200 - 254.9)^2}{254.9} + \frac{(60 - 5.13)^2}{5.13}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
= 599&lt;br /&gt;
&lt;br /&gt;
Since each variable can have two values, the number of [[degrees of freedom (statistics)|degrees of freedom]] is (2 - 1)(2 - 1) = 1.  The [[chi-squared distribution]] for one degree of freedom states that the probability of observing a value greater than 10.83 is less than 0.001.  Therefore, we can reject the [[null hypothesis]], which states that the two events are independent.  Since the term &amp;quot;tariff&amp;quot; and the cluster &amp;quot;commerce&amp;quot; are dependent, we can assume that the term is a good label for the cluster.&lt;br /&gt;
&lt;br /&gt;
==Cluster-Internal Labeling==&lt;br /&gt;
Cluster-internal labeling selects labels that only depend on the contents of the cluster of interest. No comparison is made with the other clusters.&lt;br /&gt;
Cluster-internal labeling can use a variety of methods, such as finding terms that occur frequently in the centroid or finding the document that lies closest to the centroid.&lt;br /&gt;
&lt;br /&gt;
===Centroid Labels===&lt;br /&gt;
{{Main|Vector space model}}&lt;br /&gt;
A frequently used model in the field of [[information retrieval]] is the vector space model, which represents documents as vectors.  The entries in the vector correspond to terms in the [[vocabulary]]. Binary vectors have a value of 1 if the term is present within a particular document and 0 if it is absent. Many vectors make use of weights that reflect the importance of a term in a document, and/or the importance of the term in a document collection. For a particular cluster of documents, we can calculate the [[centroid]] by finding the [[arithmetic mean]] of all the document vectors.  If an entry in the centroid vector has a high value, then the corresponding term occurs frequently within the cluster.  These terms can be used as a label for the cluster.&lt;br /&gt;
One downside to using centroid labeling is that it can pick up words like &amp;quot;place&amp;quot; and &amp;quot;word&amp;quot; that have a high frequency in written text, but have little relevance to the contents of the particular cluster.&lt;br /&gt;
&lt;br /&gt;
===Title Labels===&lt;br /&gt;
An alternative to centroid labeling is title labeling.  Here, we find the document within the cluster that has the smallest [[Euclidean distance]] to the centroid, and use its title as a label for the cluster.  One advantage to using document titles is that they provide additional information that would not be present in a list of terms.  However, they also have the potential to mislead the user, since one document might not be representative of the entire cluster.&lt;br /&gt;
&lt;br /&gt;
===External knowledge labels===&lt;br /&gt;
Cluster labeling can be done indirectly using external knowledge such as pre-categorized knowledge such as the one of Wikipedia.&amp;lt;ref&amp;gt;David Carmel, Haggai Roitman, Naama Zwerdling. [http://portal.acm.org/citation.cfm?doid=1571941.1571967 Enhancing cluster labeling using wikipedia.] SIGIR 2009: 139-146&amp;lt;/ref&amp;gt; In such methods, a set of important cluster text features are first extracted from the cluster documents. These features then can be used to retrieve the (weighted) K-nearest categorized documents from which candidates for cluster labels can be extracted. The final step involves the ranking of such candidates. Suitable methods are such that are based on a voting or a fusion process which is determined using the set of categorized documents and the original cluster features.&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
* [http://nlp.stanford.edu/IR-book/html/htmledition/hierarchical-clustering-1.html Hierarchical Clustering]&lt;br /&gt;
* [http://erulemaking.ucsur.pitt.edu/doc/papers/dgo06-labeling.pdf Automatically Labeling Hierarchical Clusters]&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{{DEFAULTSORT:Cluster Labeling}}&lt;br /&gt;
[[Category:Information retrieval]]&lt;/div&gt;</summary>
		<author><name>62.227.224.100</name></author>
	</entry>
</feed>