Depth of noncommutative subrings: Difference between revisions
en>ShelfSkewed m fix isbn, rm notice |
en>Jackfork m Reverted 1 edit by 2001:690:2200:c401::133 (talk) to last revision by 2001:690:2200:C401:0:0:0:133. (TW) |
||
| Line 1: | Line 1: | ||
{{Context|date=June 2010}} | |||
'''M-trees''' are [[tree data structure]]s that are similar to [[R-tree]]s and [[B-tree]]s. It is constructed using a [[Metric (mathematics)|metric]] and relies on the [[triangle inequality]] for efficient range and [[k-nearest neighbor algorithm|k-NN]] queries. | |||
While M-trees can perform well in many conditions, the tree can also have large overlap and there is no clear strategy on how to best avoid overlap. In addition, it can only be used for [[distance function]]s that satisfy the triangle inequality, while many advanced dissimilarity functions used in [[information retrieval]] do not satisfy this.<ref name="p426">{{cite conference | |||
| first = Paolo | |||
| last = Ciaccia | |||
| authorlink = | |||
| coauthors = Patella, Marco; Zezula, Pavel | |||
| title = M-tree An Efficient Access Method for Similarity Search in Metric Spaces | |||
| booktitle = Proceedings of the 23rd VLDB Conference Athens, Greece, 1997 | |||
| pages = 426–435 | |||
| publisher = Very Large Databases Endowment Inc. | |||
| year = 1997 | |||
| location = IBM Almaden Research Center | |||
| url = http://www.vldb.org/conf/1997/P426.PDF | |||
| accessdate = 2010-09-07 | |||
| id = p426 | |||
}}</ref> | |||
==Overview== | |||
[[File:Mtree-2d.svg|thumb|350px|2D M-Tree visualized using [[Environment for DeveLoping KDD-Applications Supported by Index-Structures|ELKI]]. The tree has a single level of leaf nodes. Due to a suboptimal split heuristic, there is a large overlap.]] | |||
As in any Tree-based data structure, the M-Tree is composed of Nodes and Leaves. In each node there is a data object that identifies it uniquely and a pointer to a sub-tree where its children reside. Every leaf has several data objects. For each node there is a radius '''''r''''' that defines a Ball in the desired metric space. Thus, every node <math>n</math> and leaf <math>l</math> residing in a particular node <math>N</math> is at most distance <math>r</math> from <math>N</math>, and every node ''n'' and leaf ''l'' with node parent ''N'' keep the distance from it. | |||
==M-Tree construction== | |||
=== Components === | |||
An M-Tree has these components and sub-components: | |||
# Non-leaf nodes | |||
## A set of routing objects N<sub>''RO''</sub>. | |||
## Pointer to Node's parent object O<sub>''p''</sub>. | |||
# Leaf nodes | |||
## A set of objects N<sub>''O''</sub>. | |||
## Pointer to Node's parent object O<sub>''p''</sub>. | |||
# Routing Object | |||
## (Feature value of) routing object O<sub>''r''</sub>. | |||
## Covering radius r(O<sub>''r''</sub>). | |||
## Pointer to covering tree T(O<sub>''r''</sub>). | |||
## Distance of O<sub>''r''</sub> from its parent object d(O<sub>''r''</sub>,P(O<sub>''r''</sub>)) | |||
# Object | |||
## (Feature value of the) object O<sub>''j''</sub>. | |||
## Object identifier oid(O<sub>''j''</sub>). | |||
## Distance of O<sub>''j''</sub> from its parent object d(O<sub>''j''</sub>,P(O<sub>''j''</sub>)) | |||
=== Insert === | |||
The main idea is first to find a leaf node <math>N</math> where the new object <math>O</math> belongs. If <math>N</math> is not full then just attach it to <math>N</math>. If <math>N</math> is full then invoke a method to split <math>N</math>. The algorithm is as follows: | |||
{{algorithm-begin|name=Insert}} | |||
Input: Node <math>N</math> of M-Tree <math>MT</math>, Entry <math>O_{n}</math> | |||
Output: A new instance of <math>MT</math> containing all entries in original <math>MT</math> plus <math>O_{n}</math> | |||
<math>N_{e}</math> ← <math>N</math>'s routing objects or objects | |||
'''if''' <math>N</math> is not a leaf '''then''' | |||
{ | |||
/*Look for entries that the new object fits into*/ | |||
let <math>N_{in}</math> be routing objects from <math>N_{e}</math>'s set of routing objects <math>N_{RO}</math> such that <math>d(O_{r}, O_{n}) <= r(O_{r})</math> | |||
'''if''' <math>N_{in}</math> is not empty '''then''' | |||
{ | |||
/*If there are one or more entry, then look for an entry such that is closer to the new object*/ | |||
<math>O_{r}^{*} = \min_{O_{r}\in N_{in}} d(O_{r}, O_{n})</math> | |||
} | |||
'''else''' | |||
{ | |||
/*If there are no such entry, then look for an object with minimal distance from */ | |||
/*its covering radius's edge to the new object*/ | |||
<math>O_{r}^{*} = \min_{O_{r}\in N_{in}} d(O_{r}, O_{n}) - r(O_{r})</math> | |||
/*Upgrade the new radii of the entry*/ | |||
<math>r(O_{r}^{*})</math> = <math>d(O_{r}^{*}, O_{n})</math> | |||
} | |||
/*Continue inserting in the next level*/ | |||
return insert(<math>T(O_{r}^{*})</math>, <math>O_{n}</math>); | |||
'''else''' | |||
{ | |||
/*If the node has capacity then just insert the new object*/ | |||
'''if''' <math>N</math> is not full '''then''' | |||
{ store(<math>N</math>, <math>O_{n}</math>) } | |||
/*The node is at full capacity, then it is needed to do a new split in this level*/ | |||
'''else''' | |||
{ split(<math>N</math>, <math>O_{n}</math>) } | |||
} | |||
{{algorithm-end}} | |||
=== Split === | |||
If the split method arrives to the root of the tree, then it choose two routing objects from <math>N</math>, and creates two new nodes containing all the objects in original <math>N</math>, and store them into the new root. If split methods arrives to a node <math>N</math> that is not the root of the tree, the method choose two new routing objects from <math>N</math>, re-arrange every routing object in <math>N</math> in two new nodes <math>N_{1}</math> and <math>N_{2}</math>, and store this new nodes in the parent node <math>N_{p}</math> of original <math>N</math>. The split must be repeated if <math>N_{p}</math> has not enough capacity to store <math>N_{2}</math>. The algorithm is as follow: | |||
{{algorithm-begin|name=Split}} | |||
Input: Node <math>N</math> of M-Tree <math>MT</math>, Entry <math>O_{n}</math> | |||
Output: A new instance of <math>MT</math> containing a new partition. | |||
/*The new routing objects are now all those in the node plus the new routing object*/ | |||
let be <math>NN</math> entries of <math>N \cup O</math> | |||
'''if''' <math>N</math> is not the root '''then''' | |||
{ | |||
/*Get the parent node and the parent routing object*/ | |||
let <math>O_{p}</math> be the parent routing object of <math>N</math> | |||
let <math>N_{p}</math> be the parent node of <math>N</math> | |||
} | |||
/*This node will contain part of the objects of the node to be split*/ | |||
Create a new node <math>N'</math> | |||
/*Promote two routing objects from the node to be split, to be new routing objects*/ | |||
Create new objects <math>O_{p1}</math> and <math>O_{p2}</math>. | |||
Promote(<math>N</math>, <math>O_{p1}</math>, <math>O_{p2}</math>) | |||
/*Choose which objects from the node being split will act as new routing objects*/ | |||
Partition(<math>N</math>, <math>O_{p1}</math>, <math>O_{p2}</math>, <math>N_{1}</math>, <math>N_{2}</math>) | |||
/*Store entries in each new routing object*/ | |||
Store <math>N_{1}</math>'s entries in <math>N</math> and <math>N_{2}</math>'s entries in <math>N'</math> | |||
'''if''' <math>N</math> is the current root '''then''' | |||
{ | |||
/*Create a new node and set it as new root and store the new routing objects*/ | |||
Create a new root node <math>N_{p}</math> | |||
Store <math>O_{p1}</math> and <math>O_{p2}</math> in <math>N_{p}</math> | |||
} | |||
'''else''' | |||
{ | |||
/*Now use the parent rouing object to store one of the new objects*/ | |||
Replace entry <math>O_{p}</math> with entry <math>O_{p1}</math> in <math>N_{p}</math> | |||
'''if''' <math>N_{p}</math> is no full '''then''' | |||
{ | |||
/*The second routinb object is stored in the parent only if it has free capacity*/ | |||
Store <math>O_{p2}</math> in <math>N_{p}</math> | |||
} | |||
'''else''' | |||
{ | |||
/*If there is no free capacity then split the level up*/ | |||
split(<math>N_{p}</math>, <math>O_{p2}</math>) | |||
} | |||
} | |||
{{algorithm-end}} | |||
== M-Tree Queries == | |||
=== Range Query === | |||
A range query is where a minimum similarity/maximum distance value is specified. | |||
For a given query object Q ∈ D and a maximum search distance | |||
r(Q), the range query '''range'''(Q, r(Q)) selects all the indexed objects Oj such that d(Oj, Q) ≤ r(Q).<ref name="Univ Bologna Range">{{cite web|title=Indexing Metric Spaces with M-tree|url=http://www-db.deis.unibo.it/research/papers/SEBD97.pdf|work=Department of Computer Science and Engineering|publisher=University of Bologna|accessdate=19 November 2013|author=P. Ciaccia, M. Patella, F. Rabitti, P. Zezula|page=3}}</ref> | |||
Algorithm RangeSearch starts from the root node and recursively traverses all the paths which cannot be excluded from leading to qualifying objects. | |||
{{algorithm-begin|name=Insert}} | |||
Input: Node <math>N</math> of M-Tree <math>MT</math>, <math>Q</math>: query object, <math>r(Q)</math>: search radius | |||
Output: all the DB objects such that <math>d(Oj, Q)</math> ≤ <math>r(Q)</math> | |||
{ let <math>O_{p}</math> be the parent object of node <math>N</math>; | |||
'''if''' <math>N</math> is not a leaf | |||
'''then''' { for each <math>entry(O_{r})</math> in N do: | |||
'''if''' | <math>d(O_{p}, Q)</math> − <math>d(O_{r}, O_{p})</math> | ≤ <math>r(Q) +r(O_{r})</math> | |||
'''then''' { Compute <math>d(O_{r}, Q)</math>; | |||
'''if''' <math>d(O_{r}, Q)</math> ≤ <math>r(Q) +r(O_{r})</math> | |||
'''then''' <math>RangeSearch(*ptr(T(O_{r})),Q,r(Q))</math>; }} | |||
'''else''' { for each <math>entry(O_{j})</math> in <math>N</math> do: | |||
'''if''' | <math>d(O_{p}, Q)</math> − <math>d(O_{j}, O_{p})</math> | ≤ <math>r(Q)</math> | |||
'''then''' { Compute <math>d(O_{j}, Q)</math>; | |||
'''if''' <math>d(O_{j}, Q)</math> ≤ <math>r(Q)</math> | |||
'''then''' add <math>oid(O_{j})</math> to the result; }}} | |||
{{algorithm-end}} | |||
<math>oid(O_{j})</math> is the identifier of the object which resides on a separate data file. | |||
<math>T(O_{r})</math> is a sub-tree – the covering tree of <math>O_{r}</math> | |||
=== k-NN queries === | |||
K Nearest Neighbor (k-NN) query takes the cardinality of the input set as an input perimeter. For a given query object Q ∈ D and an | |||
integer k ≥ 1, the k-NN query NN(Q, k) selects the k indexed objects which have the shortest distance from Q, according to the distance function d. | |||
<ref name="Univ Bologna Range"/> | |||
{{Empty section|date=January 2011}} | |||
==See also== | |||
* [[Segment tree]] | |||
* [[Interval tree]] - A degenerate R-Tree for 1 dimension (usually time). | |||
* [[Bounding volume hierarchy]] | |||
* [[Spatial index]] | |||
* [[GiST]] | |||
==References== | |||
{{reflist}} | |||
{{CS-Trees}} | |||
{{DEFAULTSORT:M-Tree}} | |||
[[Category:Trees (data structures)]] | |||
[[Category:Database index techniques]] | |||
[[Category:Geometric data structures]] | |||
Revision as of 15:10, 11 November 2013
My name is Jestine (34 years old) and my hobbies are Origami and Microscopy.
Here is my web site; http://Www.hostgator1centcoupon.info/ (support.file1.com)
M-trees are tree data structures that are similar to R-trees and B-trees. It is constructed using a metric and relies on the triangle inequality for efficient range and k-NN queries. While M-trees can perform well in many conditions, the tree can also have large overlap and there is no clear strategy on how to best avoid overlap. In addition, it can only be used for distance functions that satisfy the triangle inequality, while many advanced dissimilarity functions used in information retrieval do not satisfy this.[1]
Overview
As in any Tree-based data structure, the M-Tree is composed of Nodes and Leaves. In each node there is a data object that identifies it uniquely and a pointer to a sub-tree where its children reside. Every leaf has several data objects. For each node there is a radius r that defines a Ball in the desired metric space. Thus, every node and leaf residing in a particular node is at most distance from , and every node n and leaf l with node parent N keep the distance from it.
M-Tree construction
Components
An M-Tree has these components and sub-components:
- Non-leaf nodes
- A set of routing objects NRO.
- Pointer to Node's parent object Op.
- Leaf nodes
- A set of objects NO.
- Pointer to Node's parent object Op.
- Routing Object
- (Feature value of) routing object Or.
- Covering radius r(Or).
- Pointer to covering tree T(Or).
- Distance of Or from its parent object d(Or,P(Or))
- Object
- (Feature value of the) object Oj.
- Object identifier oid(Oj).
- Distance of Oj from its parent object d(Oj,P(Oj))
Insert
The main idea is first to find a leaf node where the new object belongs. If is not full then just attach it to . If is full then invoke a method to split . The algorithm is as follows:
Input: Node of M-Tree , Entry Output: A new instance of containing all entries in original plus
← 's routing objects or objects if is not a leaf then { /*Look for entries that the new object fits into*/ let be routing objects from 's set of routing objects such that if is not empty then { /*If there are one or more entry, then look for an entry such that is closer to the new object*/ } else { /*If there are no such entry, then look for an object with minimal distance from */ /*its covering radius's edge to the new object*/ /*Upgrade the new radii of the entry*/ = } /*Continue inserting in the next level*/ return insert(, ); else { /*If the node has capacity then just insert the new object*/ if is not full then { store(, ) } /*The node is at full capacity, then it is needed to do a new split in this level*/ else { split(, ) } }
Split
If the split method arrives to the root of the tree, then it choose two routing objects from , and creates two new nodes containing all the objects in original , and store them into the new root. If split methods arrives to a node that is not the root of the tree, the method choose two new routing objects from , re-arrange every routing object in in two new nodes and , and store this new nodes in the parent node of original . The split must be repeated if has not enough capacity to store . The algorithm is as follow:
Input: Node of M-Tree , Entry Output: A new instance of containing a new partition.
/*The new routing objects are now all those in the node plus the new routing object*/ let be entries of if is not the root then { /*Get the parent node and the parent routing object*/ let be the parent routing object of let be the parent node of } /*This node will contain part of the objects of the node to be split*/ Create a new node /*Promote two routing objects from the node to be split, to be new routing objects*/ Create new objects and . Promote(, , ) /*Choose which objects from the node being split will act as new routing objects*/ Partition(, , , , ) /*Store entries in each new routing object*/ Store 's entries in and 's entries in if is the current root then { /*Create a new node and set it as new root and store the new routing objects*/ Create a new root node Store and in } else { /*Now use the parent rouing object to store one of the new objects*/ Replace entry with entry in if is no full then { /*The second routinb object is stored in the parent only if it has free capacity*/ Store in } else { /*If there is no free capacity then split the level up*/ split(, ) } }
M-Tree Queries
Range Query
A range query is where a minimum similarity/maximum distance value is specified. For a given query object Q ∈ D and a maximum search distance r(Q), the range query range(Q, r(Q)) selects all the indexed objects Oj such that d(Oj, Q) ≤ r(Q).[2]
Algorithm RangeSearch starts from the root node and recursively traverses all the paths which cannot be excluded from leading to qualifying objects. Template:Algorithm-begin Input: Node of M-Tree , : query object, : search radius
Output: all the DB objects such that ≤
{ let be the parent object of node ;
if is not a leaf then { for each in N do:
if | − | ≤ then { Compute ; if ≤ then ; }}
if | − | ≤ then { Compute ; if ≤ then add to the result; }}}
is the identifier of the object which resides on a separate data file.
is a sub-tree – the covering tree of
k-NN queries
K Nearest Neighbor (k-NN) query takes the cardinality of the input set as an input perimeter. For a given query object Q ∈ D and an integer k ≥ 1, the k-NN query NN(Q, k) selects the k indexed objects which have the shortest distance from Q, according to the distance function d. [2] Template:Empty section
See also
- Segment tree
- Interval tree - A degenerate R-Tree for 1 dimension (usually time).
- Bounding volume hierarchy
- Spatial index
- GiST
References
43 year old Petroleum Engineer Harry from Deep River, usually spends time with hobbies and interests like renting movies, property developers in singapore new condominium and vehicle racing. Constantly enjoys going to destinations like Camino Real de Tierra Adentro.
- ↑ 55 years old Systems Administrator Antony from Clarence Creek, really loves learning, PC Software and aerobics. Likes to travel and was inspired after making a journey to Historic Ensemble of the Potala Palace.
You can view that web-site... ccleaner free download - ↑ 2.0 2.1 Template:Cite web