Multiclass classification

From formulasearchengine
Revision as of 13:57, 28 October 2013 by en>Mcld (not to be confused with)

Controlled grammars[1] are a class of grammars that extend, usually, the context-free grammars with additional controls on the derivations of a sentence in the language. A number of different kinds of controlled grammars exist, the four main divisions being indexed grammars, grammars with prescribed derivation sequences, grammars with contextual conditions on rule application, and grammars with parallelism in rule application. Because indexed grammars are so well established in the field, this article addresses only the latter three kinds of controlled grammars.

Control by prescribed sequences

Grammars with prescribed sequences are grammars in which the sequence of rule application is constrained in some way. There are four different versions of prescribed sequence grammars: language controlled grammars (often called just controlled grammars), matrix grammars, vector grammars, and programmed grammars.

In the standard context-free grammar formalism, a grammar itself is viewed as a 4-tuple, $G = (N, T, S, P)$, where $N$ is a set of non-terminal/phrasal symbols, $T$ is a disjoint set of terminal/word symbols, $S$ is a specially designated start symbol chosen from $N$, and $P$ is a set of production rules like $X \to \alpha$, where $X$ is some member of $N$, and $\alpha$ is some member of $(N \cup T)^*$.

Productions over such a grammar are sequences of rules in $P$ that, when applied in order of the sequence, lead to a terminal string. That is, one can view the set of imaginable derivations in $G$ as the set $\{p_1 p_2 \ldots p_n : n \geq 0\}$, and the language of $G$ as being the set of terminal strings $L(G) = \{w \in T^* : S \Rightarrow_{p_1} \cdots \Rightarrow_{p_n} w\}$. Control grammars take seriously this definition of the language generated by a grammar, concretizing the set-of-derivations as an aspect of the grammar. Thus, a prescribed sequence controlled grammar is at least approximately a 5-tuple $G = (N, T, S, P, R)$ where everything except $R$ is the same as in a CFG, and $R$ is an infinite set of valid derivation sequences $p_1 p_2 \ldots p_n$.

The set R, due to its infinitude, is almost always (though not necessarily) described via some more convenient mechanism, such as a grammar (as in language controlled grammars), or a set of matrices or vectors (as in matrix and vector grammars). The different variations of prescribed sequence grammars thus differ by how the sequence of derivations is defined on top of the context-free base. Because matrix grammars and vector grammars are essentially special cases of language controlled grammars, examples of the former two will not be provided below.

Language controlled grammars

Language controlled grammars are grammars in which the production sequences constitute a well-defined language of arbitrary nature, usually though not necessarily regular, over a set of (again usually though not necessarily) context-free production rules. They also often have a sixth component in the grammar tuple, making it $G = (N, T, S, P, R, F)$, where $F$ is a set of productions that are allowed to apply vacuously. This version of language controlled grammars, the one with what is called "appearance checking", is the one assumed henceforth.

Proof-theoretic description

We let a regularly controlled context-free grammar with appearance checking be a 6-tuple $G = (N, T, S, P, R, F)$ where $N$, $T$, $S$, and $P$ are defined as in CFGs, $R$ is a subset of $P^*$ constituting a regular language over $P$, and $F$ is some subset of $P$. We then define the immediately derives relation $\Rightarrow^{ac}_{p}$ as follows:

Given some strings $x$ and $y$, both in $(N \cup T)^*$, and some rule $p = A \to w \in P$,

$x \Rightarrow^{ac}_{p} y$

holds if either

$x = x_1 A x_2$ and $y = x_1 w x_2$, or
$x = y$, $A$ does not occur in $x$, and $p \in F$

Intuitively, this simply spells out that a rule can apply to a string if the rule's left-hand side appears in that string, or, when that left-hand side is absent, if the rule is in the set of "vacuously applicable" rules, which can "apply" to a string without changing anything. The requirement that a rule outside $F$ must actually rewrite something is the appearance checking aspect of such a grammar. The language for this kind of grammar is then simply the set of terminal strings $L(G) = \{w \in T^* : S \Rightarrow^{ac}_{p_1} w_1 \Rightarrow^{ac}_{p_2} w_2 \Rightarrow^{ac}_{p_3} \cdots \Rightarrow^{ac}_{p_n} w, \text{ for some } p_1 p_2 \ldots p_n \in R\}$.

Example

Let's consider a simple (though not the simplest) context-free grammar that generates the language $\{a^n : n \geq 1\}$:

Let $G = (\{S, A, X\}, \{a\}, S, \{f, g, h, k, l\})$, where

$f: S \to AA$
$g: S \to X$
$h: A \to S$
$k: A \to X$
$l: S \to a$

In language controlled form, this grammar is simply $G = (\{S, A, X\}, \{a\}, S, \{f, g, h, k, l\}, (f|g|h|k|l)^*, \{f, g, h, k, l\})$ (where $(f|g|h|k|l)^*$ is a regular expression denoting the set of all sequences of production rules). A simple modification to this grammar, changing its control sequence set $R$ into the set $(f^* g h^* k)^* l^*$, and changing its vacuous rule set $F$ to $\{g, k\}$, yields a grammar which generates the non-CF language $\{a^{2^n} : n \geq 0\}$. To see how, let's consider the general case of some string with $n$ instances of $S$ in it, i.e. $S^n$ (the special case $S^1$ trivially derives the string $a$, which is $a^{2^0}$, an uninteresting fact).

If we choose some arbitrary production sequence $f^u g h^v k \ldots$, we can consider three possibilities: $n = u$, $n < u$, and $n > u$. When $n = u$, we rewrite all $n$ instances of $S$ as $AA$ by applying rule $f$ to the string $u$ times, and proceed to apply $g$, which applies vacuously (by virtue of being in $F$). When $n < u$, we rewrite all $n$ instances of $S$ as $AA$ and then try to perform the $(n+1)$-th rewrite using rule $f$, but this fails because there are no more $S$s to rewrite, and $f$ is not in $F$ and so cannot apply vacuously; thus when $n < u$, the derivation fails. Lastly, when $n > u$, we rewrite $u$ instances of $S$, leaving at least one instance of $S$ to be rewritten by the subsequent application of $g$, rewriting $S$ as $X$. Given that no rule of this grammar ever rewrites $X$, such a derivation is destined to never produce a terminal string. Thus only derivations with $n = u$ will ever successfully rewrite the string $S^n$. Similar reasoning holds of the number of $A$s and $v$. In general, then, we can say that only derivations with the structure $S^n \Rightarrow_{f \ldots f} A^{2n} \Rightarrow_{g} A^{2n} \Rightarrow_{h \ldots h} S^{2n} \Rightarrow_{k} S^{2n}$ will produce terminal strings of the grammar. The $X$ rules, combined with the structure of the control, essentially force all $S$s to be rewritten as $AA$s prior to any $A$s being rewritten as $S$s, which again is forced to happen prior to all still later iterations over the $S$-to-$AA$ cycle. Finally, the $S$s are rewritten as $a$s. In this way, the number of $S$s doubles for each instantiation of $f^* g h^* k$ that appears in a terminal-deriving sequence.

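Choosing two control sequences that fail to derive a terminal string, and one that succeeds, we can see this at work:

Let $s_1 = ffghkll$, then we get the failed derivation:

$S \Rightarrow^{ac}_{f} AA \Rightarrow^{ac}_{f}$ failure: $f$ cannot apply, no $S$ to rewrite

Let $s_2 = fghhhkll$, then we get the failed derivation:

$S \Rightarrow^{ac}_{f} AA \Rightarrow^{ac}_{g} AA \Rightarrow^{ac}_{h} SA \Rightarrow^{ac}_{h} SS \Rightarrow^{ac}_{h}$ failure: $h$ cannot apply, no $A$ to rewrite

Let $s_3 = fghhkll$, then we get the successful derivation:

$S \Rightarrow^{ac}_{f} AA \Rightarrow^{ac}_{g} AA \Rightarrow^{ac}_{h} SA \Rightarrow^{ac}_{h} SS \Rightarrow^{ac}_{k} SS \Rightarrow^{ac}_{l} aS \Rightarrow^{ac}_{l} aa$

Similar derivations with a second cycle of $f^* g h^* k$ produce only $SSSS$. Showing only the (continued) successful derivation:

$\ldots SS \Rightarrow^{ac}_{f} AAS \Rightarrow^{ac}_{f} AAAA \Rightarrow^{ac}_{g} AAAA$
$\Rightarrow^{ac}_{h} SAAA \Rightarrow^{ac}_{h} SSAA \Rightarrow^{ac}_{h} SSSA \Rightarrow^{ac}_{h} SSSS \Rightarrow^{ac}_{k} SSSS$
$\Rightarrow^{ac}_{l} aSSS \Rightarrow^{ac}_{l} aaSS \Rightarrow^{ac}_{l} aaaS \Rightarrow^{ac}_{l} aaaa$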
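The derivations above can be replayed mechanically. The following is a minimal Python sketch, not a reference implementation: the names `RULES`, `VACUOUS`, `CONTROL` and `derive` are hypothetical, and the sketch assumes that a rule always rewrites the leftmost occurrence of its left-hand side (which occurrence is chosen does not affect the outcome for this particular grammar).

```python
import re

# Rules of the example grammar: label -> (left-hand side, right-hand side).
RULES = {
    "f": ("S", "AA"),
    "g": ("S", "X"),
    "h": ("A", "S"),
    "k": ("A", "X"),
    "l": ("S", "a"),
}
# F of the modified grammar: rules that may apply vacuously.
VACUOUS = {"g", "k"}
# R of the modified grammar, written as a regular expression over rule labels.
CONTROL = re.compile(r"(f*gh*k)*l*")


def derive(control, start="S"):
    """Replay the control word; return the final string, or None on failure.

    Assumption of this sketch: the leftmost occurrence is rewritten.
    """
    if not CONTROL.fullmatch(control):
        return None  # the sequence is not in R
    string = start
    for label in control:
        lhs, rhs = RULES[label]
        if lhs in string:
            string = string.replace(lhs, rhs, 1)
        elif label not in VACUOUS:
            return None  # rule cannot rewrite and may not be skipped
    return string


print(derive("ffghkll"))   # None (second f finds no S)
print(derive("fghhhkll"))  # None (third h finds no A)
print(derive("fghhkll"))   # aa
```

Note that `derive` returns whatever string is left at the end of the control word; a caller interested only in the language would additionally check that the result contains no non-terminals.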

Matrix grammars

Matrix grammars (expanded on in their own article) are a special case of regular controlled context-free grammars, in which the production sequence language is of the form $(m_1 | m_2 | \ldots | m_n)^*$, where each "matrix" $m_i$ is a single sequence. For convenience, such a grammar is not represented with a grammar over $P$, but rather with just a set of the matrices in place of both the language and the production rules. Thus, a matrix grammar is the 5-tuple $G = (N, T, M, S, F)$, where $N$, $T$, $S$, and $F$ are defined essentially as previously done (with $F$ a subset of $M$ this time), and $M$ is a set of matrices $m_i = p_{i,1} p_{i,2} \ldots p_{i,n_i}$ where each $p_{i,j}$ is a context-free production rule.

The derives relation in a matrix grammar is thus defined simply as:

Given some strings $x$ and $y$, both in $(N \cup T)^*$, and some matrix $m = p_1 p_2 \ldots p_n \in M$,

$x \Rightarrow^{ac}_{m} y$

holds if either

$x = x_1 A x_2$, $y = x_1 w x_2$, and $A \Rightarrow^{ac}_{p_1} w_1 \Rightarrow^{ac}_{p_2} w_2 \Rightarrow^{ac}_{p_3} \cdots \Rightarrow^{ac}_{p_n} w$, or
$x = y$ and $m \in F$

Informally, a matrix grammar is simply a grammar in which during each rewriting cycle, a particular sequence of rewrite operations must be performed, rather than just a single rewrite operation, i.e. one rule "triggers" a cascade of other rules. Similar phenomena can be performed in the standard context-sensitive idiom, as done in rule-based phonology and earlier Transformational grammar, by what are known as "feeding" rules, which alter a derivation in such a way as to provide the environment for a non-optional rule that immediately follows it.

Vector grammars

Vector grammars are closely related to matrix grammars, and in fact can be seen as a special class of matrix grammars, in which if $m \in M$, then so are all of its permutations $p(m)$. For convenience, however, we will define vector grammars as follows: a vector grammar is a 5-tuple $G = (N, T, M, S, F)$, where $N$, $T$, $S$, and $F$ are defined as previously ($F$ being a subset of $M$ again), and where $M$ is a set of vectors $m_i = \{p_1, p_2, \ldots, p_n\}$, each vector being a set of context-free rules.

The derives relation in a vector grammar is then:

Given some strings $x$ and $y$, both in $(N \cup T)^*$, and some vector $m = \{p_1, p_2, \ldots, p_n\} \in M$,

$x \Rightarrow^{ac}_{m} y$

holds if either

$x = x_1 A x_2$, $y = x_1 w x_2$, and $A \Rightarrow^{ac}_{p_{i_1}} w_1 \Rightarrow^{ac}_{p_{i_2}} w_2 \Rightarrow^{ac}_{p_{i_3}} \cdots \Rightarrow^{ac}_{p_{i_n}} w$, where $m = \{p_{i_1}, p_{i_2}, \ldots, p_{i_n}\}$, or
$x = y$ and $m \in F$

Notice that the number of production rules used in the derivation sequence, n, is the same as the number of production rules in the vector. Informally, then, a vector grammar is one in which a set of productions is applied, each production applied exactly once, in arbitrary order, to derive one string from another. Thus vector grammars are almost identical to matrix grammars, minus the restriction on the order in which the productions must occur during each cycle of rule application.

Programmed grammars

Programmed grammars are relatively simple extensions to context-free grammars with rule-by-rule control of the derivation. A programmed grammar is a 4-tuple $G = (N, T, S, P)$, where $N$, $T$, and $S$ are as in a context-free grammar, and $P$ is a set of tuples $(p, \sigma, \phi)$, where $p$ is a context-free production rule, $\sigma$ is a subset of $P$ (called the success field), and $\phi$ is a subset of $P$ (called the failure field). If the failure field of every rule in $P$ is empty, the grammar lacks appearance checking, and if at least one failure field is not empty, the grammar has appearance checking. The derivation relation on a programmed grammar is defined as follows:

Given two strings $x, y \in (N \cup T)^*$, and some rule $p = (A \to w, \sigma, \phi) \in P$,

$x \Rightarrow_{p} y$ holds if either $x = x' A x''$ and $y = x' w x''$, or
$x = y$ and $A$ does not appear in $x$.

The language of a programmed grammar $G$ is defined by constraining the derivation rule-wise, as $L(G) = \{w \in T^* : S \Rightarrow_{p_1} w_1 \Rightarrow_{p_2} \cdots \Rightarrow_{p_n} w\}$, where, writing $w_0 = S$, for each $p_i = (A_i \to v_i, \sigma_i, \phi_i)$, either $w_{i-1} = x_{i-1} A_i x'_{i-1}$, $w_i = x_{i-1} v_i x'_{i-1}$, and $p_{i+1} \in \sigma_i$, or $w_{i-1} = w_i$ and $p_{i+1} \in \phi_i$.

Intuitively, when applying a rule $p$ in a programmed grammar, the rule can either succeed at rewriting a symbol in the string, in which case the subsequent rule must be in $p$'s success field, or the rule can fail to rewrite a symbol (thus applying vacuously), in which case the subsequent rule must be in $p$'s failure field. The choice of which rule to apply to the start string is arbitrary, unlike in a language controlled grammar, but once a choice is made the rules that can be applied after it constrain the sequence of rules from that point on.

Example

As with so many controlled grammars, programmed grammars can generate the language $\{a^{2^n} : n \geq 0\}$:

Let $G = (\{S, A\}, \{a\}, S, \{r_1, r_2, r_3\})$, where

$r_1 = (S \to AA, \{r_1\}, \{r_2\})$
$r_2 = (A \to S, \{r_2\}, \{r_1, r_3\})$
$r_3 = (S \to a, \{r_3\}, \emptyset)$

The derivation for the string $aaaa$ is as follows:

$S \Rightarrow_{r_1} AA \Rightarrow_{r_1} AA \Rightarrow_{r_2} SA \Rightarrow_{r_2} SS \Rightarrow_{r_2} SS$
$\Rightarrow_{r_1} AAS \Rightarrow_{r_1} AAAA \Rightarrow_{r_1} AAAA$
$\Rightarrow_{r_2} SAAA \Rightarrow_{r_2} SSAA \Rightarrow_{r_2} SSSA \Rightarrow_{r_2} SSSS \Rightarrow_{r_2} SSSS$
$\Rightarrow_{r_3} aSSS \Rightarrow_{r_3} aaSS \Rightarrow_{r_3} aaaS \Rightarrow_{r_3} aaaa \Rightarrow_{r_3} aaaa$

As can be seen from the derivation and the rules, each time $r_1$ and $r_2$ succeed, they feed back to themselves, which forces each rule to continue to rewrite the string over and over until it can do so no more. Upon failing, the derivation can switch to a different rule. In the case of $r_1$, that means rewriting all $S$s as $AA$s, then switching to $r_2$. In the case of $r_2$, it means rewriting all $A$s as $S$s, then switching either to $r_1$, which will lead to doubling the number of $S$s produced, or to $r_3$, which converts the $S$s to $a$s and then halts the derivation. Each cycle through $r_1$ then $r_2$ is therefore followed either by another doubling of the number of $S$s, or by the conversion of the $S$s to $a$s. The trivial case of generating $a$, in case it is difficult to see, simply involves starting with $r_3$, which rewrites $S$ as $a$ directly; equivalently, one can start with $r_2$, which applies vacuously because the start string contains no $A$, and then jump to $r_3$.
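The success and failure fields can be simulated directly. The sketch below is illustrative only: `RULES`, `run` and `seq` are hypothetical names, and the sketch assumes that a successful rule rewrites the leftmost occurrence of its left-hand side.

```python
# label -> (lhs, rhs, success field, failure field), as in the example.
RULES = {
    "r1": ("S", "AA", {"r1"}, {"r2"}),
    "r2": ("A", "S", {"r2"}, {"r1", "r3"}),
    "r3": ("S", "a", {"r3"}, set()),
}


def run(labels, start="S"):
    """Follow a sequence of rule labels; return the final string or None.

    Assumption of this sketch: the leftmost occurrence is rewritten.
    """
    string = start
    allowed = set(RULES)  # the first rule may be chosen freely
    for label in labels:
        if label not in allowed:
            return None  # not licensed by the previous rule's field
        lhs, rhs, success, failure = RULES[label]
        if lhs in string:
            string = string.replace(lhs, rhs, 1)
            allowed = success
        else:
            allowed = failure  # vacuous application
    return string


# Rule sequence of the derivation for aaaa (without the final vacuous r3).
seq = ["r1"] * 2 + ["r2"] * 3 + ["r1"] * 3 + ["r2"] * 5 + ["r3"] * 4
print(run(seq))             # aaaa
print(run(["r1", "r2"]))    # None: r1 succeeded, so only r1 may follow
```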

Control by context conditions

Unlike grammars controlled by prescribed sequences of production rules, which constrain the space of valid derivations but do not constrain the sorts of sentences that a production rule can apply to, grammars controlled by context conditions have no sequence constraints, but permit constraints of varying complexity on the sentences to which a production rule applies. Similar to grammars controlled by prescribed sequences, there are multiple different kinds of grammars controlled by context conditions: conditional grammars, semi-conditional grammars, random context grammars, and ordered grammars.

Conditional grammars

Conditional grammars are the simplest version of grammars controlled by context conditions. The structure of a conditional grammar is very similar to that of a normal rewrite grammar: $G = (N, T, S, P)$, where $N$, $T$, and $S$ are as defined in a context-free grammar, and $P$ is a set of pairs of the form $(p, R)$ where $p$ is a production rule (usually context-free), and $R$ is a language (usually regular) over $N \cup T$. When $R$ is regular, $R$ can just be expressed as a regular expression.

Proof-theoretic definition

With this definition of a conditional grammar, we can define the derives relation as follows:

Given two strings $x, y \in (N \cup T)^*$, and some production rule $p = (A \to w, R) \in P$,

$x \Rightarrow_{p} y$ if and only if $x = x' A x''$, $y = x' w x''$, and $x \in R$

Informally then, the production rule for some pair in $P$ can apply only to strings that are in its context language. Thus, for example, if we had some pair $(S \to x, a^* S b^*)$, we can only apply this to strings consisting of any number of $a$s followed by exactly one $S$ followed by any number of $b$s, i.e. to sentences in $\{a^m S b^n : m, n \geq 0\}$, such as the strings $S$, $aSb$, $aaaS$, $aSbbbbbb$, etc. It cannot apply to strings like $xSy$, $aaaSxbbb$, etc.

Example

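Conditional grammars can generate the context-sensitive language $\{a^{2^n} : n \geq 0\}$.

Let $G = (\{S, A, B\}, \{a\}, S, \{f, g, h, k\})$, where

$f = (S \to AA, A^* S^+)$
$g = (A \to B, B^* A^+)$
$h = (B \to S, S^* B^+)$
$k = (S \to a, a^* S^+)$

We can then generate the sentence $aaaa$ with the following derivation:

$S \Rightarrow_{f} AA \Rightarrow_{g} BA \Rightarrow_{g} BB$
$\Rightarrow_{h} SB \Rightarrow_{h} SS \Rightarrow_{f} AAS \Rightarrow_{f} AAAA$
$\Rightarrow_{g} BAAA \Rightarrow_{g} BBAA \Rightarrow_{g} BBBA \Rightarrow_{g} BBBB$
$\Rightarrow_{h} SBBB \Rightarrow_{h} SSBB \Rightarrow_{h} SSSB \Rightarrow_{h} SSSS$
$\Rightarrow_{k} aSSS \Rightarrow_{k} aaSS \Rightarrow_{k} aaaS \Rightarrow_{k} aaaa$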
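Because the context languages of this grammar are regular, the derivation can be checked with ordinary regular expressions. The following is a small Python sketch under that assumption; `RULES`, `apply_rule` and `derive` are hypothetical names, and the leftmost occurrence of the left-hand side is rewritten.

```python
import re

# label -> (lhs, rhs, regular expression the whole string must match).
RULES = {
    "f": ("S", "AA", r"A*S+"),
    "g": ("A", "B", r"B*A+"),
    "h": ("B", "S", r"S*B+"),
    "k": ("S", "a", r"a*S+"),
}


def apply_rule(string, label):
    """Rewrite the leftmost lhs if the whole string lies in the rule's context."""
    lhs, rhs, context = RULES[label]
    if lhs in string and re.fullmatch(context, string):
        return string.replace(lhs, rhs, 1)
    return None


def derive(labels, start="S"):
    """Apply the labelled rules in order; None as soon as one is blocked."""
    string = start
    for label in labels:
        string = apply_rule(string, label)
        if string is None:
            return None
    return string


print(derive("fgghhffgggghhhhkkkk"))  # aaaa
print(derive("fgh"))                  # None: BA is not in S*B+
```

The second call shows the role of the context languages: rule $h$ is blocked until every $A$ has become a $B$.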

Semi-conditional grammars

A semi-conditional grammar is very similar to a conditional grammar, and technically the class of semi-conditional grammars is a subset of the conditional grammars. Rather than specifying what the whole of the string must look like for a rule to apply, semi-conditional grammars specify that a string must have as substrings all of some set of strings, and none of another set, in order for a rule to apply. Formally, then, a semi-conditional grammar is a tuple $G = (N, T, S, P)$, where $N$, $T$, and $S$ are defined as in a CFG, and $P$ is a set of rules like $(p, R, Q)$ where $p$ is a (usually context-free) production rule, and $R$ and $Q$ are finite sets of strings. The derives relation can then be defined as follows.

For two strings $x A x', x w x' \in (N \cup T)^*$, and some rule $p = (A \to w, R, Q) \in P$,

$x A x' \Rightarrow_{p} x w x'$ if and only if every string in $R$ is a substring of $x A x'$, and no string in $Q$ is a substring of $x A x'$

The language of a semi-conditional grammar is then trivially the set of terminal strings $L(G) = \{w \in T^* : S \Rightarrow^* w\}$.

An example of a semi-conditional grammar is given below also as an example of random context grammars.

Random context grammars

A random context grammar is a semi-conditional grammar in which the $R$ and $Q$ sets are all subsets of $N$. Because subsets of $N$ are finite sets of strings over $N \cup T$, it is clear that random context grammars are indeed kinds of semi-conditional grammars.

Example

Like conditional grammars, random context grammars (and thus semi-conditional grammars) can generate the language $\{a^{2^n} : n \geq 0\}$. One grammar which can do this is:

Let $G = (\{S, X, Y, A\}, \{a\}, S, \{r_1, r_2, r_3, r_4, r_5\})$, where

$r_1 = (S \to XX, \emptyset, \{Y, A\})$
$r_2 = (X \to Y, \emptyset, \{S\})$
$r_3 = (Y \to S, \emptyset, \{X\})$
$r_4 = (S \to A, \emptyset, \{X\})$
$r_5 = (A \to a, \emptyset, \{S\})$

Consider now the production for $aaaa$:

$S \Rightarrow_{r_1} XX \Rightarrow_{r_2} YX \Rightarrow_{r_2} YY \Rightarrow_{r_3} SY \Rightarrow_{r_3} SS$
$\Rightarrow_{r_1} XXS \Rightarrow_{r_1} XXXX \Rightarrow_{r_2} YXXX \Rightarrow_{r_2} YYXX \Rightarrow_{r_2} YYYX \Rightarrow_{r_2} YYYY$
$\Rightarrow_{r_3} SYYY \Rightarrow_{r_3} SSYY \Rightarrow_{r_3} SSSY \Rightarrow_{r_3} SSSS$
$\Rightarrow_{r_4} ASSS \Rightarrow_{r_4} AASS \Rightarrow_{r_4} AAAS \Rightarrow_{r_4} AAAA$
$\Rightarrow_{r_5} aAAA \Rightarrow_{r_5} aaAA \Rightarrow_{r_5} aaaA \Rightarrow_{r_5} aaaa$

The behavior of the $R$ sets here is trivial: any string can be rewritten according to them, because they do not require any substrings to be present. The behavior of the $Q$ sets, however, is more interesting. In $r_1$, we are forced by the $Q$ set to rewrite an $S$, thus beginning an $S$-doubling process, only when no $Y$s or $A$s are present in the string, which means only when a prior $S$-doubling process has been fully completed, eliminating the possibility of only doubling some of the $S$s. In $r_2$, which moves the $S$-doubling process into its second stage, we cannot begin this process until the first stage is complete and there are no more $S$s to try to double, because the $Q$ set prevents the rule from applying if there is an $S$ symbol still in the string. In $r_3$, we complete the doubling stage by introducing the $S$s back only when there are no more $X$s to rewrite, thus when the second stage is complete. We can cycle through these stages as many times as we want, rewriting all $S$s to $XX$s before then rewriting each $X$ to a $Y$, and then each $Y$ to an $S$, finally ending by replacing each $S$ with an $A$ and then an $a$. Because the rule for replacing $S$ with $A$ prohibits application to a string with an $X$ in it, we cannot apply this in the middle of the first stage of the $S$-doubling process, thus again preventing us from only doubling some $S$s.
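The effect of the permitting and forbidding sets can be seen in a short simulation. This is a minimal Python sketch with hypothetical names (`RULES`, `applicable`, `derive`); since every symbol here is a single character, set membership stands in for the substring test, and the leftmost occurrence is rewritten.

```python
# label -> (lhs, rhs, required symbols R, forbidden symbols Q)
RULES = {
    "r1": ("S", "XX", set(), {"Y", "A"}),
    "r2": ("X", "Y", set(), {"S"}),
    "r3": ("Y", "S", set(), {"X"}),
    "r4": ("S", "A", set(), {"X"}),
    "r5": ("A", "a", set(), {"S"}),
}


def applicable(string, label):
    """True if lhs occurs, all of R occurs, and nothing from Q occurs."""
    lhs, _, required, forbidden = RULES[label]
    symbols = set(string)
    return lhs in symbols and required <= symbols and not (forbidden & symbols)


def derive(labels, start="S"):
    """Apply the labelled rules in order; None as soon as one is blocked."""
    string = start
    for label in labels:
        if not applicable(string, label):
            return None
        lhs, rhs = RULES[label][:2]
        string = string.replace(lhs, rhs, 1)
    return string


seq = (["r1"] + ["r2"] * 2 + ["r3"] * 2 + ["r1"] * 2
       + ["r2"] * 4 + ["r3"] * 4 + ["r4"] * 4 + ["r5"] * 4)
print(derive(seq))                 # aaaa
print(derive(["r1", "r2", "r3"]))  # None: r3 is blocked while an X remains
```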

Ordered grammars

Ordered grammars are perhaps one of the simpler extensions of grammars into the controlled grammar domain. An ordered grammar is simply a tuple $G = (N, T, S, P)$ where $N$, $T$, and $S$ are identical to those in a CFG, and $P$ is a set of context-free rewrite rules with a partial ordering $<$. The partial ordering is then used to determine which rule to apply to a string, when multiple rules are applicable. The derives relation is then:

Given some strings $x A x', x w x' \in (N \cup T)^*$ and some rule $p = A \to w \in P$,

$x A x' \Rightarrow_{p} x w x'$ if and only if there is no rule $p' = A' \to w' \in P$, applicable to $x A x'$, such that $p < p'$.

Example

Like many other contextually controlled grammars, ordered grammars can enforce the application of rules in a particular order. Since this is the essential property of previous grammars that could generate the language $\{a^{2^n} : n \geq 0\}$, it should be no surprise that a grammar that explicitly uses rule ordering, rather than encoding it via string contexts, should similarly be able to capture that language. And as it turns out, just such an ordered grammar exists:

Let $G = (\{S, X, Y, Z, A\}, \{a\}, S, P)$, where $P$ is the partially ordered set described by the Hasse diagram

[Figure: Hasse diagram of the partial order on $P$]

The derivation for the string $aaaa$ is simply:

$S \Rightarrow_{S \to XX} XX \Rightarrow_{X \to Y} YX \Rightarrow_{X \to Y} YY \Rightarrow_{Y \to S} SY \Rightarrow_{Y \to S} SS$
$\Rightarrow_{S \to XX} XXS \Rightarrow_{S \to XX} XXXX$
$\Rightarrow_{X \to Y} YXXX \Rightarrow_{X \to Y} YYXX \Rightarrow_{X \to Y} YYYX \Rightarrow_{X \to Y} YYYY$
$\Rightarrow_{Y \to S} SYYY \Rightarrow_{Y \to S} SSYY \Rightarrow_{Y \to S} SSSY \Rightarrow_{Y \to S} SSSS$
$\Rightarrow_{S \to A} ASSS \Rightarrow_{S \to A} AASS \Rightarrow_{S \to A} AAAS \Rightarrow_{S \to A} AAAA$
$\Rightarrow_{A \to a} aAAA \Rightarrow_{A \to a} aaAA \Rightarrow_{A \to a} aaaA \Rightarrow_{A \to a} aaaa$

At each step of the way, the derivation proceeds by rewriting in cycles. Notice that at the fifth step, with the string $SY$, we had four options: $Y \to Z$, $S \to Z$, $Y \to S$, $S \to A$, the first two of which halt the derivation, as $Z$ cannot be rewritten. In the example, we used $Y \to S$ to derive $SS$, but consider if we had chosen $S \to A$ instead. We would have produced the string $AY$, the options for which are $Y \to Z$ and $A \to Z$, both of which halt the derivation. Thus with the string $SY$, and conversely with $YS$, we must rewrite the $Y$ to produce $SS$. The same holds for other combinations, so that overall, the ordering forces the derivation to halt, or else proceed by rewriting all $S$s to $XX$s, then all $X$s to $Y$s, then all $Y$s to $S$s, and so on, then finally all $S$s to $A$s and then all $A$s to $a$s. In this way, a string $S^n$ can only ever be rewritten as $A^n$, which produces $a^n$, or as $S^{2n}$. Starting with $n = 1$, it should be clear that this grammar only generates the language $\{a^{2^n} : n \geq 0\}$.

Grammars with parallelism

A still further class of controlled grammars is the class of grammars with parallelism in the application of a rewrite operation, in which each rewrite step can (or must) rewrite more than one non-terminal simultaneously. These, too, come in several flavors: Indian parallel grammars, k-grammars, scattered context grammars, unordered scattered context grammars, and k-simple matrix grammars. Again, the variants differ in how the parallelism is defined.

Indian parallel grammars

An Indian parallel grammar is simply a CFG in which, to use a rewrite rule, all instances of the rule's non-terminal symbol must be rewritten simultaneously. Thus, for example, given the string $aXbYcXd$, with two instances of $X$, and some rule $X \to w$, the only way to rewrite this string with this rule is to rewrite it as $awbYcwd$; neither $awbYcXd$ nor $aXbYcwd$ is a valid rewrite in an Indian parallel grammar, because they do not rewrite all instances of $X$.

Indian parallel grammars can easily produce the language $\{ww : w \in \{a, b\}^*\}$:

Let $G = (\{S, A\}, \{a, b\}, S, \{f, g, h, k\})$, where

$f = S \to AA$
$g = A \to aA$
$h = A \to bA$
$k = A \to \epsilon$

Generating $aabaab$ then is quite simple:

$S \Rightarrow_{f} AA \Rightarrow_{g} aAaA \Rightarrow_{g} aaAaaA \Rightarrow_{h} aabAaabA \Rightarrow_{k} aabaab$

The language $\{a^{2^n} : n \geq 0\}$ is even simpler:

Let $G = (\{S\}, \{a\}, S, P)$, where $P$ consists of

$S \to SS$
$S \to a$

It should be obvious, just from the first rule, and the requirement that all instances of a non-terminal are rewritten simultaneously with the same rule, that the number of $S$s doubles on each rewrite step using the first rule, giving the derivation steps $S \Rightarrow S^2 \Rightarrow S^4 \Rightarrow S^8 \Rightarrow \ldots$. Final application of the second rule replaces all the $S$s with $a$s, thus showing how this simple grammar can produce the language $\{a^{2^n} : n \geq 0\}$.
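Rewriting all occurrences at once corresponds closely to a global string replacement, so both examples can be sketched in a few lines of Python. The names `COPY_RULES`, `DOUBLE_RULES` and `derive` are hypothetical; the labels `d` and `t` for the two rules of the second grammar are introduced here only for illustration.

```python
# The copy-language grammar from the text: label -> (lhs, rhs).
COPY_RULES = {
    "f": ("S", "AA"),
    "g": ("A", "aA"),
    "h": ("A", "bA"),
    "k": ("A", ""),   # A -> epsilon
}
# The doubling grammar; the labels "d" and "t" are made up for this sketch.
DOUBLE_RULES = {
    "d": ("S", "SS"),
    "t": ("S", "a"),
}


def derive(rules, labels, start="S"):
    """Apply each rule to ALL occurrences of its left-hand side at once."""
    string = start
    for label in labels:
        lhs, rhs = rules[label]
        if lhs not in string:
            return None  # the rule has nothing to rewrite
        string = string.replace(lhs, rhs)  # replaces every occurrence
    return string


print(derive(COPY_RULES, "fgghk"))   # aabaab
print(derive(DOUBLE_RULES, "dddt"))  # aaaaaaaa
```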

K-grammars

A k-grammar is yet another kind of parallel grammar, very different from an Indian parallel grammar, but still with a level of parallelism. In a k-grammar, for some number $k$, exactly $k$ non-terminal symbols must be rewritten at every step (except the first step, where the only symbol in the string is the start symbol). If the string has fewer than $k$ non-terminals, the derivation fails.

A 3-grammar can produce the language $\{a^n b^n c^n : n > 0\}$, as can be seen below:

Let $G = (\{S, A, B, C\}, \{a, b, c\}, S, P)$, where $P$ consists of:

$S \to ABC$
$A \to aA$
$A \to a$
$B \to bB$
$B \to b$
$C \to cC$
$C \to c$

With the following derivation for $aaabbbccc$:

$S \Rightarrow ABC \Rightarrow aAbBcC \Rightarrow aaAbbBccC \Rightarrow aaabbbccc$

At each step in the derivation except the first and last, we used the self-recursive rules $A \to aA$, $B \to bB$, $C \to cC$. If we had not used the recursive rules, instead using, say, $A \to a$, $B \to bB$, $C \to cC$, where one of the rules is not self-recursive, the number of non-terminals would have decreased to 2, thus making the string unable to be derived further because it would have too few non-terminals to be rewritten.
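A single step of a k-grammar can be sketched as a function that rewrites several distinct non-terminal occurrences at once. The Python below is illustrative only: `k_step`, `GROW` and `STOP` are hypothetical names, $k$ is taken to be the number of rules passed in, and each rule is assumed to claim the leftmost occurrence of its left-hand side that no earlier rule in the same step has claimed.

```python
def k_step(string, rules):
    """Rewrite len(rules) distinct non-terminal occurrences simultaneously.

    `rules` is a list of (lhs, rhs) pairs. Returns None if some pair
    finds no unclaimed occurrence of its left-hand side.
    """
    claimed = {}  # position in `string` -> replacement text
    for lhs, rhs in rules:
        pos = next(
            (i for i, c in enumerate(string) if c == lhs and i not in claimed),
            None,
        )
        if pos is None:
            return None
        claimed[pos] = rhs
    return "".join(claimed.get(i, c) for i, c in enumerate(string))


GROW = [("A", "aA"), ("B", "bB"), ("C", "cC")]
STOP = [("A", "a"), ("B", "b"), ("C", "c")]

string = "ABC"  # result of the first step S -> ABC, which rewrites one symbol
for rules in (GROW, GROW, STOP):
    string = k_step(string, rules)
print(string)  # aaabbbccc

# Mixing in a non-recursive rule too early leaves only two non-terminals,
# so no further 3-step is possible.
stuck = k_step("ABC", [("A", "a"), ("B", "bB"), ("C", "cC")])
print(stuck, k_step(stuck, GROW))  # abBcC None
```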

Russian parallel grammars

Russian parallel grammars[2] are somewhere between Indian parallel grammars and k-grammars, defined as $G = (N, T, S, P)$, where $N$, $T$, and $S$ are as in a context-free grammar, and $P$ is a set of pairs $(A \to w, k)$, where $A \to w$ is a context-free production rule, and $k$ is either 1 or 2. Application of a rule $p = (A \to w, k)$ involves rewriting $k$ occurrences of $A$ to $w$ simultaneously.

Scattered context grammars

A scattered context grammar is a 4-tuple $G = (N, T, S, P)$ where $N$, $T$, and $S$ are defined as in a context-free grammar, and $P$ is a set of tuples called matrices $p = (A_1 \to w_1, \ldots, A_n \to w_n)$, where $n > 0$ can vary according to the matrix. The derives relation for such a grammar is

$x \Rightarrow_{p} y$ if and only if
$p = (A_1 \to w_1, \ldots, A_n \to w_n) \in P$, and
$x = x_1 A_1 x_2 \ldots x_n A_n x_{n+1}$, $y = x_1 w_1 x_2 \ldots x_n w_n x_{n+1}$, for $x_i \in (N \cup T)^*$

Intuitively, then, the matrices in a scattered context grammar provide a list of rules which must each be applied to non-terminals in a string, where those non-terminals appear in the same linear order as the rules that rewrite them.

An unordered scattered context grammar is a scattered context grammar in which, for every rule in P, each of its permutations is also in P. As such, a rule and its permutations can instead be represented as a set rather than as tuples.

Example

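Scattered context grammars are capable of describing the language $\{a^n b^n c^n : n \geq 0\}$ quite easily.

Let $G = (\{S\}, \{a, b, c\}, S, \{r_1, r_2, r_3\})$, where

$r_1 = (S \to SSS)$
$r_2 = (S \to aS, S \to bS, S \to cS)$
$r_3 = (S \to \epsilon, S \to \epsilon, S \to \epsilon)$

Deriving $aaabbbccc$ then is trivial:

$S \Rightarrow_{r_1} SSS \Rightarrow_{r_2} aSbScS \Rightarrow_{r_2} aaSbbSccS \Rightarrow_{r_2} aaaSbbbScccS \Rightarrow_{r_3} aaabbbccc$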
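Applying a scattered context matrix amounts to finding its left-hand sides in the string in left-to-right order and replacing them together. The following Python sketch illustrates this; `apply_matrix`, `R1`, `R2` and `R3` are hypothetical names, and the sketch assumes that the leftmost possible occurrences are the ones rewritten.

```python
def apply_matrix(string, matrix):
    """Apply a scattered context rule (A1 -> w1, ..., An -> wn).

    The left-hand sides must occur in `string` in the same left-to-right
    order as in `matrix`. Returns None if they cannot all be matched.
    """
    pieces = []
    pos = 0
    for lhs, rhs in matrix:
        hit = string.find(lhs, pos)
        if hit == -1:
            return None
        pieces.append(string[pos:hit] + rhs)  # untouched context, then w_i
        pos = hit + len(lhs)
    pieces.append(string[pos:])  # trailing context x_{n+1}
    return "".join(pieces)


R1 = [("S", "SSS")]
R2 = [("S", "aS"), ("S", "bS"), ("S", "cS")]
R3 = [("S", ""), ("S", ""), ("S", "")]

string = "S"
for matrix in (R1, R2, R2, R2, R3):
    string = apply_matrix(string, matrix)
print(string)  # aaabbbccc
```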

References


  1. Dassow, J., Pǎun, Gh., and Salomaa, A. Grammars with Controlled Derivations. In G. Rozenberg and A. Salomaa (Eds.) Handbook of Formal Languages, Vol. 2, Ch. 3.
  2. Dassow, J. 1984. On some extensions of russian parallel context free grammars. Acta Cybernetica 6, pp. 355-360.