Book: Automatic Biological Term Annotation, by Sittichai Jiampojamarn

Automatic Biological Term Annotation

Using n-gram and Classification Models

Language: English
Binding: Paperback
Availability: Available to order from the publisher
Ships in 17-27 days
1 173

Book information

Language: English
Binding: Book - Paperback
Published: 2009
Pages: 96
EAN: 9783639107333
Enbook ID: 06819675
Weight: 159
Dimensions: 150 x 220 x 6

Full description

Exciting research in biology has resulted in a large number of biological publications. Knowledge discovery in biology has become an interesting task, which can be approached by recognizing terms in text to extract useful information such as interaction relationships.

We propose the Automatic Biological Term Annotation (ABTA) system, which uses classification methods to annotate terms in text. A novel method is presented to express lexical features in pattern notations. Prefix and suffix characters are used instead of lists of potential terms or external resources. We demonstrate that part-of-speech tag information is the most effective attribute. Classification exemplars are created from text using a word n-gram model. We illustrate improvements in our system's performance, which depends on the feature attributes we define. Biological concept markers are also assigned to each located term, indicating its meaning. Our results are comparable to the performance of other existing systems, while our system retains simplicity and generalizability.
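
The description names three ingredients: lexical features written as patterns, prefix and suffix characters, and exemplars built from word n-grams. The Python sketch below shows one plausible way these could fit together. It is not the ABTA implementation: the pattern alphabet (`A`, `a`, `0`), the affix length of 3, the window size, and all function names are assumptions made for illustration, and part-of-speech tags are left out because they would need an external tagger.

```python
import re


def lexical_pattern(token: str) -> str:
    """Collapse a token into a coarse shape (assumed notation, not the book's).

    Runs of uppercase letters become "A", runs of lowercase letters become "a",
    runs of digits become "0"; every other character is kept unchanged.
    """
    shape = re.sub(r"[A-Z]+", "A", token)
    shape = re.sub(r"[a-z]+", "a", shape)
    shape = re.sub(r"[0-9]+", "0", shape)
    return shape


def token_features(token: str, affix_len: int = 3) -> dict:
    """Describe one token by its prefix, suffix, and lexical pattern."""
    return {
        "prefix": token[:affix_len],
        "suffix": token[-affix_len:],
        "pattern": lexical_pattern(token),
    }


def ngram_exemplars(tokens: list, n: int = 3, affix_len: int = 3) -> list:
    """Slide a window of n consecutive words over the text.

    Each window becomes one exemplar: a list of n per-token feature dicts.
    A text shorter than n words yields no exemplars.
    """
    return [
        [token_features(t, affix_len) for t in tokens[i:i + n]]
        for i in range(len(tokens) - n + 1)
    ]


if __name__ == "__main__":
    sentence = ["IL-2", "activates", "NF-kappaB", "signaling"]
    for exemplar in ngram_exemplars(sentence, n=3):
        print(exemplar)
```

Each exemplar could then be passed to any off-the-shelf classifier together with a label for the word of interest. How labels are assigned and which classifier is used are not stated in the description above.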
