We describe our submission for task 1B of the BioCreAtIvE competition which is concerned with grounding gene mentions with respect to databases of organism gene identifiers. Several approaches to gene identification, lookup, and disambiguation are presented. Results are presented with two possible baseline systems and a discussion of the source of precision and recall errors as well as an estimate of precision and recall for an organism-specific tagger bootstrapped from gene synonym lists and the task 1B training data.
|Title of host publication||Proceeding of the BioCreAtIvE (Critical Assessment of Information Extraction Systems in Biology) Workshop 2004|
|Number of pages||5|
|Publication status||Published - 2004|