
Title

JUZBOX: A web server for extracting biomedical words from the protein sequence

Authors

 

Bobby Paul, S. Balaji*, V. Sathyanath, Santhosh J. Eapen

Affiliation

 

Indian Institute of Spices Research, Calicut, Kerala, India

Email

 

blast_balaji@rediffmail.com

Article Type

 

Web Server

 

Date

 

Received May 25, 2009; Revised July 31, 2009; Accepted September 11, 2009; Published November 17, 2009

 

Abstract

The recognition of gene and protein names is a pivotal step in processing biological literature for information extraction and data mining. We have compiled a lexicon of biomedical words (conserved patterns/potential motifs) composed only of the 20 letters that represent amino acids. The remaining 6 letters of the English alphabet (B, J, O, U, X, Z) are treated as invalid amino acid characters in our context. For convenience, we jumbled these 6 letters into the term ‘JUZBOX’, and words containing any of these characters were filtered out of the biomedical lexicon. The generation of biomedical words from protein sequences using JUZBOX has applications in functional annotation.
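The filtering idea described in the abstract can be illustrated with a minimal Python sketch. This is not the JUZBOX server's implementation, which is not described here; the function names `filter_lexicon` and `find_words`, the `min_length` parameter, the sample word list, and the sample sequence are all hypothetical and chosen only for illustration. The sketch drops any word containing one of the six JUZBOX letters and then reports where the remaining words occur in a protein sequence.

```python
# Illustrative sketch only; names and sample data are assumptions,
# not part of the JUZBOX server.

# The six letters treated as invalid amino acid characters in the article.
JUZBOX = set("JUZBOX")


def filter_lexicon(words):
    """Keep only words spelled entirely with the 20 amino acid letters."""
    kept = []
    for word in words:
        upper = word.upper()
        # Reject non-alphabetic entries and any word with a JUZBOX letter.
        if upper.isascii() and upper.isalpha() and not (set(upper) & JUZBOX):
            kept.append(upper)
    return kept


def find_words(sequence, lexicon, min_length=3):
    """Return sorted (start, word) pairs for lexicon words found in the sequence."""
    sequence = sequence.upper()
    hits = []
    for word in lexicon:
        if len(word) < min_length:
            continue
        start = sequence.find(word)
        while start != -1:
            hits.append((start, word))
            # Continue searching one position later to catch overlapping hits.
            start = sequence.find(word, start + 1)
    return sorted(hits)


# Hypothetical word list: "helix", "zinc", and "blood" contain JUZBOX letters.
sample_words = ["kinase", "actin", "helix", "zinc", "blood", "spice"]
lexicon = filter_lexicon(sample_words)

# Hypothetical sequence fragment, using zero-based start positions.
hits = find_words("MKACTINGSPICEHELW", lexicon)
print(lexicon)
print(hits)
```

A plain substring scan is used here for readability; for a large lexicon, a multi-pattern matcher would likely be a better fit.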

Keywords

JUZBOX; biomedical words; lexicon

Availability

JUZBOX is available freely at http://www.spices.res.in/juzbox

Citation

 

Paul et al., Bioinformation 4(5): 179-181 (2009)

Edited by

 

P. Kangueane

 

ISSN

 

0973-2063

 

Publisher

 

Biomedical Informatics

License

 

 

This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.