An insight into cyanobacterial genomics--a perspective.

At the turn of the millennium, cyanobacteria deserve a review of their past, present and future. The advent of post-genomic research, which encompasses functional genomics, structural genomics, transcriptomics, pharmacogenomics, proteomics and metabolomics, allows a systematic and wide-ranging approach to the study of biological systems. By exploiting genomic and associated protein information through computational analyses, the fledgling information generated by biotechnological analyses could be extrapolated to fill the lacunae in the scarce information on cyanobacteria. As an effort in that direction, this paper attempts to highlight the perspectives available and to encourage researchers to concentrate on the field of cyanobacterial informatics.


Background: Characteristics and importance of Cyanobacteria
The algae are the simplest members of the plant kingdom, and the cyanobacteria (blue-green algae) are the simplest of the algae, having considerable and increasing economic importance. They are relatives of the bacteria, not of the eukaryotes; it is only the chloroplast of eukaryotic algae to which the cyanobacteria are related. Like all eubacteria, their cell walls contain peptidoglycan. Studies of metabolic similarities and ribosomal RNA sequences suggest that the cyanobacteria form a good monophyletic taxon. Although they are truly prokaryotic, cyanobacteria have an elaborate and highly organized system of internal membranes that function in photosynthesis. Chlorophyll a and several accessory pigments (phycoerythrin and phycocyanin) are embedded in these photosynthetic lamellae, which are the analogs of the eukaryotic thylakoid membranes. Cyanobacteria are widely distributed over land and water, often in environments where no other vegetation can exist. Found in almost every conceivable habitat, from oceans to fresh water to bare rock to soil, they produce the compounds responsible for the "earthy" odors we detect in soil. The greenish slime on the side of a damp flower pot, the wall of a house or the trunk of a big tree is most likely due to cyanobacteria, which have also been found on the fur of polar bears, to which they impart a greenish tinge. They have also been tremendously important in shaping the course of evolution and ecological change throughout the earth's history. Heterocyst-forming species of cyanobacteria that "fix" atmospheric nitrogen occupy a niche in soil and have a promoting effect on the rice paddies of Asia [1], which feed about 75% of the world's human population. The coating of blue-greens on prairie soil binds the soil particles to their mucilage coating, maintains a high water content and reduces erosion. Humans also consume Spirulina, which contains all the amino acids essential for humans and forms a staple food in parts of Africa and Mexico. 
In China, Taiwan and Japan, several blue-greens are served as a side dish and a delicacy. Worldwide, they are cultured and commercially processed for various food and medicinal products, such as vitamins, drug compounds and growth factors, and also for the production of hydrogen gas and fertilizers.

Status of cyanobacterial genomics
Genomics allows us to make new and unexpected links between the functions of unrelated and hitherto uncharacterized genes and to suggest hypotheses about their biochemistry.

Comment on heterocyst coding genes
In N. punctiforme, the heterocyst regulatory gene product HetR may [8] or may not [9] be involved in the induction of akinete differentiation. However, a gene, avaK, whose product was found to be enriched in akinetes of Anabaena variabilis, has homologues in both the N. punctiforme and Anabaena 7120 genomes [10]. Moreover, their associations with plants are regulated by a hormogonium-inducing factor (HIF), which induces transcription of the genes SigH and ctpH [11]. Perhaps, in a heterotrophic metabolic mode, the rate of nitrogen fixation is elevated, with the excess fixed nitrogen being released to the plant partner. Although the informatics comparison of different components of cyanobacteria has been initiated, work is at a standstill owing to the unavailability of information on some of the other organisms (Table 1 in supplementary material).

Informatics on cyanobacterial phylogeny
Although systematic classifications of microbes in general exist, the phylogeny of the cyanobacteria remains poorly understood because most classification schemes were organized by cell or colony shape. Recently, however, genomic tools have been widely applied to molecular sequences to interpret the phylogeny of nitrogenase, providing a truer evolutionary scheme, especially for nifH. Hundreds of published and available nitrogenase gene sequences from many different species can now be compared with ribosomal genes to determine whether they are consistent with multiple losses, with multiple transfers, or with some combination of both processes, in support of genetic diversity analysis. Accordingly, the nitrogenase enzymes of all the nitrogen-fixing organisms (with few exceptions) were found to be similar and clearly derived from a common ancestor, with their present distribution shaped mainly by horizontal gene transfer or by independent loss in many lineages. However, the clades formed also indicated a large number of more distant relatives; some of these were well-known proteins involved in the synthesis of photosynthetic pigments, namely protochlorophyllide reductase and chlorin reductase. As gene sequences became available, it was realized that nifE is a homolog of nifD and nifN is a homolog of nifK. Hence, it seems certain that nifDK and nifEN are the products of an ancient gene duplication. Similarly, phylogenetic studies on the true branching cyanobacteria [12] revealed that they exhibit a high degree of morphological complexity. The phylogenetic trees constructed on the basis of the 16S rRNA, rpoB and rbcLX gene sequences of the genera Anabaena, Aphanizomenon, Trichormus and Nostoc showed that the classification proposed [13] for these anabaenoids needed revision. 
This being the current status, the rich worldwide diversity of cyanobacteria requires more careful attention (Table 1 in supplementary material).
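
The comparison of nitrogenase and ribosomal gene sequences described above can be illustrated with a minimal sketch. It is not the method used in the studies cited here; it only shows one simple way such a comparison might be set up. It computes uncorrected pairwise distances (p-distances) for two aligned markers and then correlates the two sets of distances. A low correlation would merely hint that the two genes have different histories, for example through gene transfer, and would need proper phylogenetic analysis to confirm. The taxon names and the short sequences are hypothetical placeholders, not real nifH or 16S rRNA data, and all function names are assumptions made for this illustration.

```python
from itertools import combinations
from math import sqrt


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences.

    Columns where either sequence has a gap ('-') are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def pairwise_distances(alignment):
    """Return {(taxon1, taxon2): p-distance} for every unordered pair of taxa."""
    return {
        (s, t): p_distance(alignment[s], alignment[t])
        for s, t in combinations(sorted(alignment), 2)
    }


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        raise ValueError("distances are constant; correlation is undefined")
    return cov / (sx * sy)


def congruence(marker_a, marker_b):
    """Correlate the pairwise distances of two markers over their shared taxon pairs."""
    da = pairwise_distances(marker_a)
    db = pairwise_distances(marker_b)
    shared = sorted(set(da) & set(db))
    return pearson([da[k] for k in shared], [db[k] for k in shared])


# Hypothetical toy alignments (placeholders, NOT real sequence data).
TOY_16S = {
    "taxon_A": "ACGTACGTAC",
    "taxon_B": "ACGTACGTAT",
    "taxon_C": "ACGAACGTTT",
    "taxon_D": "TCGAACCTTT",
}
# In this toy marker, taxon_D is identical to taxon_A, mimicking a transferred gene.
TOY_NIFH = {
    "taxon_A": "GGCATTGGCA",
    "taxon_B": "GGCATTGGTA",
    "taxon_C": "GGTATCGGTA",
    "taxon_D": "GGCATTGGCA",
}

if __name__ == "__main__":
    print("16S vs nifH distance correlation:", congruence(TOY_16S, TOY_NIFH))
```

In practice, real alignments, corrected distance models and tree-based tests of congruence would replace this toy correlation; the sketch is meant only to make the logic of comparing a functional gene against a ribosomal reference concrete.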

Conclusion:
It is too early to draw conclusions because, although we are witnessing a remarkable change in the scale of molecular microbiological research and are entering an integrative "genomic science", access to genome information is still very limited. Perhaps the information on sequences, gene organization and phylogeny reviewed in the present perspective will allow us to identify further clues about the origin of the nitrogenase genes in these organisms. The location of the genes may indicate whether they are long-term residents or recent intruders. Besides this, genome information is most valuable in predicting the enzymatic machinery required for the expression of proteins. Now, with the availability of more complete sets of nitrogenase sequences from hundreds of microbial genomes, it is possible to reinforce the complexity and distinctness of the A, B and C types. Although analyses of the genomic context of these genes are still in their infancy, they promise to provide new insights into the distribution of nitrogenase genes and their integration into the metabolism of their host organisms.