Identification and characterization of conserved miRNAs with its targets mRNA in Trichinella Spiralis

microRNAs (Small regulatory non-coding RNAs) have an important role in gene regulation and evolutionarily conserved molecules. Trichinella spiralis infect majority of species. Therefore, it is of interest to identify conserved miRNAs and their targets using sequences from EST, GSS and full length nucleotides obtained from NCBI against previously reported worm miRNAs. We identify 11 novel miRNAs in T. spiralis by using bioinformatics-homology based search. In addition, we predicted target mRNA genes form complementary base pair in seed region of miRNAs. Further, gene annotation using Uniprot shows that these target genes of miRNAs are involved in various metabolism, enzymatic activity and constituents of membrane components.


Background:
MicroRNAs (miRNAs) are endogenous small, single stranded, non-coding RNA (~19-25 nucleotides) molecules that regulate gene expression at post-transcriptional level either by mRNA cleavage or repressing translation [1,2]. Larger number of miRNAs have been identified and characterized in various organisms such as animals, plants, viruses and in parasites [3][4][5]. miRNA sequences are highly conserved across the species [6,7]. Therefore, it is of interest to identify and characterize conserved miRNA in T. spiralis by using computational analysis.
Nematode of the genus Trichinella are zoonotic parasites which are distributed Worldwide [8,9]. Different species in the genus Trichinella have been identified. Trichinella spiralis (T. spiralis) is most prevalent and pathogenic disease causing nematode in human. It causes food-borne diseases called trichinellosis which infect majority of species involving human [10]. Human trichinellosis has been documented in 55 countries, hence considered as an emerging or re-emerging diseases [11][12][13]. Infection occurs by consumption of raw or undercooked meat contaminated with infective muscle larvae of the parasite [14,15]. After ingestion muscle larvae, releases from their capsules and gets activated into the intestinal infective larvae and then penetrate into the host intestinal epithelium where they molt and mature to adult worm [16]. The female worm reproduce the new born larvae, they are passed into tissue, enter the lymphatic system and travel through the bloodstream into the muscle fibre, once the worms reach the muscular tissue it gets induced into encapsulated nurse cell [17,18] which can survive in host for years. They then wait for a new host infection, thereby the new generation begins. However no miRNA has been reported in miRBase for this genus which provided new opportunity for predicting conserved novel miRNAs and their putative target genes. Identification of miRNAs and the function of target genes in T. spiralis may provide the idea to know the mechanism of infection and their metabolic processes.

Computational tools
The freely available BioEdit software [20] is used for local alignment tool BlastN search, used to predict conserved miRNA in T. spiralis using EST, GSS and nucleotides. The sequences were retrieved from NCBI and saved in FASTA format. Commercial available Geneious software version 7.1.5 were used for pairwise and multiple alignments with default parameter, in addition, it contain RNAfold program [21] to predict secondary structure of pre-miRNAs.

Nomenclature of miRNAs
The predicted miRNA in T. spiralis were named according with miRBase [22]. The mature miRNA sequence are designed with 'miR', and precursor hairpin are labelled as 'mir' with the prefix in three letter 'tsp' for Trichinella spiralis, first letter from genus and first two letter from species.  Table 1: Predicted characterized features of precursor miRNA in T. spiralis are given.  ©2016

283
Identification and prediction of precursor microRNAs Procedure for identifying potential conserved novel miRNAs is summarized in Figure 1. The following steps were considered: local nucleotide BLAST were run with the reference miRNAs set sequences searched against T. spiralis sequences at low e-value threshold using BioEdit software. The results of BLASTN were further analyzed by applying two conditions: 1) at least 19-24 nucleotides length were assumed between the predicted mature miRNAs, more than 90% identity between the sequences. 2) Mismatches between the sequences with 0-4 nucleotide are allowed with reference set of worm mature miRNAs. The precursor sequences were searched at 50 nucleotides upstream and downstream from the location of mature miRNAs.
The following conditions are considered to predict miRNA sequences according to Zhang [23]. 1) The minimum length of pre-miRNA to be 60 nucleotides.
2) The precursor miRNA sequences can fold into an appropriate hairpin secondary structure, the mature miRNA sequence site may be in any one arm of hairpin structure. 3) Not more than 6 mismatches between predicted mature miRNA and the opposite miRNA* sequence in the hairpin structure. 4) No breakage or loop in mature miRNA and miRNA* sequences. 5) Predicted secondary structures had retained minimum free energy (MFE) (≤-18 kcal/mol). 6) 30-70% of G+C content. To search the targets we prefer miRanda software version 3.3a [24], released 2010 available at http://www.microrna.org /miranda. The program used thermodynamics and dynamic programming alignments, along with statistical parameters were followed in these studies with the MFE threshold -20kcal/mol, Score Threshold 120, Scaling Parameter 4 for complementary nucleotide match score. Outputs were subsequently screened out to reduce false positive results. In addition, function of predicted target genes are allowed to understand regulatory network in the cellular, biological and molecular processes using gene annotation Uniprot database (www.uniprot.org).

Results and Discussion:
In silico prediction of miRNAs in T. spiralis Different computational approaches were used to identify miRNA in both animal and plants [25][26][27]. In the present study, we identify the conserved miRNAs in T. spiralis using EST, GSS and nucleotides (DNA and RNA sequences) by comparative genomic approach. The collected sequences from these sources were retrieved and saved in FASTA format. In BioEdit software we created local blast for nucleotide database as reference miRNAs set which contains all known worm mature miRNA sequences retrieved from miRBase, then BlastN program were performed against the T. spiralis sequences with default parameter and low e-value threshold. The resulted output were filtered by applying two conditions 1) at least 18-24 nucleotides length were assumed between the predicted mature miRNAs, more than 90% identity between the sequences. 2) Mismatches between the sequences with 0-4 nucleotides are allowed with reference sets of worm mature miRNAs. Figure 2 shows the high degree of sequence similarity between the predicted mature miRNA sequences and previously known miRNAs. Gap in alignment is due to insertion or deletion of the nucleotides. Further, the precursor sequences were searched at 50 nucleotides upstream and downstream from the location of mature miRNA. Since miRNA are non-coding nucleotides, we performed BLASTX 2.2.30 search against non-redundant (NR) protein databases in order to discard the protein coding miRNA candidates. The remaining precursor miRNA candidates were analysed to predict secondary structure by using RNAfold. Figure 3 shows precursor miRNA and mature sequences in stem loop structures. All the mature miRNA were found in the stem portion of hairpin structures containing less than 6 mismatches in the opposite arm without break or loop inside the mature sequences. Mature sequence resides either in 3' arm or in 5' arm of the secondary structures. In our studies we considered from 60-120 nucleotides and 19 to 24 nucleotides of precursor miRNAs and mature miRNA sequences respectively. Within the genome, repeated sequences were removed, finally we identified 11 potential novel miRNAs namely tsp- Further the functional information of each miRNA target genes were search using Uniprot database. Gene annotations were functionally categorized into three parts: cellular component, biological process and molecular functions. In our analysis around 38% are contribute for molecular function, 33% and 29% responsible in biological process and cellular component respectively Figure 4. The target genes in the biological category are highly involved in various metabolism and signal transduction, in the cellular process mainly the component of the membranes followed by nucleus and in the molecular process highest percentage in enzymatic activity, nucleic acid and ion binding. The predicted miRNA candidates target genes were found in various regulatory roles during growth and developments. Our studies help us to understand important regulatory network and roles of miRNAs in T. spiralis.

Conclusion:
T. spiralis is food borne zoonotic nematode causative agent of trichinellosis in Human. Our study is the first attempt to identify and characterize conserved novel miRNAs with its target mRNA in T. spiralis. In this report we identify 11 novel miRNA candidates which were evolutionarily conserved with previously known miRNAs register in miRBase. Analysis of miRNA targets within the genome of T. spiralis, help to understand the regulatory function of each miRNA candidates. In addition, gene annotation analysis shows that most of the target transcripts are involved in various metabolism, enzymatic activity and constituents of membrane components. Overall this helps to understand conserved miRNAs in evolution, genome organization and gene regulatory networks in T. spiralis. The predicted result from computational approach would be the next step in future to guide experimental methods to design, which would through more light into the pathogenesis related control, in better understanding of prevention of infection.