We are developing transcriptome analyses using mRNA-Seq. cDNAs libraries obtained by retrotrascription of mRNA random fragments, were sequenced with Illumina Genome Analyzer II who provides million of short sequences 36 or 75 nt long in paired-ends or single reads. By using two different softwares for alignment of sequences on the reference genomes (ELAND and BOWTIE) we mapped the GAII reads to exons, introns or intergenic regions, hence identifying new exons or new genes not predicted. We then adapted modules of the ERANGE program for digital gene expression analysis, detection of alternative splicing (annotated or new) and SNPs discovery. Assigned expression values showed a high reproducibility among technical replicates. Specific features of this values are that they are not subjected to background noises or saturation as this simply derives from a the number of reads obtained from GAII for each gene, and this number is directly proportional to the copies number of that transcript. Starting from mRNA isolated from total RNA is therefore possible, with this analysis, to estimate gene expression level of all gene transcripts in a cell in a given biological moment or in a particular physiological or pathological state. A strong bioinformatics engagement was necessary to develop a modified module in ERANGE software dedicated to the alternative splicing detection.

Transcrip profiling using next-gen sequencing technologies

GIACOMELLI, Enrico;XUMERLE, Luciano;FERRARINI, Alberto;DELLEDONNE, Massimo
2009-01-01

Abstract

We are developing transcriptome analyses using mRNA-Seq. cDNAs libraries obtained by retrotrascription of mRNA random fragments, were sequenced with Illumina Genome Analyzer II who provides million of short sequences 36 or 75 nt long in paired-ends or single reads. By using two different softwares for alignment of sequences on the reference genomes (ELAND and BOWTIE) we mapped the GAII reads to exons, introns or intergenic regions, hence identifying new exons or new genes not predicted. We then adapted modules of the ERANGE program for digital gene expression analysis, detection of alternative splicing (annotated or new) and SNPs discovery. Assigned expression values showed a high reproducibility among technical replicates. Specific features of this values are that they are not subjected to background noises or saturation as this simply derives from a the number of reads obtained from GAII for each gene, and this number is directly proportional to the copies number of that transcript. Starting from mRNA isolated from total RNA is therefore possible, with this analysis, to estimate gene expression level of all gene transcripts in a cell in a given biological moment or in a particular physiological or pathological state. A strong bioinformatics engagement was necessary to develop a modified module in ERANGE software dedicated to the alternative splicing detection.
2009
RNA-seq; bioinformatics; next-gen sequencing
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11562/430151
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact