摘要:
High-throughput RNA sequencing was performed for comprehendsively analyzing the transcriptome of thegeese.A total of 28,803,759 by of raw sequence data were generated,of which 28,730,361bp remained afteradaptor trimming (99% of sequenced bases).After removal of adaptor sequences,117,279 reads were obtained,with an average length of 244 bases.Assembly ofall the reads from both 454 and illumine platforms,formed56,839 contigs,with 12,995 readsremaining as singletons,Ranging from 38 to 28206 by in size,with an averagesize of 2584 by and an N50 of 4624.The assembly produced a substantial number of large contigs:35,545 (62.5%)were longer thanlkb,of which 8850 15.6%)were longer than Skb.The sequencing depth was 85 X on average.Weperformed comprehensive function annotations on unigenes including protein sequence similarity,GO termclassification,and KEGG pathway enrichment.Gene ontology (GO) analysis showed that approximately 63% ofthe contigs had annotation information,among the 36,756 annotated sequences in Nr database,24,299(66.1%)sequences were assigned with one or more GO terms.There were 19,757(81.3%)isotigs for biological processes,10,643(43.4%)isotigs for cellular component,22818 (93.9%) isotigs for molecular function.The result of KEGGpathway mapping 8,737 sequences had the pathway annotation,and took part in477 pathways.In addition,different expression genes in ovarian tissue between different states of Sichuan White geese were detected in thecurrent study.