Annotating genomes the Ensembl way¶
This session is part of Biodiversity Genomics Academy 2023
Ensembl Genome Annotation Project Leader
Jose Maria Gonzalez Perez-Silva
Part 1: General concepts + Case study¶
By the end of this part you will have:
- Obtained an overview of state-of-the art methods and tools for genome annotation
- Understood their advantages/limitations in different use-cases
- Gained a picture of the quality measures of existing annotations
- Discussed the future of genome annotation methodologies"
Slides for this session can be found here.
A list of useful tools for annotating genomes can be found here (download this file to access hyperlinks to tools).
Post-session additional notes can be found here
Part 2: Hands on - From RNAseq reads to gene models¶
By the end of this part you will be able to:
- Perform quality control and pre-processing of RNA-Seq reads: This includes trimming of adapter sequences and quality control to ensure that the reads are of sufficient quality to be used for analysis.
- Align RNA-Seq reads to the genome: The reads are aligned to the genome assembly, allowing for the identification of expressed genes and transcripts.
- Assemble transcripts: The aligned reads are used to reconstruct the transcripts, or messenger RNAs (mRNAs), that are expressed in the sample.
- Annotate genes: The identified genes can be annotated using a combination of homology-based and ab initio gene prediction methods. The homology-based methods use existing gene information from closely related species, while the ab initio methods use the transcript assembly information to predict new genes.
- Understand the terms genome assembly, reads, contigs
- Understand what a gene model is, what introns and exons are
- Understand what transcriptomic and protein data are, including understanding the different types of RNA
- Understand the concept of sequence alignment
Please make sure you MEET THE PREREQUISITES and READ THE DESCRIPTION above
You will get the most out of this session if you meet the prerequisites above.
Please also read the description carefully to see if this session is relevant to you.
If you don't meet the prerequisites or change your mind based on the description or are no longer available at the session time, please email tol-training at sanger.ac.uk to cancel your slot so that someone else on the waitlist might attend.