Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Packages for annotations #11

Open
TomSmithCGAT opened this issue May 12, 2022 · 1 comment
Open

Packages for annotations #11

TomSmithCGAT opened this issue May 12, 2022 · 1 comment

Comments

@TomSmithCGAT
Copy link
Member

'Annotation and Visualisation of Differential Expression Results' explains there are multiple ways to obtain annotations and then demonstrates how to do this with AnnotationHub, which seems perfectly reasonable.

Today we will use the `AnnotationHub` method. A workflow for annotation with
biomaRt is included in the extended materials section accessible on the course
website.

However, after this, we then load a set of annotations that was obtained via biomaRt. I assume this is a legacy issue. This is a bit confusing and I would suggest the loaded annotations should be obtained with AnnotationHub too.

## One we prepared earlier and one-to-many relationships
To ensure everyone is working with same annotation, we have created an annotation table.
In this case we used the `biomaRt` package to download annotations directly from
Ensembl. In this cases we can get additional columns, but will also sometimes get
one-to-many relationships, where one Ensembl ID maps to multiple Entrez IDs. This
sort of problem is common when mapping between annotation sources, but they have
already been dealt with for us in AnnotationHub. If we wanted more control over
this we would need to manually curate these one-to-many relationships ourselves.
In annotation table below we have modified the column names and dealt with the
one-to-many/missing issues for Entrez IDs. The code we used for doing this is
available in the [extended materials section](S6_Annotation_With_BioMart.html).
We will load out pre-created annotation table, and then combine it with our
results table.
```{r addAnnotation, message=FALSE}
ensemblAnnot <- readRDS("RObjects/Ensembl_annotations.rds")

@AshKernow
Copy link
Contributor

Yes, we should look at resolving this discrepancy. We no longer need the Chr, Start etc columns, just Symbol and Entrez.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants