New DEG list comparing temperature - annotation, enrichment; annotation of oyster differentially abundant protein list
Today I worked with the new DEG list from
DESeq2 - comparing elevated (10˚C) and decreased (4˚C) temperature treatments. Annotated with uniprot-SP-GO and used DAVID to get enriched biological processes. Additionally, since it could be interesting to compare between oyster and crab projects, I annotated the list of differentially abundant proteins from the oyster project.
NEW DEG list
DESeq2 performed by SR. Then I went through script.
Comparison –> libraries 8+10 to libraries 9+11 (new crab paper table 1)
USED SAME TRANSCRIPTOME AS THE ONE USED FOR THE INFECTED STATUS DEG COMPARISON (transcriptome v. 1.5)
In new repo for paper: RobertsLab/paper-crab
DESeq2 –> scripts/02-DeSeq-Temperature.Rmd
List of 423 differentially expressed genes: analyses/DEG-temperature.tab
I did the same process that I did for the previous DEG list comparing infection statuses. However the .tab DEGlist output from
DESeq2 has the Trinity IDs as row names, not as a column, so in order to name the Trinity IDs column as “Trinity_ID” for
join-ing purposes, I opend file up in excel and named the first column (analyses/DEG-temperature.txt).
- Annotated the 423 DEG list with uniprot-SP-GO.sorted and
blastoutput from the transcriptome (v. 1.5): scripts/annotate-temp-DEGlist.Rmd
- Resulting file: analyses/pool_temp-annot_DEGlist.tab
The same thing happened here as with the infection status annotation –> the annotated list has more rows that the original list of 423 DEGs… likely because the transcriptome
blast output has duplicate Trinity IDs listed. Therefore, the number of annotated DEGs (339) is likely not a true number. Still need to figure out how to fix this.
(also need to fix that in the infections status DEG list annotation: RobertsLab/project-crab/blob/master/scripts/annotate-2019infection_status-DEGs.Rmd)
Oyster: annotated list of differentially abundant proteins
I had a list of differentially abundant proteins already, but it wasn’t annotated with GO or the C. gigas
blat output. So I did that:
R script: scripts/03-join-annotate-diffexp-prot.R
Annotated list: analyses/diff_abundant-prot-annotated.tab
There were 69 differentially abundant proteins. The number was gotten by using a cut-off of >2 log2 fold change and <-2 log2 fold change from the list of 2,808 detected proteins in the samples.
Next steps for thesis work:
- analyses for more crab results - differential expression analsyes for temperature, probably some heat maps, look at variation of certain DEGs across all libraries, …
- practice talk Friday: working on flow of information - will be a very rough run-through. Defense date June 2nd.
- maybe do some comparisons between oyster and crab differentially abundant/expressed lists…?
before turn in thesis (due by June 26th at latest):
- write introduction to thesis tying two projects together (maybe also a conclusion tying the results together?)