Starting the process of working with summer 2022 Pycnopodia helianthoides coelomocyte RNAseq data. This will be part of my talk for SICB in January, and for WSN I’ll be focusing on presenting the work I have already done from summer 2021.

Summer 2022 Raw Data

The raw RNAseq data lives on owl: https://owl.fish.washington.edu/nightingales/P_helianthoides/

The data for summer 2022 are the files with the prefix “PSC-0”. Here’s a link to the google sheet with sample info (will put into github repo soon): here

Pre-trimming QC:

Following my protocol here: RNAseq-pipeline.md

Some adjustments to that specific protocol for summer 2022 data are:

2. Quality Check and Trimming

1. Untrimmed Data Quality Check Part I: FASTQC

Summer 2022 .fastq.gz files from Owl are on Raven in: /home/shared/8TB_HDD_02/graceac9/pycnornaseq2022

Multiqc report for untrimmed data on owl: owl.fish.washington.edu/scaphapoda/grace/pycno_2022/multiqc/mutliqc_report.html

3. Trim RNAseq data: Run fastp on Mox, then multiqc

A. rsync RNAseq data (fastq.gz) from nightingales to /gscratch/srlab/graceac9/data/pycno/RNAseq/summer2022

B. Script for summer 2022 will be in:

/gscratch/scrubbed/graceac9/jobs/20231101_pycno2022_fastp.sh

Trimmed data on Mox is: /gscratch/srlab/graceac9/analyses/pycno/20231101_PSC2022_trimming

Moved the trimmed data’s multiqc report to owl: owl.fish.washington.edu/scaphapoda/grace/pycno_2022/multiqc/trimmed/multiqc_report.html