But I've discovered that first I need to get the various files sorted out and properly named. I have the fastq files from the sequencing centre on a little USB backup drive plugged into my laptop, but these files have the original confusing names, not the new informative names.
I also have a nice table made by Josh that shows both names for each sample, so in principle I could just carefully rename my files.
- There are 72 files and that's a lot of opportunities for error. If Scott has the renamed files on the Zoology cluster, he could put a copy of them into my directory there, and then I could download them onto my backup drive. That would be inefficient but safe.
- Should I really be working with files on the Zoology cluster, rather than local files?
- I want to get the 24 RNAseq files from last year too.
While waiting for advice I think I'll go ahead and use Josh's table to identify the original name of the problem sample (new name antx_M2_C, old name Sample_9C30), and pull out the reads and do those BLAST searches.