I would like to trim/mark adapters using trimmomatic or picard MarkIlluminaAdapters from a series of Illumina Paired-End read fastqs. The fastq files may have been done using different kits or different generations/versions of illumina chips.
Without knowing ahead of time how the libraries were prepared (or which kit was used), Is there a safe "catch all" adapter sequence (or set of such sequences) that can be passed to a trimming tool and that would cover all Illumina kits? In other words, what are we expected to know about the fastq to ensure correct trimming?
edit: Picard defines a few built-in adapters which cover many standard kits, but it's not immediately obvious if you need to pass a common one (built-in sequences defined here).
One benefit of using MarkIlluminaAdapters is that it simply "marks" the sequences using
– init_js Mar 19 '19 at 21:28XT:i:SAM trimming tag, rather than removing them altogether from the output bam/sam.XT:iadapter trimming annotations) into fastq that has adapters hard-clipped. The mapper is run on that clipped fastq (like you say, because most of them only support fastq). And then this tool, MergeBamAlignment, combines the mapping information from the aligned BAM into the full, soft-clipped, reads in the unaligned SAM. It might be straighforward to compute theXT:iannotations by comparing the fastp input/output , but I haven't fiddled with it. I'll consider. – init_js Mar 21 '19 at 19:01