Is there a way to estimate the size of a BAM file will have after mapping with BWA?
The input file are two mates fastq files, compressed with gzip, each one about 70G.
Is there a way to estimate the size of a BAM file will have after mapping with BWA?
The input file are two mates fastq files, compressed with gzip, each one about 70G.
A BAM file should have a roughly similar size to a compressed FASTQ file, because they're both compressed files that contain similar information. BAM will be slightly larger, because it contains information about mapping and differences from the reference.
If you are concerned about space and have a reference sequence available, you can store mapped reads as CRAM format, which will save a lot more space.