Most Popular
1500 questions
5
votes
1 answer
Is it possible to check if a patient has the HLA-B27 antigen from his exome stored in a VCF file
Is it possible to check if patient has the HLAB27 antigen by comparing his exome stored in VCF with all known HLAB27 variants?
For instance, rs4349859 is a known HLAB27 SNP.
If I can find rs4349859 in the patient's VCF file, could I say he is…
marcos
- 153
- 3
5
votes
1 answer
CRISPR Sequence Finder and Database Download
I am searching for tools to pull CRISPR Spacers from Bacterial Genomes. I am aware of the CRISPRDB and the corresponding identification tool on the web server.
Are there other tools for finding CRISPR Spacers?
Is there a way to download the spacer…
Cody Glickman
- 173
- 1
- 7
5
votes
1 answer
LD analysis in PLINK based on reference and a SNP list
I'm new to PLINK and genetics, and getting confused with two PLINK commands for LD analysis:
plink --bfile hapmap --r2 --ld-window-r2 --ld-snp-list --ld-window
plink --noweb --bfile --clump hapmap --clump inputfile --clump-field P --clump-p1…
Rob John
- 221
- 2
- 6
5
votes
1 answer
About CAFA experiment
I have a question about CAFA1 and CAFA2 experiment.
CAFA is an experiment that compare computational methods predicting protein functions. In the paper discussing CAFA2, there is an image comparing 2 methods (CAFA1 versus CAFA2):
How do I…
Amine
- 91
- 2
5
votes
2 answers
Interpreting 0x200 flag in bwa-mem alignments
I am looking at the bwamem.h code in http://github.com/lh3/bwa and found that BWA-MEM will give flag 0x200 to what it calls MEM_F_SOFTCLIP:
#define MEM_F_PE 0x2
#define MEM_F_NOPAIRING 0x4
#define MEM_F_ALL 0x8
#define MEM_F_NO_MULTI …
719016
- 2,324
- 13
- 19
5
votes
2 answers
How to get Nanopore MinION fast5 from SRA
I found some Nanopore MinION data on SRA, which I would like to investigate. I use sratoolkit for Illumina data all the time, but I am not sure how to get the fast5 file from the sra file.
What I have tried is to prefecth it, but then I don't know…
benn
- 3,571
- 9
- 28
5
votes
1 answer
Estimating BAM file from compressed fastq file size
Is there a way to estimate the size of a BAM file will have after mapping with BWA?
The input file are two mates fastq files, compressed with gzip, each one about 70G.
gc5
- 1,783
- 18
- 32
5
votes
2 answers
bcftools output vs. bgzip
I've got what seems like it should be a simple question, but I can't seem to figure it out from google. bcftools has 4 output options:
Output compressed BCF (b), uncompressed BCF (u), compressed VCF (z), uncompressed VCF (v).
Are any of these the…
heathobrien
- 1,816
- 7
- 16
5
votes
3 answers
Is there any publicly available multi-omics dataset?
I am searching for a multi-omics dataset which may include genomics, transcriptomics and proteomics (eg. snyderome).
I need such a dataset for an introductory data-exploration purpose. So it can be from any host organism and any experimental…
user345394
- 675
- 6
- 20
5
votes
1 answer
How to query a network for a bacterium, specifically Streptomyces caatingaensis or Streptomyces thioluteus?
I have a list of gene/protein IDs and I could not query a network using String, BioGrid, Intact and IIS, for Streptomyces thioluteus.
The bacteria I am working with is the Streptomyces caatingaensis. Since it is newer than others, I thought I would…
Henry
- 161
- 5
5
votes
1 answer
What is local realignment and what is the problem it solves?
I am trying to understand local realignment but I could not get a clear idea of what is the problem solved by it.
For example, reading Homer and Nelson (2010):
Because alignment algorithms map each read independently to the reference genome,…
gc5
- 1,783
- 18
- 32
5
votes
2 answers
RapMap: reference transcriptome for simulated reads
I am very new to bioinformatics and trying to repeat the benchmark in the RapMap paper with an experimental tool working in a similar but different fashion.
In the paper (taken from their github) about RapMap, an alignment/mapping tool for rna-seq…
JMC
- 153
- 4
5
votes
2 answers
BiomaRt error: Error in martCheck
I am experiencing an error while using getSequence() in biomaRt.
The code is given below:
library("biomaRt")
ensemblp <- useMart(biomart="plants_mart",host="plants.ensembl.org")
osj = useDataset("osativa_eg_gene", mart = ensemblp)
genes <-…
Learner
- 51
- 2
5
votes
1 answer
How can I specify to DEseq2 to only perform comparisons on which I am interested with?
I am currently performing a large RNA-seq analysis from mice PBMCs. The dataset contains around 6,000 transcriptomic profiles and I would like to use DESeq2 to identify the sets of differentially expressed genes in the different conditions. In…
Bob
- 95
- 1
5
votes
2 answers
Differential Expression With 2 Treatments
I have a expression data from a small cohort of samples taken at baseline and after 2 independent treatments. I can do differential expression contrasting T1 and T2 or I can contrast T1 vs baseline and T2 vs baseline and look at the differences.…
Nitro
- 153
- 4