Highest Voted Questions - Bioinformatics Stack Exchange

3

votes

1 answer

How to create a nexus file for BEAST?

I am learning to use BEAUTI and BEAST software. I followed their tutorial and was able to run example data without any problem. Now, I want to do phylodynamic analysis on my data. I have downloaded 70 picornavirus sequences from the gene bank. My…

asked May 26 '18 at 21:19

L R Joshi

719
3
11

3

votes

1 answer

How to extract the protein fasta file from a genbank file?

Let's say I have a genbank file, e.g. the one downloaded from here, that contains entries as gene 190..255 /gene="thrL" /locus_tag="b0001" /gene_synonym="ECK0001; JW4367" …

asked May 25 '18 at 21:04

Cleb

743
7
18

3

votes

1 answer

RunUMAP in Seurat not working: module 'umap' has no attribute 'UMAP'

I am trying to run UMAP in the following way: RunUMAP(seurat_object, dims.use = 1:15, reduction.use = "pca", reduction.name = "umap", reduction.key = "UMAP", n_neighbors = 30L, min_dist = 0.3, metric = "correlation") Or just…

asked May 25 '18 at 00:38

Nikita Vlasenko

2,558
3
26
38

3

votes

1 answer

How to apply function on bedfile based on a reference bedfile

I have two bed-like files: File1 chr1 1 10 . 1 Sample1 chr1 20 180 . 1 Sample1 chr1 1 5 . 1 Sample2 chr1 70 100 . 1 Sample3 File2 (reference) chr1 1 5 . 0 chr1 5 …

asked May 24 '18 at 20:56

gc5

1,783
18
32

3

votes

1 answer

How can I get a list of papers where a particular mutation has been mentioned?

I have recently tried a few tools (MutationFinder, tmVar, PubTator etc.) that extract mutations from a given text. However, for my work I need something that will take a list of mutations as an input and return a list of pubmed id in which the…

mutations

asked May 24 '18 at 15:04

Musfiqur Rahman

33
4

3

votes

3 answers

Is there a way to do GWAS on phenotype data that is not normally distributed?

Is there a way to do GWAS on phenotype data that is not normally distributed? For example, if you were measuring a trait as a proportion, and the majority of the data consisted of 0.00 with a long tail of variation. Or, perhaps the data was skewed…

asked May 15 '18 at 21:30

jhurst5

69
1
5

3

votes

1 answer

How to extract exome on-target reads in batch?

I was given a list of target regions in BED and many exome alignments in BAM. I was asked to extract on-target alignments from these BAMs to save disk space. I know I can use bedtools to extract sub-BAMs. I am thinking to write a script to apply…

asked Jun 03 '17 at 23:19

medbe

847
1
7
9

3

votes

1 answer

BioMart Ensembl id imperfect retrieval

I have a list of gene symbols: c("cd45", "Tmem119", "CD11b", "P2Yr12", "Csf1r", "Bst2", "Cd74", "Cx3cr1", "Trem2", "Lyz2", "GLAST", "GFAP", "ALDH1L1", "Aldoc", "Aqp4", "Glul", "S100a", "Olig1", "Olig2", "Olig3", "Mbp", "Pdgfra", "Pecam", "Cldn5",…

asked May 04 '18 at 19:13

Nikita Vlasenko

2,558
3
26
38

3

votes

1 answer

tximport txidcol and geneidcol selecting wrong column

I'm trying to run a differential expression analysis of 2 samples of the same species, but with different treatments using DESeq2. I'm using Ensembldb to create a tx2gene dataframe for tximport, however tximport does not select the columns I…

asked May 02 '18 at 21:49

wolff

33
4

3

votes

1 answer

Will blastn remove sequences from a search with low identity?

I'm using this command (excuse the duplicate naming, I know it's bad form): blastn -query cm-seqs/combined_seqs.fna -db combined_seqs.fna -out cm-matched.txt -num_alignments 1 -outfmt 10 to take a set of fasta sequences to blast against another…

blast

asked Apr 30 '18 at 13:16

Samaj Oruel

33
2

3

votes

1 answer

What are negative FSC-A values in flow cytometry data?

I did flow cytometry analysis of E. coli cells 7 hours post inoculation (1% overnight culture in 3ml). I am analysing the FCS3.0 files using the flowCore package in R. I observe that FSC-A has negative values in some events, although the…

flow-cytometry

asked Apr 27 '18 at 12:49

WYSIWYG

263
2
10

3

votes

2 answers

Official documentation for textual BLAST output?

This is pretty vexing: I got asked to explain the meaning of the match line in a blastp output, especially the subtle difference between a blank and plus. E.g.: Query 181 NHIEEL... N I+EL... Sbjct 181 NQIKEL... For me this is…

blast

asked Apr 24 '18 at 20:33

BaCh

734
4
9

3

votes

3 answers

How can I find the chromosomal location of a list of genes?

I have a list of genes nearly 20000: gene name (column1) and coordinates (columns 2 and 3) head genes Ma0_73Ns 2359 2923 Ma0_73Ns 4727 4868 I want to find the chromosomal location of each gene, on which…

asked Apr 24 '18 at 18:49

Anna1364

516
2
8

3

votes

1 answer

Is there a publicly available tumor-normal sample?

I am looking for a publicly available matched tumor-normal sample. I need Illumina fastq reads (or an aligned bam file, since I could extract the reads from it) from a tumor and a matching, non-tumor control set from the same patient. Ideally, I…

asked Apr 20 '18 at 14:52

terdon

10,071
5
22
48

3

votes

2 answers

Generate SNP/indel annotation in Arabidopsis

I have a genome-wide list of germline SNPs and short indels for Arabidopsis thaliana, which I generated with Varscan. Regardless of the tool used to generate them, I would like to annotate them, i.e. knowing which ones can cause an aminoacid change…

asked Apr 18 '18 at 23:56

Federico Giorgi

141
5

Most Popular