What Approach Would You Recommend For Large Indel Detection With Solid Data
I've been spending quite some time on following problem: I sequenced a bacterial genome using paired-end reads (SOLiD) and I have a quite good reference sequence. My goal is to detect changes in the...
View ArticleCalculating Tn (True Negatives) For An Indel Detection Method
This is a kind of follow-up inspired by the very good question/answers I read in "How to calculate sensitivity/selectivity of an algorithm that returns locations of possible matches?"My goal is to...
View ArticlePython - finding INDELS and SNPS
seq_a = 'GAGAGATTTTCCAATTCGACG-------CGGGGTCAGG--GAAATTT'Â Â seq_b = 'GAGAGATTGGCCTTAACTACCCAACCCACGGCCTGACCGAGGTCTTC'G A C T = Bases- = INDELPYTHON - I am very new to programming and need some help, I...
View ArticleMemory Allocation for VarScan
I am running the somatic variant caller in VarScan on several files. Â On average, the input .pileup files are ~50 GB each (with both tumor and normal samples).I've noticed that I've gotten the...
View ArticleGatk Threshold Parameter For Clipping Of Reads During Indel Calling Running...
GATK 1.6 claims to apply a Q20 threshold to clip ends of the reads for the indel caller:http://www.broadinstitute.org/gatk/guide/article?id=12373. Indel Calling with the Unified Genotyper [...] while...
View ArticleMerging Vcf Files And Filling In Missing Genotypes
I have vcf files with genomic variants (e.g. SNPs and Indels) from several patients. I now want to merge them into one vcf file e.g. using vcftools' merge-vcf. Doing that I get a file that holds for...
View ArticleHow To Handle Reads Ending With Deletions In Gatk?
Hello,before asking my question, I should point out that I'm working with data that's not my own (publicly available), to learn and establish a proper workflow when real data wlll arrive in the...
View ArticleFiltering Vcf File
I was wondering how to filter the vcf file based on a few input arguments ( DP>10, MQ>30 and QD>20 or GT = "1/1" etc)? I m planning to use simple command on the command line to extract the...
View ArticleClassify Mutations Which Are Found Ny Using Exome Sequencing
Hi, everyone. I used exome sequencing to sequence tumor and blood, and after I remove mutations in blood and snp in dbsnp131, I found I still have about 2000 mutations. I read the exome sequencing...
View ArticleHow To Identify/Flag The Presence Of Indels On The Read Level Nearby A Given...
Let's say I have list of genomic positions, e.g. SNP calls in a VCF file, and a BAM file with read alignments. What would be the best way to, for each of those positions, determine if in the BAM file...
View ArticleWhat Kind Of Analysis Can Do You With Vcf Files?
What kinds of analyses can be completed if you have SNP, indel, and SV data in vcf files? I just need a few very broad and generic examples.
View ArticleGatk Somaticindeldetector Vcf Info Field Missing
Hey everyone,I am a bit confused. I called my INDELs pairwise using the SomaticIndelDetector. But in the resulting VCF the info field seems to be missing, is this normal? Or am I getting something...
View ArticleIndel Discovery Delly, Pindel, Samtools, Gatk
I'm testing out samtools vs GATK for snp and indel calling, and looking at using pindel for SV in particular focusing on insertions and Delly for the other SV. What experience do people have of SNP...
View ArticleHow To Transform Adjacent Complex Variants In Vcf Format To Individuals'...
Hello, everyone. Recently, I have downloaded the latest variant data of 1000genomes project from ftp://ftp.ncbi.nlm.nih.gov/1000genomes/ftp/release/20110521/ and the mapped data from...
View ArticleStrelka Indel Allele Counts
Hopefully there are some Strelka users out there who can help with this one. I'm looking for allelic counts of the ref and non-ref allele of the indels.I see these flags in the VCF header, do these...
View ArticleIndel Notation In Variant Calling
HelloI am sorry for the basic question but I am struggling to find any details of the nomenclature of indels by variant calling software. Unfortunately i am unable to access the details of the software...
View ArticleHow To Call Variants (Snp, Indel, Sv) On A Bac Contig Aligned With Bwa Mem To...
I aligned a BAC contig (assembled from sanger sequences) to a reference genome using BWA-mem. The output alignments are very similar to the best end to end alignment I got from aligning the bac contig...
View Article1000 Genomes Indel/Snps File
Is there any place to download 1000 genome SNPs and Indels data in a single merged file for each pilot (pilot 1 , 2 , and 3) ? I am trying to use GATK and thought to use 1KG indels and SNPs vcf files...
View ArticleHow To Retrieve Reads Supporting A Snp/Indel
I used samtools and varscan for variant calling like this:samtools mpileup \ -f genome.fa \ tku.sorted.rmdup.bam \ | java -jar VarScan.v2.3.5.jar pileup2snp \ --min-coverage 8 \ --min-reads2 2 \...
View ArticleHow Do You Do Local Realignment Around Indels For Exome Sequencing?
Hi all, I am wondering how people generally go about local realignment around indels in their workflow? The most popular programs seem to be SRMA and GATK - does anyone use anything else?In my initial...
View Article