[miso-users] error with miso run
Yarden Katz
yarden at MIT.EDU
Tue Jan 29 15:14:58 EST 2013
Hi Fan,
A few questions on this:
1. Is your BAM file sorted and indexed, with the index (which in this case should be called "010_accepted_hits.bam") in the directory "010_bam/", i.e. in the same directory of the BAM? If not, the call will fail.
2. Do your chromosome headers match? Note that "Homo_sapiens.GRCh37.65.min_1000.const_exons.gff" is Ensembl derived, so it will contain Ensembl style chromosome headers (e.g. "1", "2", "3" for chromosomes 1, 2, 3, respectively), while if your genome index for Tophat/Bowtie was made with UCSC, it will contain the "chr" prefix. If that's the case, none of your reads will map by "tagBam" and the call will fail. See:
http://genes.mit.edu/burgelab/miso/docs/#answer4
http://genes.mit.edu/burgelab/miso/docs/#human-mouse-gene-models-for-isoform-centric-analyses
3. Finally, try executing:
tagBam -i /home/fan/Meningiomas_RNAseq/010_bam/010_accepted_hits.bam -files /home/fan/Downloads/exon_hg19/Homo_sapiens.GRCh37.65.min_1000.const_exons.gff -labels gff -intervals -f 1 | samtools view -h - | egrep '^@|:gff:'
If this call comes out empty, it's likely issue #2, a mismatch of headers.
Best, --Yarden
On Jan 29, 2013, at 3:03 PM, Fan Gao wrote:
> Dear MISO community,
>
> I am a freshman to this software. Based on installation instruction, I installed MISO on my laptop that has been pre-installed with Ubuntu. I also installed bedtools and samtools as required.
>
> I got the following error message when I tried to compute insert length. The bam file I used was from tophat.
> Does anyone know what is the problem?
>
> Thanks a lot.
>
> Fan
> ***************************************************************************************************************
> fan at fan-X45U:~/Meningiomas_RNAseq$ python ~fan/yarden-MISO-4f32ca4/misopy/pe_utils.py --compute-insert-len 010_bam/010_accepted_hits.bam /home/fan/Downloads/exon_hg19/Homo_sapiens.GRCh37.65.min_1000.const_exons.gff --output-dir 010_insert-dist/
> Computing insert length distribution of 1 files:
> /home/fan/Meningiomas_RNAseq/010_bam/010_accepted_hits.bam
> - Using const. exons from: /home/fan/Downloads/exon_hg19/Homo_sapiens.GRCh37.65.min_1000.const_exons.gff
> - Outputting to: /home/fan/Meningiomas_RNAseq/010_insert-dist
> - Minimum exon size used: 500
> Making directory: /home/fan/Meningiomas_RNAseq/010_insert-dist
> Getting constitutive exons...
> - Input GFF: /home/fan/Downloads/exon_hg19/Homo_sapiens.GRCh37.65.min_1000.const_exons.gff
> - Output dir: /home/fan/Meningiomas_RNAseq/010_insert-dist
> - Output format: gff
> - Including only exons greater than or equal to 500-bp
> Constitutive exons GFF was given, so not outputting another one.
> Filtering BAM reads
> Fetching reads in constitutive exons
> Mapping BAM to GFF...
> - BAM: /home/fan/Meningiomas_RNAseq/010_bam/010_accepted_hits.bam
> - GFF: /home/fan/Downloads/exon_hg19/Homo_sapiens.GRCh37.65.min_1000.const_exons.gff
> - Output file: /home/fan/Meningiomas_RNAseq/010_insert-dist/bam2gff_Homo_sapiens.GRCh37.65.min_1000.const_exons.gff/010_accepted_hits.bam
> Preparing to call bedtools 'tagBam'
> tagBam -i /home/fan/Meningiomas_RNAseq/010_bam/010_accepted_hits.bam -files /home/fan/Downloads/exon_hg19/Homo_sapiens.GRCh37.65.min_1000.const_exons.gff -labels gff -intervals -f 1 | samtools view -h - | egrep '^@|:gff:' | samtools view -Shb -o /home/fan/Meningiomas_RNAseq/010_insert-dist/bam2gff_Homo_sapiens.GRCh37.65.min_1000.const_exons.gff/010_accepted_hits.bam -
> [samopen] SAM header is present: 25 sequences.
> [sam_read1] reference 'ID:TopHat CL:/home/fangao/tophat-2.0.4.Linux_x86_64/tophat -p 12 -o CGATGT_fusion_new2 --solexa1.3-quals --fusion-search --bowtie1 /home/fangao/seqlib/bowtie_index_hg19/hg19 s_1_1_CGATGT_sequence.txt s_1_2_CGATGT_sequence.txt VN:2.0.4
> ' is recognized as '*'.
> [main_samview] truncated file.
> Traceback (most recent call last):
> File "/home/fan/yarden-MISO-4f32ca4/misopy/pe_utils.py", line 528, in <module>
> main()
> File "/home/fan/yarden-MISO-4f32ca4/misopy/pe_utils.py", line 525, in main
> sd_max=sd_max)
> File "/home/fan/yarden-MISO-4f32ca4/misopy/pe_utils.py", line 272, in compute_insert_len
> output_dir)
> File "/usr/local/lib/python2.7/dist-packages/misopy/exon_utils.py", line 187, in map_bam2gff
> raise Exception, "Error: tagBam call failed."
> Exception: Error: tagBam call failed.
>
> **************************************************
> Fan Gao, Ph.D
> Research Associate
> Zilkha Neurogenetic Institute
> Eli and Edythe Broad Center for Regenerative Medicine and Stem Cell Research
> Keck School of Medicine
> University of Southern California
> 1425 San Pablo Street, BCC 409B
> Los Angeles, CA 90089
> United States
> Phone: 323-442-2497
> Email: gaof at usc.edu
> **************************************************
> _______________________________________________
> miso-users mailing list
> miso-users at mit.edu
> http://mailman.mit.edu/mailman/listinfo/miso-users
More information about the miso-users
mailing list