mock

Oligotyping Results for "mock"

A user-friendly interface to make sense of oligotyping results.

Overview

» A summary of what happened.

Oligotyping analysis was performed on 859,302 reads from 3 samples for "mock" with oligotyping pipeline version 0.96 (available from http://oligotyping.org) using 1 component automatically selected form the highest entropy values following the initial entropy analysis. To reduce the noise, each oligotype required to (1) appear in at least 1 sample, (2) occur in more than 0.0% of the reads for at least one sample, (3) represent a minimum of 0 reads in all samples combined, and (4) have a most abundant unique sequence with a minimum abundance of 50. Oligotypes that did not meet these criteria were removed from the analysis. The final number of quality controlled oligotypes revealed by the analysis was 3, and they represented 859302 reads, which was equivalent to 100.00% of all reads analyzed.

General Information

» Meta

Run date	16 Jul 13 15:49:12
Library version	0.96

» Given Parameters

Number of entropy components to be selected automatically	1
Min number of samples oligotype expected to appear	1
Min % abundance of oligotype in at least one sample	0.0%
Min total abundance of oligotype in all samples	0
Min substantive abundance of an oligotype (-M)	50
Quality scores were provided	False
Oligotype sets were requested to be generated	False

» Components used for Oligotyping

Base locations of interest in the alignment

» Quality filtering results

Number of sequences analyzed	859,302

Number of unique oligotypes (raw)	3
Oligotypes after "min number of samples" elimination	3
Oligotypes after "min % abundance in a sample" elimination	3
Oligotypes after "min substantive abundance (-M)" elimination	3

Number of sequences analyzed	859,302
Number of samples found	3
Number of sequences represented after quality filtering	859,302
Percentage of reads represented in results	100.00%

Downloads

» Files to analyze results further via third partry applications

Representative sequences per oligotype	oligo-representatives.fa.txt
Sample/oligotype abundance data matrix (percents)	matrix_percents.txt
Sample/oligotype abundance data matrix (counts)	matrix_counts.txt
Read distribution among samples table	read_distribution.txt
GEXF file for network analysis	network.gexf
Environment file	environment.txt
FASTA file for abundant oligotypes	oligos.fa.txt
NEXUS file for abundant oligotypes	oligos.nex.txt

Entropy

» Entropy values that components were picked from

Stackbar

» Figure shows the oligotype distribution profiles among samples. TAB separated files matrix_percents.txt and matrix_counts.txt hold the information that were used to generate this figure.

Read Distribution per Sample

» Total number of reads for each sample that were analyzed.

Oligotypes

» Abundant oligotypes along with their frequencies within the project are shown below. Every oligotype is followed by a representative sequence, which is the most frequent read in all reads that were collected by the given oligotype. Mouseover on an oligotype will popup a figure that shows the abundance distribution of unique sequences within the oligotype, along with the new entropy (in an ideal world there should be only one unique read with 0 entropy, but due to the random sequencing errors it is almost never the case). ✓ indicates that the most frequent sequence which was represented by the given oligotype hit something during the BLAST search with 100% identity in NCBI's nr database. In contrary, ✗ sign means that there was no perfect hit during the BLAST search.

C b |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

✓ 383,788 C b TACGGAGGATCCGAGCGTTATCCGGATTTATTGGGTTTAAAGGGAGCGTAGATGGATATTTAAGTCAGTTGTGAAAGTTTGCGGCTCAACCGTAAAATTGC

✓ 297,937 G b TACGGAGGATGCGAGCGTTATCCGGATTTATTGGGTTTAAAGGGAGCGTAGACGGGTTTTTAAGTCAGCTGTGAAAGTTTGGGGCTCAACCTTAAAATTGC

✓ 177,577 T b TACGGAGGATTCGAGCGTTATCCGGATTTATTGGGTTTAAAGGGAGCGTAGGCGGGTTGTTAAGTCAGTTGTGAAAGTTTGCGGCTCAACCGTAAAATTGC