Basic Statistics
Measure | Value |
---|---|
Filename | ERR522867_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 6524394 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 22180 | 0.3399549444745366 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 20726 | 0.3176693498277388 | No Hit |
GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 15997 | 0.2451875223967161 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 10936 | 0.16761709976436126 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TACCTGG | 6245 | 0.0 | 25.817688 | 2 |
TATAACG | 610 | 0.0 | 23.117832 | 2 |
GTACCTG | 7090 | 0.0 | 22.822031 | 1 |
GTACATG | 21670 | 0.0 | 22.400764 | 1 |
TACATGG | 21875 | 0.0 | 21.703463 | 2 |
GAGTACT | 20045 | 0.0 | 21.171785 | 12-13 |
ACATGGG | 22115 | 0.0 | 20.743973 | 3 |
TAACGCA | 710 | 0.0 | 20.522602 | 4 |
GGTATCA | 31245 | 0.0 | 20.202915 | 1 |
GTATCAA | 40110 | 0.0 | 20.182281 | 1 |
ATAACGC | 725 | 0.0 | 20.097996 | 3 |
ACCTGGG | 7615 | 0.0 | 20.060537 | 3 |
CATGGGG | 13905 | 0.0 | 19.74108 | 4 |
GTACTTT | 21520 | 0.0 | 19.403467 | 14-15 |
AGAGTAC | 30690 | 0.0 | 18.629635 | 10-11 |
TCAACGC | 43255 | 0.0 | 18.473179 | 4 |
CAACGCA | 43685 | 0.0 | 18.313427 | 5 |
ATCAACG | 43990 | 0.0 | 18.164522 | 3 |
AGTACTT | 21130 | 0.0 | 18.10509 | 12-13 |
AACGCAG | 44425 | 0.0 | 18.059061 | 6 |