Basic Statistics
Measure | Value |
---|---|
Filename | ERR1142536_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 709784 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2810 | 0.3958950892102386 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2302 | 0.32432401970176844 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1387 | 0.19541156182726013 | No Hit |
GTACATGGGAAGCAGTGGTATCAACGCAGAGTACATGGGAAGCAGTGGTA | 928 | 0.13074400099185104 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATCACG | 65 | 9.094947E-11 | 73.21352 | 2 |
CTCAACG | 340 | 0.0 | 45.518143 | 1 |
GTATCAA | 4300 | 0.0 | 30.592422 | 1 |
GGTATCA | 3150 | 0.0 | 30.423235 | 1 |
CATGGGG | 2230 | 0.0 | 28.275848 | 4 |
TTTACCG | 85 | 0.007216837 | 27.993402 | 3 |
ATCACGC | 170 | 4.4297485E-7 | 27.993402 | 3 |
TATACTC | 160 | 8.751178E-6 | 26.025116 | 5 |
ATCAACG | 5150 | 0.0 | 25.98902 | 3 |
CCGCAGA | 400 | 0.0 | 25.297583 | 1 |
TATCAAC | 5330 | 0.0 | 25.222948 | 2 |
TCAACGC | 5420 | 0.0 | 24.913872 | 4 |
CAACGCA | 5465 | 0.0 | 24.817574 | 5 |
AACGCAG | 5605 | 0.0 | 24.622208 | 6 |
ATGGGGG | 1260 | 0.0 | 23.605547 | 5 |
ACGCAGA | 6020 | 0.0 | 22.626791 | 7 |
GTATCAC | 240 | 3.5453377E-7 | 22.321396 | 1 |
GAGTACT | 2545 | 0.0 | 22.201853 | 12-13 |
CGCAGAG | 6290 | 0.0 | 21.654007 | 8 |
ACATGGG | 5535 | 0.0 | 21.601955 | 3 |