Basic Statistics
Measure | Value |
---|---|
Filename | ERR1630038.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1293983 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 5046 | 0.3899587552541262 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4827 | 0.3730342670653324 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 3577 | 0.27643330708363245 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2422 | 0.18717402006054173 | No Hit |
CTCCAGGACAGGCTGCATCAGAAGAGGCCATCAAGCAGATCAC | 1827 | 0.14119196310925258 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 1305 | 0.0 | 25.659004 | 1 |
TACGCAG | 50 | 0.0070353476 | 18.5 | 10 |
GTATAGG | 135 | 1.1532393E-9 | 17.814816 | 1 |
TTAGACT | 125 | 8.580173E-9 | 17.76 | 4 |
ACGTTAC | 75 | 2.0675201E-4 | 17.266666 | 26 |
GACGTTA | 75 | 2.0675201E-4 | 17.266666 | 25 |
TACCCCG | 120 | 1.0414624E-7 | 16.958334 | 5 |
TAGACTA | 120 | 1.0414624E-7 | 16.958334 | 5 |
GTATCAA | 2010 | 0.0 | 16.659203 | 2 |
GACAGGC | 600 | 0.0 | 16.341665 | 7 |
CGTGCGA | 205 | 0.0 | 16.243904 | 10 |
ATTCGCG | 80 | 3.3829754E-4 | 16.1875 | 30 |
TATCCGG | 70 | 0.0025927299 | 15.857142 | 10 |
CGAGAAT | 225 | 0.0 | 15.622222 | 14 |
GTATTGG | 185 | 3.0559022E-10 | 15.0 | 1 |
CGTTACT | 100 | 1.0935323E-4 | 14.8 | 27 |
CGACGTT | 75 | 0.0041047777 | 14.799999 | 24 |
ACGCTAA | 75 | 0.0041047777 | 14.799999 | 34 |
AACTACG | 75 | 0.0041047777 | 14.799999 | 7 |
GCATCAG | 555 | 0.0 | 14.666667 | 15 |