Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042037.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 722756 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 52 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGGTAGGCACACGCTGAGCCAGTCAGTGTAGCGCGCGTGCAGC | 937 | 0.12964264565081438 | No Hit |
GTGTAGCGCGCGTGCAGCCCCGGACATCTAAGGGCATCACAGA | 742 | 0.10266258599029271 | No Hit |
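
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics, times 100. The sketch below is a minimal check of that reading, not FastQC's own code; the function name `percentage_of_total` and the dictionary layout are hypothetical, chosen only for illustration.

```python
# Minimal sketch: recompute the Percentage column from Count and Total Sequences.
# Assumption: percentage = count / total_sequences * 100 (inferred from the
# reported numbers, not taken from FastQC's source).

TOTAL_SEQUENCES = 722756  # "Total Sequences" from Basic Statistics

# Counts copied from the Overrepresented sequences table.
OVERREPRESENTED = {
    "GGGTAGGCACACGCTGAGCCAGTCAGTGTAGCGCGCGTGCAGC": 937,
    "GTGTAGCGCGCGTGCAGCCCCGGACATCTAAGGGCATCACAGA": 742,
}


def percentage_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count / total * 100.0


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED.items():
        pct = percentage_of_total(count, TOTAL_SEQUENCES)
        print(f"{sequence}\t{count}\t{pct:.6f}")
```

Both recomputed values agree with the table to within floating-point rounding, which is consistent with the two sequences each making up slightly more than 0.1% of the reads.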
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGTT | 50 | 2.700758E-4 | 22.199999 | 11 |
TAACGCC | 120 | 2.382876E-10 | 20.041666 | 4 |
TATATGA | 85 | 1.2439996E-6 | 19.588234 | 2 |
TCGAACG | 85 | 1.2439996E-6 | 19.588234 | 3 |
CGAACGT | 90 | 2.1499673E-6 | 18.5 | 4 |
ATCAGTA | 80 | 1.615318E-5 | 18.5 | 7 |
TATATCG | 60 | 9.2322956E-4 | 18.5 | 5 |
ATTCGAA | 90 | 2.1499673E-6 | 18.5 | 1 |
GGTATCA | 370 | 0.0 | 18.5 | 1 |
ATAACGC | 150 | 1.2732926E-11 | 18.5 | 3 |
ATAGTAC | 70 | 1.21840305E-4 | 18.5 | 3 |
GTATTAA | 50 | 0.0070324186 | 18.499998 | 1 |
GGTCTAG | 85 | 2.7208522E-5 | 17.411764 | 1 |
AATAACG | 140 | 1.8644641E-9 | 17.178572 | 2 |
GAATAAC | 155 | 4.0017767E-10 | 16.709677 | 1 |
CGCATCG | 145 | 2.9722287E-9 | 16.586206 | 13 |
AACGCCG | 145 | 2.9722287E-9 | 16.586206 | 5 |
TAATACC | 90 | 4.4422843E-5 | 16.444445 | 4 |
ATTAGAG | 90 | 4.4422843E-5 | 16.444445 | 3 |
TGGTTAA | 195 | 1.8189894E-12 | 16.128206 | 37 |