Basic Statistics
Measure | Value |
---|---|
Filename | ERR1630598.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1079085 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 4417 | 0.4093282734909669 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4199 | 0.38912597246741454 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 3415 | 0.3164718256671161 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2081 | 0.1928485707798737 | No Hit |
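
The Percentage values above look like Count divided by Total Sequences (1079085, from Basic Statistics), times 100. The sketch below checks that assumption against the four rows; the names `TOTAL_SEQUENCES`, `REPORTED`, and `percentage` are illustrative and not part of FastQC.

```python
# Assumption: Percentage = Count / Total Sequences * 100.
# TOTAL_SEQUENCES is taken from the Basic Statistics table above.
TOTAL_SEQUENCES = 1079085

# (count, reported percentage) pairs copied from the Overrepresented sequences table.
REPORTED = [
    (4417, 0.4093282734909669),
    (4199, 0.38912597246741454),
    (3415, 0.3164718256671161),
    (2081, 0.1928485707798737),
]


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total reads."""
    return count / total * 100


if __name__ == "__main__":
    for count, reported in REPORTED:
        computed = percentage(count)
        # Compare with a small tolerance rather than exact float equality.
        matches = abs(computed - reported) < 1e-9
        print(f"{count}\t{computed:.6f}\t{reported:.6f}\t{matches}")
```

Under the same assumption, the four rows together account for 14112 reads, or roughly 1.3% of the file.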
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AACTCCG | 45 | 0.0038256592 | 20.555555 | 7 |
CTATCGC | 55 | 5.142814E-4 | 20.181818 | 32 |
AAGACGG | 225 | 0.0 | 18.911112 | 5 |
CGGATTC | 50 | 0.007034609 | 18.5 | 16 |
CGAATTA | 110 | 3.8509825E-8 | 18.5 | 15 |
TATCGCC | 60 | 9.236315E-4 | 18.5 | 33 |
TCTATCG | 60 | 9.236315E-4 | 18.5 | 31 |
GGTATCA | 2200 | 0.0 | 17.490908 | 1 |
TTATACC | 75 | 2.0671188E-4 | 17.266666 | 4 |
GACGGAC | 250 | 0.0 | 17.02 | 7 |
TCTAATA | 175 | 7.2759576E-12 | 16.914286 | 2 |
TACCGTC | 110 | 7.806502E-7 | 16.818182 | 7 |
CGCAAGA | 265 | 0.0 | 16.754715 | 2 |
TAATACT | 170 | 8.54925E-11 | 16.32353 | 4 |
ATACCGT | 125 | 1.6573176E-7 | 16.28 | 6 |
AATACTG | 195 | 1.8189894E-12 | 16.128206 | 5 |
CACACTA | 115 | 1.2419459E-6 | 16.086956 | 10 |
GCGCAAG | 265 | 0.0 | 16.056602 | 1 |
CGCCTAT | 70 | 0.0025923564 | 15.857142 | 36 |
GTAGCAC | 70 | 0.0025923564 | 15.857142 | 3 |