Basic Statistics
Measure | Value |
---|---|
Filename | ERR1391188.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 710374 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TCCAGGGATTTATAAGCCGATGACGTCATAACATCCCTGACCCTTTAAATA | 2712 | 0.3817707292215087 | No Hit |
TCGTTGGAATTCCTCGGGGAATTCGGTATTCCCAGGCGGTCTCCCATCCAA | 1758 | 0.24747527358827884 | No Hit |
AAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAAT | 1473 | 0.20735556199973532 | Illumina Paired End PCR Primer 2 (96% over 30bp) |
AAGAGCGGTTCAGCAGGAATGCCGAGACCGATTAAGATCTCGTATGCCGTC | 960 | 0.13514008114035705 | Illumina Paired End PCR Primer 2 (96% over 32bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGATTCG | 70 | 0.0 | 41.78506 | 10 |
GCCGATG | 275 | 0.0 | 41.726624 | 16 |
TAAGCCG | 275 | 0.0 | 41.726624 | 13 |
AGCCGAT | 280 | 0.0 | 40.981503 | 15 |
CGTCATA | 280 | 0.0 | 40.177944 | 24 |
TCGGTAT | 305 | 0.0 | 39.83544 | 23 |
TGACGTC | 285 | 0.0 | 39.47307 | 21 |
GGCGGTC | 320 | 0.0 | 39.374382 | 35 |
ATTCGGT | 310 | 0.0 | 39.192932 | 21 |
CCGATGA | 290 | 0.0 | 38.792496 | 17 |
ACGTCAT | 290 | 0.0 | 38.792496 | 23 |
GCGGTCT | 325 | 0.0 | 38.768627 | 36 |
TTCGGTA | 315 | 0.0 | 38.570824 | 22 |
CGGTATT | 315 | 0.0 | 38.570824 | 24 |
GATGACG | 295 | 0.0 | 38.135 | 19 |
CGATGAC | 295 | 0.0 | 38.135 | 18 |
AAGCCGA | 305 | 0.0 | 37.622364 | 14 |
GACGTCA | 300 | 0.0 | 37.499416 | 22 |
ATGACGT | 300 | 0.0 | 37.499416 | 20 |
TCCTCGG | 270 | 0.0 | 37.499413 | 11 |