Basic Statistics
Measure | Value |
---|---|
Filename | SRR1546151_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 4210181 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 38 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 48391 | 1.149380513569369 | No Hit |
GAATGATACCTGTCTCTTATACACATCTGACGCACCTCGTTTCGTATGCC | 19194 | 0.4558948890795907 | No Hit |
GAATCTGTCTCTTATACACATCTGACGCACCTCGTTTCGTATGCCGTCTT | 14176 | 0.3367076142332123 | No Hit |
GAATGATACGGCTGTCTCTTATACACATCTGACGCACCTCGTTTCGTATG | 8154 | 0.19367338363837563 | No Hit |
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 4976 | 0.1181896930321998 | No Hit |
GCTGTCTCTTATACACATCTGACGCACCTCGTTTCGTATGCCGTCTTCTG | 4555 | 0.10819012294245782 | No Hit |
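
The Percentage column above appears to be each Count divided by Total Sequences (4210181 in Basic Statistics), multiplied by 100. The report itself does not state the formula, so treat that as an assumption. A minimal Python sketch that re-derives the column from the reported counts follows; the names `TOTAL_SEQUENCES`, `REPORTED`, `percentage_of_total`, and `mismatches` are illustrative and not part of FastQC.

```python
import math

# "Total Sequences" value copied from the Basic Statistics table.
TOTAL_SEQUENCES = 4210181

# (count, reported percentage) pairs copied from the table above.
REPORTED = [
    (48391, 1.149380513569369),
    (19194, 0.4558948890795907),
    (14176, 0.3367076142332123),
    (8154, 0.19367338363837563),
    (4976, 0.1181896930321998),
    (4555, 0.10819012294245782),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Assumed formula: share of all reads, expressed as a percentage."""
    return count / total * 100


def mismatches(rows=REPORTED, rel_tol=1e-6):
    """Return the rows whose reported percentage disagrees with the assumed formula."""
    return [
        (count, reported)
        for count, reported in rows
        if not math.isclose(percentage_of_total(count), reported, rel_tol=rel_tol)
    ]


if __name__ == "__main__":
    for count, reported in REPORTED:
        computed = percentage_of_total(count)
        print(f"{count:>6}  computed={computed:.6f}  reported={reported:.6f}")
    print("rows that disagree:", mismatches())
```

An empty list from `mismatches()` would indicate that the assumed formula reproduces every row of the table within the chosen tolerance.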
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGAACCG | 20 | 7.8588736E-4 | 44.0 | 2 |
AACGCAT | 25 | 4.444868E-5 | 44.0 | 37 |
CCGAATA | 30 | 2.5293302E-6 | 44.0 | 25 |
ATCGACG | 30 | 2.5293302E-6 | 44.0 | 1 |
CGTTTTT | 33230 | 0.0 | 42.563347 | 1 |
CTATGCG | 150 | 0.0 | 42.533333 | 1 |
GACGTAA | 265 | 0.0 | 42.339622 | 9 |
CACGTTA | 100 | 0.0 | 41.8 | 43 |
ACGTAGC | 65 | 0.0 | 40.615387 | 16 |
GGATGCC | 6295 | 0.0 | 40.610004 | 8 |
TAATGCG | 190 | 0.0 | 40.526318 | 1 |
CGAGGGT | 405 | 0.0 | 40.19753 | 4 |
CGACGAA | 55 | 7.8216544E-11 | 40.0 | 19 |
GTGATCG | 165 | 0.0 | 40.0 | 9 |
CGCTTAA | 110 | 0.0 | 40.0 | 41 |
CGACGGT | 215 | 0.0 | 39.906975 | 28 |
GGGATGC | 7605 | 0.0 | 39.892174 | 7 |
GTATGCG | 160 | 0.0 | 39.875 | 1 |
TAGGGAC | 4705 | 0.0 | 39.604675 | 5 |
CTTAACG | 50 | 1.3496901E-9 | 39.6 | 1 |