Basic Statistics
Measure | Value |
---|---|
Filename | ERR1041442.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 5365378 |
Sequences flagged as poor quality | 0 |
Sequence length | 42 |
%GC | 44 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 34724 | 0.6471864610471061 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 27812 | 0.5183604957563102 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTT | 18613 | 0.34690938830404866 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7485 | 0.139505548350927 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 6010 | 0.0 | 25.727121 | 1 |
GTATCAA | 14200 | 0.0 | 20.24366 | 1 |
TTAAGCG | 240 | 0.0 | 15.000001 | 36 |
TTAACGG | 490 | 0.0 | 13.224489 | 35 |
TATCAAC | 21795 | 0.0 | 13.156229 | 2 |
ATCAACG | 21790 | 0.0 | 13.051859 | 3 |
TAACGGC | 470 | 0.0 | 13.021276 | 36 |
AACGCAG | 21980 | 0.0 | 12.963603 | 6 |
TCAACGC | 21955 | 0.0 | 12.961967 | 4 |
CAACGCA | 22210 | 0.0 | 12.805043 | 5 |
TAGACCG | 175 | 7.407816E-7 | 12.342858 | 5 |
ACGCAGA | 23695 | 0.0 | 12.002533 | 7 |
CGCAGAG | 23765 | 0.0 | 11.974753 | 8 |
AGAGTAC | 24715 | 0.0 | 11.4853325 | 11 |
CAGAGTA | 24885 | 0.0 | 11.421338 | 10 |
GTATTAG | 1540 | 0.0 | 11.337663 | 1 |
GCAGAGT | 25070 | 0.0 | 11.337057 | 9 |
TTAGACT | 750 | 0.0 | 11.04 | 4 |
GTATAAA | 1515 | 0.0 | 10.811882 | 1 |
GTAGCAC | 550 | 0.0 | 10.8 | 3 |