Basic Statistics
Measure | Value |
---|---|
Filename | ERR1041969.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1072726 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 49 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1799 | 0.16770358880086805 | No Hit |
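
The Percentage value above appears to be the Count expressed as a share of Total Sequences from Basic Statistics. Below is a minimal Python sketch of that arithmetic, assuming this relationship holds; the function and variable names are hypothetical and are not part of the report.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 1072726
overrepresented_count = 1799


def percentage_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed definition of the column)."""
    return count / total * 100


pct = percentage_of_total(overrepresented_count, total_sequences)
print(f"{pct:.4f}%")  # prints 0.1677%
```

Under that assumption, the single flagged sequence accounts for roughly 0.17% of the reads in this file.
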
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CCGTTTA | 25 | 0.0054961815 | 29.6 | 13 |
CTTATAC | 1095 | 0.0 | 19.598173 | 37 |
TCTATAC | 175 | 0.0 | 19.02857 | 3 |
TTAGACG | 50 | 0.0070345826 | 18.5 | 4 |
TCGTATG | 50 | 0.0070345826 | 18.5 | 10 |
GTATAGA | 100 | 2.8744762E-7 | 18.5 | 1 |
AATTTCG | 115 | 6.403752E-8 | 17.695652 | 28 |
GTCGGTT | 105 | 4.7965295E-7 | 17.619047 | 12 |
TAACGAG | 65 | 0.0015798408 | 17.076923 | 5 |
GTCCTAA | 225 | 0.0 | 16.444445 | 1 |
TGGACCG | 80 | 3.3823022E-4 | 16.1875 | 5 |
ACAGCGT | 105 | 9.341855E-6 | 15.857142 | 8 |
CGAACTA | 130 | 2.5882946E-7 | 15.653846 | 24 |
GCGAACT | 130 | 2.5882946E-7 | 15.653846 | 23 |
CCTATAC | 215 | 0.0 | 15.488372 | 3 |
TAGACAG | 120 | 1.93511E-6 | 15.416666 | 5 |
CTATACA | 145 | 5.3434633E-8 | 15.310345 | 4 |
GGGTAAG | 145 | 5.3434633E-8 | 15.310345 | 1 |
TTTAGAC | 135 | 3.9711267E-7 | 15.074075 | 3 |
ATCTATA | 135 | 3.9711267E-7 | 15.074075 | 2 |