Basic Statistics
Measure | Value |
---|---|
Filename | ERR1630310.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1771301 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 13990 | 0.7898149439310429 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 9363 | 0.5285945189439852 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 9155 | 0.5168517377904716 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4031 | 0.2275728405279509 | No Hit |
ATTGAAAGCTGAGTATTTTTAAGACAAAGGTTTCAGGAAGAAA | 2010 | 0.11347591403155083 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTAGCGT | 40 | 0.0019315978 | 23.125 | 4 |
CGCACTA | 45 | 0.003826614 | 20.555555 | 29 |
GTGCGTA | 85 | 2.7244636E-5 | 17.411764 | 15 |
TGCGTAC | 85 | 2.7244636E-5 | 17.411764 | 16 |
GGTATCA | 3830 | 0.0 | 17.389034 | 1 |
TCGTTTA | 150 | 2.5102054E-10 | 17.266666 | 30 |
CAGTCGG | 240 | 0.0 | 16.958334 | 10 |
TACGGCG | 110 | 7.813469E-7 | 16.818182 | 19 |
CGTTATT | 210 | 0.0 | 16.738096 | 2 |
CTTATAC | 1040 | 0.0 | 16.721153 | 37 |
CCGGATA | 90 | 4.4481436E-5 | 16.444445 | 4 |
GCAGTCG | 260 | 0.0 | 15.653846 | 9 |
GCGTTAT | 225 | 0.0 | 15.622221 | 1 |
TACCGTC | 225 | 0.0 | 15.622221 | 7 |
TATACTG | 155 | 7.215931E-9 | 15.516129 | 5 |
GACGGAC | 275 | 0.0 | 15.472728 | 7 |
AAGACGG | 300 | 0.0 | 15.416667 | 5 |
ACGGACC | 300 | 0.0 | 15.416667 | 8 |
TACGACG | 180 | 2.0190782E-10 | 15.416666 | 5 |
GCTTAGG | 550 | 0.0 | 15.136364 | 1 |