Basic Statistics
Measure | Value |
---|---|
Filename | SRR1547428_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 3378991 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 41 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 58268 | 1.7244201005566455 | No Hit |
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 7161 | 0.21192716997470548 | No Hit |
GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 6601 | 0.19535417525527593 | No Hit |
CGTTTCTGTCTCTTATACACATCTGACGCTATAGGACTCGTATGCCGTCTT | 4995 | 0.14782519397062616 | No Hit |
CGTTTTCTGTCTCTTATACACATCTGACGCTATAGGACTCGTATGCCGTCT | 4121 | 0.12195948435494502 | No Hit |
CGCTGTCTCTTATACACATCTGACGCTATAGGACTCGTATGCCGTCTTCTG | 3477 | 0.10290054042760102 | TruSeq Adapter, Index 21 (95% over 21bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATTACGA | 20 | 7.0352305E-4 | 45.000004 | 26 |
TCGGATC | 20 | 7.0352305E-4 | 45.000004 | 23 |
GCGTATT | 20 | 7.0352305E-4 | 45.000004 | 31 |
CGTGACG | 35 | 1.2128658E-7 | 45.0 | 19 |
CGCGACT | 25 | 3.892418E-5 | 45.0 | 17 |
CTACGCG | 25 | 3.892418E-5 | 45.0 | 1 |
AGCGCGT | 35 | 1.2128658E-7 | 45.0 | 24 |
CCAACGA | 60 | 0.0 | 44.999996 | 23 |
TCGAATT | 60 | 0.0 | 44.999996 | 34 |
TACGTCA | 30 | 2.1667583E-6 | 44.999996 | 11 |
CGTTTTT | 25975 | 0.0 | 43.743984 | 1 |
TATTACG | 110 | 0.0 | 42.954544 | 1 |
CTAAGCG | 90 | 0.0 | 42.500004 | 1 |
CGGTCTA | 725 | 0.0 | 41.58621 | 31 |
CGACGGT | 730 | 0.0 | 41.301373 | 28 |
TGCCGAT | 55 | 6.184564E-11 | 40.909092 | 21 |
CTGTCGG | 275 | 0.0 | 40.90909 | 2 |
CCAATCG | 325 | 0.0 | 40.846153 | 24 |
CACGACG | 750 | 0.0 | 40.8 | 26 |
ACGGGTA | 375 | 0.0 | 40.8 | 5 |