Basic Statistics
Measure | Value |
---|---|
Filename | SRR1547957_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 2997942 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 12600 | 0.4202883177860012 | No Hit |
GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 3939 | 0.13139013363167135 | No Hit |
CCTGTCTCTTATACACATCTGACGCTTCTCTGCTCGTATGCCGTCTTCTGC | 3255 | 0.10857448209471697 | TruSeq Adapter, Index 23 (95% over 24bp) |
GGGGTTGGGGATTTAGCTCAGTGGTAGAGCGCTTGCCTAGCAAGCGCAAGG | 3204 | 0.10687331509415458 | No Hit |
CTGTCTCTTATACACATCTGACGCTTCTCTGCTCGTATGCCGTCTTCTGCT | 3202 | 0.10680660266275997 | Illumina Single End Adapter 1 (95% over 22bp) |
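
The Percentage column above appears to be each sequence's Count expressed as a share of Total Sequences (2997942) from Basic Statistics. The following is a minimal sketch that recomputes it under that assumption. The names `TOTAL_SEQUENCES`, `ROWS`, and `percentage` are illustrative and not part of the report or of any tool's API.

```python
import math

# "Total Sequences" as listed under Basic Statistics.
TOTAL_SEQUENCES = 2997942

# (count, percentage as reported) for each overrepresented sequence, in table order.
ROWS = [
    (12600, 0.4202883177860012),
    (3939, 0.13139013363167135),
    (3255, 0.10857448209471697),
    (3204, 0.10687331509415458),
    (3202, 0.10680660266275997),
]


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


for count, reported in ROWS:
    computed = percentage(count)
    # Flag whether the recomputed value agrees with the reported one.
    status = "ok" if math.isclose(computed, reported, rel_tol=1e-6) else "MISMATCH"
    print(f"{count:>6}  computed={computed:.6f}  reported={reported:.6f}  {status}")
```

If the assumed definition holds, every row should agree with the reported value to within floating-point tolerance.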
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CAACGTA | 20 | 7.035104E-4 | 45.0 | 38 |
TCACGAC | 970 | 0.0 | 42.912373 | 25 |
CGACGGT | 960 | 0.0 | 42.656246 | 28 |
CGGTCTA | 970 | 0.0 | 41.75258 | 31 |
GGCGATA | 910 | 0.0 | 40.796703 | 8 |
CTCACGA | 1050 | 0.0 | 39.857143 | 24 |
CGTTTTT | 7430 | 0.0 | 39.82167 | 1 |
GCGATAC | 130 | 0.0 | 39.807693 | 9 |
ATAGCGG | 425 | 0.0 | 39.705883 | 2 |
CGTAAGG | 375 | 0.0 | 39.600002 | 2 |
CTATGCG | 165 | 0.0 | 39.545452 | 1 |
TATAGCG | 240 | 0.0 | 39.374996 | 1 |
GCGATAT | 335 | 0.0 | 38.955223 | 9 |
TTAGGGA | 5120 | 0.0 | 38.847656 | 4 |
AGGGTAA | 1480 | 0.0 | 38.76689 | 6 |
TCGGGTA | 105 | 0.0 | 38.571426 | 5 |
GGGCGAT | 4125 | 0.0 | 38.4 | 7 |
CGCATCG | 135 | 0.0 | 38.333336 | 21 |
GACGGTC | 1070 | 0.0 | 38.271027 | 29 |
TAAGGGA | 4585 | 0.0 | 38.227917 | 4 |