Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3126374.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2348335 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 23187 | 0.987380420595869 | No Hit |
| GGGCAGGGACTTAATCAACGCAAGCTTATGACCCGCACTTACTGGGAATT | 2888 | 0.12298075019109285 | No Hit |
| CTGTCTCTTATACACATCTGACGCGTGTGAGATCGTATGCCGTCTTCTGC | 2403 | 0.10232781949764407 | Illumina Single End Adapter 1 (95% over 22bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 8655 | 0.0 | 85.535255 | 1 |
| GGTAAGG | 1305 | 0.0 | 71.76721 | 1 |
| CGTAGGG | 630 | 0.0 | 70.1558 | 2 |
| AGTAGGG | 3770 | 0.0 | 70.092445 | 2 |
| ATAGGGC | 1320 | 0.0 | 68.00365 | 3 |
| GTAGGGC | 1150 | 0.0 | 67.43089 | 3 |
| GGAGCTA | 4465 | 0.0 | 67.04876 | 9 |
| CCGTACA | 1120 | 0.0 | 66.29974 | 3 |
| ATAGCGG | 540 | 0.0 | 66.23741 | 1 |
| AGTAAGG | 1405 | 0.0 | 65.98928 | 1 |
| TCCGTAC | 1165 | 0.0 | 65.78668 | 2 |
| CGGGTAT | 195 | 0.0 | 65.07316 | 5 |
| AAGGGGC | 6270 | 0.0 | 64.98668 | 4 |
| GTAGGGA | 1840 | 0.0 | 64.876686 | 3 |
| ATAAGGG | 3105 | 0.0 | 64.66101 | 2 |
| GAATAGG | 1355 | 0.0 | 64.60367 | 1 |
| TAGGGTA | 650 | 0.0 | 64.35013 | 4 |
| AGAAGGG | 8105 | 0.0 | 63.93002 | 2 |
| TAGAGGG | 4025 | 0.0 | 63.78272 | 2 |
| ATAGAGG | 1640 | 0.0 | 63.707806 | 1 |