Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3128470.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2696003 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 26723 | 0.9912080958366886 | No Hit |
| TCCGCTACGACCAACTCATACACCTCCTATGAAAAAACTTCCTACCACTC | 2870 | 0.10645388747712817 | No Hit |
| CTGTCTCTTATACACATCTGACGCCAGGACTCTCGTATGCCGTCTTCTGC | 2719 | 0.10085300350185071 | TruSeq Adapter, Index 8 (95% over 23bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 9005 | 0.0 | 82.518654 | 1 |
| CTACGAC | 650 | 0.0 | 73.749695 | 5 |
| ATAGCGG | 610 | 0.0 | 70.976036 | 1 |
| CGTAGGG | 830 | 0.0 | 70.24679 | 2 |
| AGTAGGG | 4380 | 0.0 | 70.20799 | 2 |
| TAGGGCA | 1440 | 0.0 | 70.16966 | 4 |
| ATAGGGA | 2455 | 0.0 | 69.68244 | 3 |
| ATAACGG | 240 | 0.0 | 68.62945 | 1 |
| ATAGGGC | 1595 | 0.0 | 68.359795 | 3 |
| AGGGATG | 4055 | 0.0 | 68.03317 | 5 |
| GAATAGG | 1635 | 0.0 | 67.927864 | 1 |
| CGAAGGG | 2780 | 0.0 | 66.97817 | 2 |
| TAGCGGG | 1485 | 0.0 | 66.492966 | 2 |
| TAGAGGG | 4435 | 0.0 | 66.368744 | 2 |
| TATAGGG | 2010 | 0.0 | 65.500534 | 2 |
| GTAGGGC | 1150 | 0.0 | 65.387634 | 3 |
| TAGACGG | 355 | 0.0 | 64.95633 | 1 |
| AGTAAGG | 1965 | 0.0 | 64.662865 | 1 |
| ACGGGAT | 845 | 0.0 | 64.51708 | 4 |
| AGAGGGC | 3625 | 0.0 | 64.435 | 3 |