Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3128987.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2142075 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 14155 | 0.6608078615361274 | No Hit |
| CCTGTCTCTTATACACATCTGACGCGGACCTTATCGTATGCCGTCTTCTG | 4089 | 0.1908896747312769 | TruSeq Adapter, Index 23 (95% over 21bp) |
| CTGTCTCTTATACACATCTGACGCGGACCTTATCGTATGCCGTCTTCTGC | 4024 | 0.18785523382701352 | TruSeq Adapter, Index 20 (95% over 23bp) |
| GCTGTCTCTTATACACATCTGACGCGGACCTTATCGTATGCCGTCTTCTG | 3803 | 0.177538134752518 | TruSeq Adapter, Index 23 (95% over 21bp) |
| TCCGCTACGACCAACTCATACACCTCCTATGAAAAAACTTCCTACCACTC | 3279 | 0.15307587269353315 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 5720 | 0.0 | 78.89344 | 1 |
| AGGGATG | 3780 | 0.0 | 75.59313 | 5 |
| ACGGGTA | 245 | 0.0 | 74.81163 | 4 |
| AGGGTAC | 1275 | 0.0 | 74.08946 | 5 |
| AAGGGTA | 1790 | 0.0 | 73.51493 | 4 |
| CGTAGGG | 610 | 0.0 | 72.46566 | 2 |
| GAGGGTA | 1350 | 0.0 | 72.41026 | 4 |
| ATAAGGG | 2470 | 0.0 | 70.63357 | 2 |
| TAGGGCA | 910 | 0.0 | 70.23734 | 4 |
| TAGAGGG | 3465 | 0.0 | 69.6223 | 2 |
| GGGTACC | 1770 | 0.0 | 69.300735 | 6 |
| TAAGGGA | 1870 | 0.0 | 69.11331 | 3 |
| AGTAAGG | 1505 | 0.0 | 69.09937 | 1 |
| AGGGTAT | 1290 | 0.0 | 68.85614 | 5 |
| AGGGAAT | 2670 | 0.0 | 68.82339 | 5 |
| ATAGGGA | 2030 | 0.0 | 68.759224 | 3 |
| ATAGCGG | 445 | 0.0 | 68.73401 | 1 |
| TAGGGTA | 630 | 0.0 | 68.63061 | 4 |
| AGGGTAA | 980 | 0.0 | 68.57733 | 5 |
| AGAGGGA | 4935 | 0.0 | 66.94817 | 3 |