Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522993_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 5546751 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 15870 | 0.2861134382992855 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 13165 | 0.23734615092691197 | No Hit |
| CTTATACACATCTCCGAGCCCACGAGACCGTACTAGATCTCGTATGCCGT | 11350 | 0.20462429267151167 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8480 | 0.1528822909122836 | No Hit |
| ATACACATCTCCGAGCCCACGAGACCGTACTAGATCTCGTATGCCGTCTT | 5631 | 0.10151888916592794 | TruSeq Adapter, Index 11 (95% over 21bp) |
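
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (5546751), multiplied by 100. The following is a minimal sketch for re-deriving those values; the counts are copied from the table, and the helper name `percentage_of_total` is hypothetical, not part of FastQC.

```python
# Total Sequences, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 5546751

# Counts copied from the Overrepresented sequences table.
overrepresented_counts = {
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 15870,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 13165,
    "CTTATACACATCTCCGAGCCCACGAGACCGTACTAGATCTCGTATGCCGT": 11350,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 8480,
    "ATACACATCTCCGAGCCCACGAGACCGTACTAGATCTCGTATGCCGTCTT": 5631,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Hypothetical helper: share of all reads, as a percentage."""
    return 100.0 * count / total


# Print each sequence (truncated for display) with its recomputed percentage.
for sequence, count in overrepresented_counts.items():
    print(f"{sequence[:20]}...  {count:>6}  {percentage_of_total(count):.4f}%")

# Combined share of the five listed sequences.
combined = percentage_of_total(sum(overrepresented_counts.values()))
print(f"combined: {combined:.4f}%")
```

Taken together, the five listed sequences account for a little under 1% of the reads in this file.
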
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 10770 | 0.0 | 57.695774 | 2 |
| ACCTGGG | 11775 | 0.0 | 51.014576 | 3 |
| GTACCTG | 14275 | 0.0 | 43.89042 | 1 |
| CCTGGGG | 10195 | 0.0 | 37.989567 | 4 |
| TATAACG | 635 | 0.0 | 35.53005 | 2 |
| TAACGCA | 695 | 0.0 | 32.462414 | 4 |
| CTGGGGG | 6090 | 0.0 | 30.718355 | 5 |
| GGTATCA | 15220 | 0.0 | 28.896055 | 1 |
| ATAACGC | 815 | 0.0 | 28.259392 | 3 |
| GTATCAA | 21900 | 0.0 | 28.136415 | 1 |
| GTACCCG | 1195 | 0.0 | 25.585098 | 1 |
| GTACATG | 19030 | 0.0 | 24.198338 | 1 |
| TCAACGC | 25240 | 0.0 | 23.892504 | 4 |
| TACCGGG | 810 | 0.0 | 23.79179 | 2 |
| CAACGCA | 25645 | 0.0 | 23.515604 | 5 |
| ATCAACG | 25780 | 0.0 | 23.410273 | 3 |
| TATCAAC | 25950 | 0.0 | 23.347685 | 2 |
| TATCACG | 550 | 0.0 | 23.074343 | 2 |
| TACATGG | 19330 | 0.0 | 22.95453 | 2 |
| AACGCAG | 26515 | 0.0 | 22.81246 | 6 |