Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522899_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 3933975 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 11607 | 0.29504508798352813 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 10193 | 0.2591017990709143 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5265 | 0.13383409909824032 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 4900 | 0.0 | 41.063206 | 2 |
| GGTATCA | 8860 | 0.0 | 38.55036 | 1 |
| ACCTGGG | 5580 | 0.0 | 34.87737 | 3 |
| GTATCAA | 13805 | 0.0 | 33.738388 | 1 |
| GTACCTG | 6255 | 0.0 | 32.567627 | 1 |
| TCAACGC | 16120 | 0.0 | 28.228504 | 4 |
| CAACGCA | 16390 | 0.0 | 27.792519 | 5 |
| ATCAACG | 16395 | 0.0 | 27.69767 | 3 |
| TATAACG | 445 | 0.0 | 27.467459 | 2 |
| ATAACGC | 465 | 0.0 | 27.295332 | 3 |
| TATCAAC | 16660 | 0.0 | 27.287052 | 2 |
| AACGCAG | 17140 | 0.0 | 26.790695 | 6 |
| TAACGCA | 475 | 0.0 | 26.720692 | 4 |
| CCTGGGG | 5000 | 0.0 | 24.25645 | 4 |
| ACGCAGA | 19290 | 0.0 | 23.804392 | 7 |
| GTACATG | 13205 | 0.0 | 23.15797 | 1 |
| CGCAGAG | 19775 | 0.0 | 23.148382 | 8 |
| TACATGG | 13240 | 0.0 | 22.511604 | 2 |
| TATACCG | 275 | 6.002665E-11 | 22.222542 | 5 |
| ACATGGG | 13175 | 0.0 | 21.479464 | 3 |