Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522840_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 9123970 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 26520 | 0.2906629460640489 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 23549 | 0.25810036639752215 | No Hit |
| CTTATACACATCTCCGAGCCCACGAGACCGAGGCTGATCTCGTATGCCGT | 18306 | 0.2006363458012247 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 12582 | 0.13790049726160872 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 10995 | 0.0 | 41.173138 | 2 |
| ACCTGGG | 12650 | 0.0 | 34.74365 | 3 |
| TATAACG | 900 | 0.0 | 32.384117 | 2 |
| GTACCTG | 15785 | 0.0 | 29.145723 | 1 |
| ATAACGC | 1065 | 0.0 | 26.923687 | 3 |
| TAACGCA | 1085 | 0.0 | 26.860489 | 4 |
| CCTGGGG | 11510 | 0.0 | 24.095093 | 4 |
| GTACATG | 32195 | 0.0 | 23.860432 | 1 |
| GGTATCA | 36155 | 0.0 | 23.198694 | 1 |
| TACATGG | 33020 | 0.0 | 22.878191 | 2 |
| GTATCAA | 47455 | 0.0 | 22.056093 | 1 |
| CATGGGG | 19540 | 0.0 | 22.035498 | 4 |
| ACATGGG | 32590 | 0.0 | 22.024649 | 3 |
| GAGTACT | 26365 | 0.0 | 21.13232 | 12-13 |
| ATGGGGG | 12000 | 0.0 | 20.957006 | 5 |
| TCAACGC | 52490 | 0.0 | 19.701443 | 4 |
| CAACGCA | 53060 | 0.0 | 19.516592 | 5 |
| ATCAACG | 53465 | 0.0 | 19.377438 | 3 |
| TGGGGGG | 12320 | 0.0 | 19.26529 | 6 |
| GTACTTT | 28615 | 0.0 | 19.232588 | 14-15 |