Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR523035_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 12653303 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 46 |

Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 34972 | 0.2763863316953684 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 29640 | 0.23424713689382134 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 18959 | 0.14983439501922935 | No Hit |
| GTACATGGGAAGCAGTGGTATCAACGCAGAGTACATGGGAAGCAGTGGTA | 14169 | 0.11197866675602411 | No Hit |
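
The Percentage column above appears to be each sequence's Count divided by Total Sequences (12653303, from Basic Statistics), times 100. That relationship is inferred from the numbers themselves, not stated in the report. A minimal sketch that recomputes it, with `percent_of_total` as a hypothetical helper name:

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 12653303

overrepresented_counts = {
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 34972,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 29640,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 18959,
    "GTACATGGGAAGCAGTGGTATCAACGCAGAGTACATGGGAAGCAGTGGTA": 14169,
}


def percent_of_total(count, total):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


for sequence, count in overrepresented_counts.items():
    pct = percent_of_total(count, total_sequences)
    # Show only the first 20 bases to keep the output narrow.
    print(f"{sequence[:20]}...  {count:>6}  {pct:.4f}%")
```

Under that assumption, the recomputed values agree with the reported percentages to within floating-point rounding.
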

Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 22520 | 0.0 | 58.475197 | 2 |
| GTACCTG | 26390 | 0.0 | 50.66941 | 1 |
| ACCTGGG | 25600 | 0.0 | 50.407455 | 3 |
| CCTGGGG | 22520 | 0.0 | 36.162994 | 4 |
| CTGGGGG | 13360 | 0.0 | 26.451004 | 5 |
| TAACGCA | 1500 | 0.0 | 26.316141 | 4 |
| GTATCAA | 52410 | 0.0 | 26.051823 | 1 |
| GGTATCA | 37900 | 0.0 | 24.773907 | 1 |
| TATAACG | 1465 | 0.0 | 24.7016 | 2 |
| ATAACGC | 1610 | 0.0 | 23.350615 | 3 |
| GTACACG | 2830 | 0.0 | 22.59478 | 1 |
| CATGGGG | 30650 | 0.0 | 22.323647 | 4 |
| TCAACGC | 60585 | 0.0 | 22.098438 | 4 |
| GTACATG | 62450 | 0.0 | 21.96138 | 1 |
| CAACGCA | 61145 | 0.0 | 21.926617 | 5 |
| ATCAACG | 61750 | 0.0 | 21.673908 | 3 |
| AACGCAG | 62680 | 0.0 | 21.539675 | 6 |
| TATCACG | 1385 | 0.0 | 21.377787 | 2 |
| TATCAAC | 63105 | 0.0 | 21.29229 | 2 |
| TACATGG | 64060 | 0.0 | 21.150942 | 2 |