Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522867_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 6524394 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 22180 | 0.3399549444745366 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 20726 | 0.3176693498277388 | No Hit |
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 15997 | 0.2451875223967161 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 10936 | 0.16761709976436126 | No Hit |
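The Percentage column above appears to be each sequence's count as a share of the Total Sequences value in Basic Statistics (6524394). A minimal sketch that recomputes it under that assumption is shown here; the helper name `percentage_of_total` and the dictionary layout are illustrative and not part of the report.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 6524394

# sequence -> (count, percentage as reported)
overrepresented = {
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": (22180, 0.3399549444745366),
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": (20726, 0.3176693498277388),
    "GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT": (15997, 0.2451875223967161),
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": (10936, 0.16761709976436126),
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


# Recompute each percentage so it can be compared with the reported column.
recomputed = {
    seq: percentage_of_total(count) for seq, (count, _) in overrepresented.items()
}

for seq, (count, reported) in overrepresented.items():
    print(f"{seq[:20]}...  count={count}  reported={reported:.4f}  "
          f"recomputed={recomputed[seq]:.4f}")
```

Rounding the column to three or four decimal places loses nothing useful, since the counts themselves are exact.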
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 6245 | 0.0 | 25.817688 | 2 |
| TATAACG | 610 | 0.0 | 23.117832 | 2 |
| GTACCTG | 7090 | 0.0 | 22.822031 | 1 |
| GTACATG | 21670 | 0.0 | 22.400764 | 1 |
| TACATGG | 21875 | 0.0 | 21.703463 | 2 |
| GAGTACT | 20045 | 0.0 | 21.171785 | 12-13 |
| ACATGGG | 22115 | 0.0 | 20.743973 | 3 |
| TAACGCA | 710 | 0.0 | 20.522602 | 4 |
| GGTATCA | 31245 | 0.0 | 20.202915 | 1 |
| GTATCAA | 40110 | 0.0 | 20.182281 | 1 |
| ATAACGC | 725 | 0.0 | 20.097996 | 3 |
| ACCTGGG | 7615 | 0.0 | 20.060537 | 3 |
| CATGGGG | 13905 | 0.0 | 19.74108 | 4 |
| GTACTTT | 21520 | 0.0 | 19.403467 | 14-15 |
| AGAGTAC | 30690 | 0.0 | 18.629635 | 10-11 |
| TCAACGC | 43255 | 0.0 | 18.473179 | 4 |
| CAACGCA | 43685 | 0.0 | 18.313427 | 5 |
| ATCAACG | 43990 | 0.0 | 18.164522 | 3 |
| AGTACTT | 21130 | 0.0 | 18.10509 | 12-13 |
| AACGCAG | 44425 | 0.0 | 18.059061 | 6 |
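Several of the enriched 7-mers above look like fragments of the overrepresented sequences listed earlier in the report. The following sketch checks that by plain substring matching; the function name `kmers_in_overrepresented` is hypothetical, and exact substring matching is an assumption chosen for illustration, not how FastQC itself relates the two modules.

```python
# Sequences copied from the Overrepresented sequences table.
OVERREPRESENTED = [
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT",
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT",
]

# 7-mers copied from the Kmer Content table, in table order.
KMERS = [
    "TACCTGG", "TATAACG", "GTACCTG", "GTACATG", "TACATGG",
    "GAGTACT", "ACATGGG", "TAACGCA", "GGTATCA", "GTATCAA",
    "ATAACGC", "ACCTGGG", "CATGGGG", "GTACTTT", "AGAGTAC",
    "TCAACGC", "CAACGCA", "ATCAACG", "AGTACTT", "AACGCAG",
]


def kmers_in_overrepresented(kmers, sequences):
    """Map each k-mer to the indices of the sequences that contain it."""
    return {
        kmer: [i for i, seq in enumerate(sequences) if kmer in seq]
        for kmer in kmers
    }


hits = kmers_in_overrepresented(KMERS, OVERREPRESENTED)
shared = [kmer for kmer, idx in hits.items() if idx]
unshared = [kmer for kmer, idx in hits.items() if not idx]

print("Found in an overrepresented sequence:", ", ".join(shared))
print("Not found in any:", ", ".join(unshared))
```

K-mers that do not match any listed sequence, such as TACCTGG or GTACATG, would need a separate explanation and are worth inspecting on their own.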