Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1512022_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2050825 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 25 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTT | 4549 | 0.2218131727475528 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTT | 2726 | 0.13292211670912926 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTT | 2670 | 0.13019150829544207 | No Hit |
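
The Percentage column above appears consistent with each Count divided by the Total Sequences value from Basic Statistics (2050825), times 100. The following is a minimal sketch of that arithmetic, assuming this is how the figure is derived; the names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage` are illustrative and not part of the report.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 2050825

overrepresented = {
    "GTATCAACGCAGAGTACTTTTTTTT": 4549,
    "GGTATCAACGCAGAGTACTTTTTTT": 2726,
    "TATCAACGCAGAGTACTTTTTTTTT": 2670,
}


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed derivation)."""
    return 100.0 * count / total


# Print each sequence with its count and recomputed percentage.
for seq, count in overrepresented.items():
    print(f"{seq}\t{count}\t{percentage(count):.6f}")
```

Recomputing the column this way is a quick check that the table and the Total Sequences value belong to the same run.
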
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGACCGC | 90 | 5.4200063E-7 | 12.6643095 | 6 |
| GGTATCA | 1030 | 0.0 | 12.602513 | 1 |
| CCGGAAC | 60 | 0.005865689 | 11.086137 | 3 |
| AGAACCG | 100 | 2.3927096E-5 | 10.452643 | 5 |
| GACCGCT | 75 | 0.0026576503 | 10.128977 | 7 |
| TAGGACC | 385 | 0.0 | 10.119442 | 4 |
| TTAGGAC | 565 | 0.0 | 10.091047 | 3 |
| GCGTGCG | 85 | 6.620976E-4 | 10.0545 | 9 |
| TGGACCG | 105 | 4.0945502E-5 | 9.954898 | 5 |
| GTCCTAA | 600 | 0.0 | 9.862702 | 1 |
| TAGGACA | 465 | 0.0 | 9.808932 | 4 |
| GTAGGAC | 1135 | 0.0 | 9.795428 | 3 |
| AGGACGT | 1000 | 0.0 | 9.787475 | 5 |
| GTCTTAA | 245 | 0.0 | 9.739338 | 1 |
| GTATCAA | 2020 | 0.0 | 9.639051 | 1 |
| GGACGTG | 1000 | 0.0 | 9.593214 | 6 |
| TAGGACG | 1050 | 0.0 | 9.592901 | 4 |
| TTAGACT | 160 | 6.6102075E-8 | 9.502403 | 4 |
| AGGACGG | 90 | 0.0011112588 | 9.502403 | 5 |
| ATAACAC | 120 | 1.6955939E-5 | 9.502403 | 3 |