Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042097.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 7473879 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 34208 | 0.4577007468277183 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 32729 | 0.43791182597416944 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 20777 | 0.2779948671901164 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 14363 | 0.19217597715991924 | No Hit |
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 8538 | 0.11423786764543553 | No Hit |
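
The Percentage column above appears to be each sequence's count expressed as a share of Total Sequences (7473879) from Basic Statistics. That relationship is inferred from the numbers, not stated in the report. The sketch below recomputes it for the five listed sequences; the names `TOTAL_SEQUENCES`, `OVERREPRESENTED`, and `percentage` are hypothetical and used only for illustration.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 7473879

OVERREPRESENTED = {
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 34208,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 32729,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 20777,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 14363,
    "GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT": 8538,
}


def percentage(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED.items():
        # Print four decimal places; the report itself keeps full float precision.
        print(f"{sequence}\t{count}\t{percentage(count, TOTAL_SEQUENCES):.4f}")
```

Under this assumption the recomputed values agree with the reported ones to within floating-point rounding, and the five sequences together account for roughly 1.5% of all reads.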
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ACGGACC | 1830 | 0.0 | 20.521856 | 8 |
| GACGGAC | 1845 | 0.0 | 20.254744 | 7 |
| AAGACGG | 2010 | 0.0 | 19.880598 | 5 |
| CGGACCA | 1990 | 0.0 | 18.31407 | 9 |
| AGACGGA | 2080 | 0.0 | 18.14423 | 6 |
| CGCAAGA | 2105 | 0.0 | 17.840857 | 2 |
| GGTATCA | 14310 | 0.0 | 17.414047 | 1 |
| TAAACGC | 1170 | 0.0 | 16.444445 | 28 |
| GTAAACG | 1160 | 0.0 | 16.426723 | 27 |
| TACCGTC | 1340 | 0.0 | 16.291044 | 7 |
| CTAGCGG | 920 | 0.0 | 16.288044 | 29 |
| TCTAGCG | 910 | 0.0 | 16.263735 | 28 |
| CAAGACG | 2465 | 0.0 | 16.210955 | 4 |
| TCTATAC | 1130 | 0.0 | 16.207964 | 3 |
| TTCGGGC | 1250 | 0.0 | 16.132 | 35 |
| GCGAAAG | 2340 | 0.0 | 15.574785 | 18 |
| TAAGACG | 250 | 0.0 | 15.54 | 4 |
| GCGCAAG | 2440 | 0.0 | 15.467214 | 1 |
| GTATTAG | 2285 | 0.0 | 15.463896 | 1 |
| CGAAAGC | 2425 | 0.0 | 15.105154 | 19 |