Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1142541_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 427819 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 125 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1527 | 0.3569266442116877 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1264 | 0.2954520486467408 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 825 | 0.19283856023224774 | No Hit |
| GTACATGGGAAGCAGTGGTATCAACGCAGAGTACATGGGAAGCAGTGGTA | 527 | 0.12318293483926614 | No Hit |
| CCCATGTACTCTGCGTTGATACCACTGCTTCCCATGTACTCTGCGTTGAT | 443 | 0.10354846325198272 | No Hit |
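
The Percentage column appears to be each Count divided by the Total Sequences value from Basic Statistics (427819), times 100. The sketch below recomputes it under that assumption; the names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage` are illustrative and are not part of the report.

```python
# Total Sequences, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 427819

# (count, reported percentage) pairs copied from the Overrepresented sequences table.
overrepresented = [
    (1527, 0.3569266442116877),
    (1264, 0.2954520486467408),
    (825, 0.19283856023224774),
    (527, 0.12318293483926614),
    (443, 0.10354846325198272),
]


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


for count, reported in overrepresented:
    # Print the recomputed value next to the reported one for comparison.
    print(f"{count:>5}  recomputed={percentage(count):.6f}  reported={reported:.6f}")
```

Rounding to a few decimal places, as the print statement does, is usually enough when quoting these values.
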
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TTAACGC | 20 | 8.4888533E-4 | 89.22107 | 4 |
| TATCACG | 40 | 1.7735589E-4 | 59.480717 | 2 |
| TAACGCA | 55 | 8.536014E-4 | 43.258698 | 5 |
| ACAACGC | 70 | 0.0027914715 | 33.98898 | 3 |
| GTGTTAT | 130 | 1.7370985E-6 | 32.028076 | 1 |
| TACAACG | 75 | 0.0039139725 | 31.72305 | 2 |
| GTATCAA | 2515 | 0.0 | 30.7455 | 1 |
| CGACGGA | 80 | 0.0053669843 | 29.740358 | 1 |
| GGTATCA | 1910 | 0.0 | 25.84764 | 1 |
| CAACGCA | 3165 | 0.0 | 25.558855 | 5 |
| AACGCAG | 3270 | 0.0 | 25.283852 | 6 |
| ATCAACG | 3075 | 0.0 | 25.146317 | 3 |
| TCAACGC | 3270 | 0.0 | 24.738157 | 4 |
| TATCAAC | 3280 | 0.0 | 23.574673 | 2 |
| ACGCAGA | 3525 | 0.0 | 23.454807 | 7 |
| CGGGCGT | 130 | 0.0020539348 | 22.877197 | 6 |
| CGCAGAG | 3645 | 0.0 | 22.356264 | 8 |
| GCAGAGT | 3940 | 0.0 | 20.68238 | 9 |
| TACTATA | 260 | 7.6957804E-7 | 20.589478 | 2 |
| GACACCG | 145 | 0.0038707282 | 20.510592 | 7 |
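
Several of the k-mers reported with PValue 0.0 look like consecutive 7-base windows of the most frequent overrepresented sequence. The sketch below is one way to cross-check that: it looks up each k-mer in that sequence and prints its 1-based offset beside the reported Max Obs/Exp Position. The names `top_seq`, `kmers`, and `offset_in` are illustrative. The offsets are positions within the 50-base table entry and are only assumed to line up with read positions.

```python
# Most frequent entry in the Overrepresented sequences table.
top_seq = "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT"

# (k-mer, reported Max Obs/Exp Position) for the rows with PValue 0.0.
kmers = [
    ("GTATCAA", 1), ("GGTATCA", 1), ("CAACGCA", 5), ("AACGCAG", 6),
    ("ATCAACG", 3), ("TCAACGC", 4), ("TATCAAC", 2), ("ACGCAGA", 7),
    ("CGCAGAG", 8), ("GCAGAGT", 9),
]


def offset_in(seq, kmer):
    """Return the 1-based offset of the first occurrence of kmer in seq, or None."""
    idx = seq.find(kmer)
    return idx + 1 if idx >= 0 else None


for kmer, reported in kmers:
    # GGTATCA is absent from top_seq; it begins the second table entry instead.
    print(kmer, "reported:", reported, "offset in top_seq:", offset_in(top_seq, kmer))
```
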