Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR938766_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 4305073 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 101 |
| %GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 9964 | 0.2314478755644794 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 9914 | 0.23028645507288725 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8496 | 0.1973485699313345 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5616 | 0.1304507496156279 | No Hit |
| GCTTACTCTGCGTTGATACCACTGCTTACTCTGCGTTGATACCACTGCTT | 4389 | 0.10194949075195704 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACATGG | 14400 | 0.0 | 22.512209 | 2 |
| GTACATG | 15070 | 0.0 | 22.005703 | 1 |
| ACATGGG | 14765 | 0.0 | 20.894875 | 3 |
| GAGTACT | 12195 | 0.0 | 19.82353 | 12-13 |
| GTATCAA | 25095 | 0.0 | 19.642117 | 1 |
| GTACTTT | 13035 | 0.0 | 18.382261 | 14-15 |
| AGAGTAC | 20230 | 0.0 | 18.212276 | 10-11 |
| GGTATCA | 20170 | 0.0 | 18.21072 | 1 |
| AGTACTT | 12985 | 0.0 | 17.978016 | 12-13 |
| CATGGGG | 7885 | 0.0 | 17.817629 | 4 |
| AACGTCG | 165 | 0.0014753507 | 17.254045 | 6 |
| TCAACGC | 28240 | 0.0 | 17.24418 | 4 |
| ATCAACG | 28370 | 0.0 | 17.165161 | 3 |
| CAACGCA | 28400 | 0.0 | 17.14703 | 5 |
| TATCAAC | 28575 | 0.0 | 17.108458 | 2 |
| GTATAGC | 1650 | 0.0 | 17.013113 | 1 |
| AACGCAG | 29030 | 0.0 | 16.818684 | 6 |
| TACCTGG | 2955 | 0.0 | 16.062103 | 2 |
| CATGGGA | 7715 | 0.0 | 15.564834 | 4 |
| ACGCAGA | 31655 | 0.0 | 15.3483305 | 7 |