Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR4062033_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 823077 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 42 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CTGTAGGACGTGGAATATGGCAAGAAAACTGAAAATCATGGAAAATGAGA | 2354 | 0.28599997327103055 | No Hit |
| GTCCTACAGTGGACATTTCTAAATTTTCCACCTTTTTCAGTTTTCCTCGC | 1779 | 0.21614016671587227 | No Hit |
| CTTTAGGACGTGAAATATGGCGAGGAAAACTGAAAAAGGTGGAAAATTTA | 1501 | 0.1823644689379001 | No Hit |
| GTCCTAAAGTGTGTATTTCTCATTTTCCGTGATTTTCAGTTTTCTCGCCA | 1225 | 0.14883176179142413 | No Hit |
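
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (823077), times 100. The sketch below recomputes it from those two columns as a sanity check. The function name `overrepresented_percentage` is hypothetical and is not part of FastQC. The formula is an assumption that happens to match the reported values, not a statement of how FastQC computes them internally.

```python
def overrepresented_percentage(count: int, total_sequences: int) -> float:
    """Return count as a percentage of total_sequences (assumed formula)."""
    if total_sequences <= 0:
        raise ValueError("total_sequences must be positive")
    return 100.0 * count / total_sequences


# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 823077
counts = [2354, 1779, 1501, 1225]

percentages = [overrepresented_percentage(c, TOTAL_SEQUENCES) for c in counts]

for count, pct in zip(counts, percentages):
    # Print with four decimals; the report itself keeps full float precision.
    print(f"{count}\t{pct:.4f}%")
```

Taken together, the four listed sequences account for well under 1% of all reads.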
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GTATGCG | 25 | 0.0023461657 | 35.218914 | 1 |
| ACCGCGC | 25 | 0.002353171 | 35.19752 | 8 |
| TACGGCG | 35 | 3.2183054E-4 | 31.42636 | 16 |
| AATACGG | 35 | 3.2183054E-4 | 31.42636 | 14 |
| GTATCAA | 740 | 0.0 | 25.878767 | 1 |
| CGCACAA | 45 | 0.0013974189 | 24.445692 | 44 |
| TAGGACC | 380 | 0.0 | 24.314077 | 4 |
| GGTATCA | 320 | 0.0 | 22.699692 | 1 |
| TAGGACG | 2655 | 0.0 | 21.874166 | 4 |
| CTGTAGG | 2000 | 0.0 | 21.681646 | 1 |
| TGTAGGA | 2120 | 0.0 | 21.583385 | 2 |
| TTAGGAC | 1195 | 0.0 | 21.354145 | 3 |
| GTCCTAC | 1535 | 0.0 | 21.223125 | 1 |
| GTAGGAC | 2150 | 0.0 | 21.077585 | 3 |
| GGACGTG | 2585 | 0.0 | 21.019798 | 6 |
| AGGACGT | 2655 | 0.0 | 20.962742 | 5 |
| ACGGGAT | 85 | 6.106826E-6 | 20.70694 | 43 |
| GACGTGG | 1515 | 0.0 | 20.61901 | 7 |
| TATGACC | 130 | 1.6243575E-9 | 20.306263 | 4 |
| CGTGGAA | 1620 | 0.0 | 20.233143 | 9 |