Basic Statistics
Measure | Value |
---|---|
Filename | SRR4062033_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 823077 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 42 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CTGTAGGACGTGGAATATGGCAAGAAAACTGAAAATCATGGAAAATGAGA | 2354 | 0.28599997327103055 | No Hit |
GTCCTACAGTGGACATTTCTAAATTTTCCACCTTTTTCAGTTTTCCTCGC | 1779 | 0.21614016671587227 | No Hit |
CTTTAGGACGTGAAATATGGCGAGGAAAACTGAAAAAGGTGGAAAATTTA | 1501 | 0.1823644689379001 | No Hit |
GTCCTAAAGTGTGTATTTCTCATTTTCCGTGATTTTCAGTTTTCTCGCCA | 1225 | 0.14883176179142413 | No Hit |
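
The Percentage column above is consistent with each Count divided by the Total Sequences value from Basic Statistics, multiplied by 100. The sketch below recomputes it as a sanity check. The function name `overrepresented_percentage` and the variable names are illustrative assumptions and are not part of the report or of FastQC.

```python
def overrepresented_percentage(count: int, total_sequences: int) -> float:
    """Share of all reads accounted for by one sequence, in percent."""
    return count / total_sequences * 100


# "Total Sequences" as listed under Basic Statistics.
TOTAL_SEQUENCES = 823077

# Counts from the Overrepresented sequences table, in row order.
counts = [2354, 1779, 1501, 1225]

percentages = [overrepresented_percentage(c, TOTAL_SEQUENCES) for c in counts]

for count, pct in zip(counts, percentages):
    print(f"{count}\t{pct:.4f}")
```

Together the four listed sequences account for well under 1% of the reads in this file.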
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTATGCG | 25 | 0.0023461657 | 35.218914 | 1 |
ACCGCGC | 25 | 0.002353171 | 35.19752 | 8 |
TACGGCG | 35 | 3.2183054E-4 | 31.42636 | 16 |
AATACGG | 35 | 3.2183054E-4 | 31.42636 | 14 |
GTATCAA | 740 | 0.0 | 25.878767 | 1 |
CGCACAA | 45 | 0.0013974189 | 24.445692 | 44 |
TAGGACC | 380 | 0.0 | 24.314077 | 4 |
GGTATCA | 320 | 0.0 | 22.699692 | 1 |
TAGGACG | 2655 | 0.0 | 21.874166 | 4 |
CTGTAGG | 2000 | 0.0 | 21.681646 | 1 |
TGTAGGA | 2120 | 0.0 | 21.583385 | 2 |
TTAGGAC | 1195 | 0.0 | 21.354145 | 3 |
GTCCTAC | 1535 | 0.0 | 21.223125 | 1 |
GTAGGAC | 2150 | 0.0 | 21.077585 | 3 |
GGACGTG | 2585 | 0.0 | 21.019798 | 6 |
AGGACGT | 2655 | 0.0 | 20.962742 | 5 |
ACGGGAT | 85 | 6.106826E-6 | 20.70694 | 43 |
GACGTGG | 1515 | 0.0 | 20.61901 | 7 |
TATGACC | 130 | 1.6243575E-9 | 20.306263 | 4 |
CGTGGAA | 1620 | 0.0 | 20.233143 | 9 |