Basic Statistics
Measure | Value |
---|---|
Filename | SRR1033033_2.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 152308 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 2293 | 1.5055020090868503 | No Hit |
GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 252 | 0.16545421120361373 | No Hit |
GCTTACTCTGCGTTGATACCACTGCTTACTCTGCGTTGATACCACTGCTT | 198 | 0.12999973737426793 | No Hit |
GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 184 | 0.12080783675184495 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCTAAG | 20 | 0.0021583452 | 70.508385 | 3 |
GTACATA | 25 | 0.0052180793 | 56.42525 | 1 |
GTATAAC | 25 | 0.0052180793 | 56.42525 | 1 |
TACATGA | 60 | 2.5340341E-8 | 54.83985 | 2 |
TACAAGA | 60 | 2.5340341E-8 | 54.83985 | 2 |
CTAACAA | 35 | 2.9276783E-4 | 53.720673 | 8 |
CAGGCTA | 55 | 5.257732E-5 | 42.732353 | 4 |
TCAACAC | 205 | 0.0 | 41.273197 | 3 |
GTACAAG | 80 | 2.4312612E-7 | 41.143414 | 1 |
GAGAACA | 260 | 0.0 | 39.78704 | 1 |
CGTATCA | 25 | 0.0017045642 | 37.57976 | 90-91 |
TTAGACC | 65 | 1.4067489E-4 | 36.158142 | 4 |
AGAACAA | 265 | 0.0 | 35.475914 | 2 |
GCTAACA | 55 | 0.0027080588 | 34.185883 | 7 |
GCCGTAT | 100 | 1.4064353E-6 | 32.882294 | 94 |
ACTATTA | 60 | 0.0041473308 | 31.337057 | 3 |
TACAGTG | 60 | 0.0041473308 | 31.337057 | 7 |
TCTATTC | 60 | 0.0041473308 | 31.337057 | 3 |
ATACGGC | 30 | 0.0041604894 | 31.32676 | 82-83 |
GGCTAAC | 60 | 0.004154012 | 31.32676 | 6 |