Basic Statistics
Measure | Value |
---|---|
Filename | SRR1544752_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 2295423 |
Sequences flagged as poor quality | 0 |
Sequence length | 52 |
%GC | 42 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 9263 | 0.40354217937173237 | No Hit |
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 5493 | 0.23930229853059765 | No Hit |
CTGTCTCTTATACACATCTGACGCTAAGTCCTTCGTATGCCGTCTTCTGCTT | 3130 | 0.13635830955775907 | Illumina Single End Adapter 2 (95% over 21bp) |
TTCAAAGGGACCTAATCGGAGGAGCTACTCTAGTATTAATAAATATTAGCCC | 2657 | 0.11575208578113924 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGATGCG | 35 | 1.020162E-7 | 46.000004 | 10 |
TTTACGC | 25 | 3.418229E-5 | 46.0 | 23 |
CGATTCG | 80 | 0.0 | 46.0 | 10 |
TATGCGT | 40 | 5.6152203E-9 | 46.0 | 20 |
CCCAACG | 30 | 1.8622213E-6 | 46.0 | 23 |
GTGTCGA | 25 | 3.418229E-5 | 46.0 | 33 |
CGAATGT | 50 | 1.6370905E-11 | 46.0 | 18 |
TATCGCC | 20 | 6.3127757E-4 | 46.0 | 34 |
TAGTCCG | 20 | 6.3127757E-4 | 46.0 | 15 |
CGAAGTA | 30 | 1.8622213E-6 | 46.0 | 39 |
TGCGTAG | 165 | 0.0 | 44.606064 | 1 |
TTACCGG | 120 | 0.0 | 44.083332 | 2 |
AATACGG | 420 | 0.0 | 43.261906 | 2 |
ACGCGAG | 70 | 0.0 | 42.714287 | 1 |
TACGGGT | 240 | 0.0 | 42.166668 | 4 |
TAACGCG | 60 | 1.8189894E-12 | 42.166668 | 1 |
CGACGGT | 290 | 0.0 | 42.034485 | 28 |
ACGGGAT | 1285 | 0.0 | 41.88327 | 5 |
TGTGACG | 105 | 0.0 | 41.619045 | 1 |
TACGGGA | 1000 | 0.0 | 41.4 | 4 |