FastQC Report
Fri 27 May 2016
SRR522047_1.fastq.gz

Summary

[OK] Basic Statistics

Measure                            Value
Filename                           SRR522047_1.fastq.gz
File type                          Conventional base calls
Encoding                           Sanger / Illumina 1.9
Total Sequences                    25497707
Sequences flagged as poor quality  0
Sequence length                    50
%GC                                47

[OK] Per base sequence quality

Per base quality graph

[OK] Per tile sequence quality

Per base quality graph

[OK] Per sequence quality scores

Per Sequence quality graph

[OK] Per base sequence content

Per base sequence content

[OK] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution

[WARN] Sequence Duplication Levels

Duplication level graph

[WARN] Overrepresented sequences

Sequence                                            Count  Percentage           Possible Source
CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCG  67032  0.2628942280966677   Illumina Paired End PCR Primer 2 (100% over 31bp)
GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAG  56157  0.22024333403784113  Illumina Paired End PCR Primer 2 (97% over 36bp)
AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT  48783  0.19132308642498713  No Hit
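
The Percentage column appears to be each sequence's count expressed as a share of Total Sequences from Basic Statistics. A minimal sketch of that arithmetic follows, assuming this reading of the column is correct; the function name `percent_of_total` is hypothetical and not part of FastQC.

```python
# Sketch only: recomputes the Percentage column from Count and Total Sequences.
# Assumes Percentage = 100 * Count / Total Sequences (an interpretation of the
# report, not a statement about FastQC internals).

TOTAL_SEQUENCES = 25497707  # "Total Sequences" from Basic Statistics

# Counts copied from the Overrepresented sequences table.
OVERREPRESENTED_COUNTS = {
    "CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCG": 67032,
    "GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAG": 56157,
    "AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 48783,
}


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


for sequence, count in OVERREPRESENTED_COUNTS.items():
    # Print the first 20 bases only, to keep the line short.
    print(f"{sequence[:20]}...  {count}  {percent_of_total(count):.4f}%")
```

Each of the three sequences accounts for well under 1% of the reads, which is consistent with the module reporting a warning rather than a failure.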

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence  Count  PValue  Obs/Exp Max  Max Obs/Exp Position
TATGCCG   2695   0.0     21.142239    43
GACCGAT   2925   0.0     19.627808    32
GATATCG   2535   0.0     19.437061    36
ACCGATA   2530   0.0     19.30136     33
AAGCAGT   19110  0.0     19.233564    1
GAGTACT   13845  0.0     18.939318    20
CGATATC   2615   0.0     18.67412     35
GTACTTT   15590  0.0     16.8759      22
CGAGATC   18410  0.0     16.509823    18
AGATCGG   18590  0.0     16.412628    20
GAGATCG   18650  0.0     16.32444     19
CCGATAT   3040   0.0     15.918744    34
CCGAGAT   20335  0.0     15.119979    17
AGTACTT   17700  0.0     15.050533    21
TCAACGC   22955  0.0     14.812228    12
CAACGCA   23090  0.0     14.716101    13
TCGTATG   4000   0.0     14.6830435   40
AGCAGTG   24970  0.0     14.648317    2
TGCCGAG   21805  0.0     14.546876    26
ATCGTAT   3515   0.0     14.518548    39
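
Several of the enriched k-mers look like substrings of the overrepresented sequences, at offsets that match the Max Obs/Exp Position column. The sketch below is one way to check this by hand. It assumes positions are 1-based read offsets, and `kmer_positions` is a hypothetical helper, not a FastQC function.

```python
# Sketch only: locate enriched k-mers inside the overrepresented sequences.
# Assumes the reported positions are 1-based offsets from the read start.

OVERREPRESENTED = [
    "CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCG",
    "GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAG",
    "AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT",
]

# (k-mer, Max Obs/Exp Position) pairs copied from the Kmer Content table.
KMERS = [("AAGCAGT", 1), ("GAGTACT", 20), ("CGAGATC", 18), ("TGCCGAG", 26)]


def kmer_positions(kmer, sequence):
    """Return every 1-based start position of kmer in sequence."""
    positions = []
    start = sequence.find(kmer)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based
        start = sequence.find(kmer, start + 1)
    return positions


for kmer, reported in KMERS:
    for index, sequence in enumerate(OVERREPRESENTED, start=1):
        hits = kmer_positions(kmer, sequence)
        if reported in hits:
            print(f"{kmer}: position {reported} matches overrepresented sequence {index}")
```

A match here only shows that the k-mer enrichment is compatible with those sequences being the source; it does not establish it.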