FastQC Report
Thu 26 May 2016
SRR522102_1.fastq.gz

[OK] Basic Statistics

Measure                            Value
Filename                           SRR522102_1.fastq.gz
File type                          Conventional base calls
Encoding                           Sanger / Illumina 1.9
Total Sequences                    40598193
Sequences flagged as poor quality  0
Sequence length                    50
%GC                                45

[OK] Per base sequence quality

Per base quality graph

[OK] Per tile sequence quality

Per tile quality graph

[OK] Per sequence quality scores

Per Sequence quality graph

[WARN] Per base sequence content

Per base sequence content

[OK] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution

[FAIL] Sequence Duplication Levels

Duplication level graph

[WARN] Overrepresented sequences

Sequence                                            Count   Percentage           Possible Source
AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT  189763  0.4674173552502694   No Hit
GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAG  133620  0.32912794911832655  Illumina Paired End PCR Primer 2 (97% over 36bp)
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA  87396   0.21527066487909943  No Hit
CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCG  79052   0.19471802599687135  Illumina Paired End PCR Primer 2 (100% over 31bp)
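
The Percentage column appears to be each sequence's count expressed as a percentage of Total Sequences from Basic Statistics. The minimal sketch below recomputes it to check that reading; the counts and reported values are copied from this report, while `percent_of_total` is a hypothetical helper name and not part of FastQC.

```python
# "Total Sequences" from the Basic Statistics module of this report.
TOTAL_SEQUENCES = 40598193

# (count, reported percentage) pairs copied from the overrepresented
# sequences table above.
ROWS = [
    (189763, 0.4674173552502694),
    (133620, 0.32912794911832655),
    (87396, 0.21527066487909943),
    (79052, 0.19471802599687135),
]


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


for count, reported in ROWS:
    computed = percent_of_total(count)
    # The two values should agree up to floating-point rounding.
    print(f"{count:>7}  computed={computed:.6f}  reported={reported:.6f}")
```

Under this assumption, each of the four sequences accounts for less than 0.5% of all reads.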

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence  Count  PValue  Obs/Exp Max  Max Obs/Exp Position
AAGCAGT   59775  0.0     31.358124    1
TCAACGC   58250  0.0     31.286901    12
ATCAACG   58805  0.0     30.991615    11
CAACGCA   59300  0.0     30.614313    13
GGTATCA   61590  0.0     29.588036    8
AACGCAG   61320  0.0     29.494707    14
GTATCAA   61885  0.0     29.468376    9
ACGCAGA   60985  0.0     29.43328     15
GTGGTAT   62240  0.0     29.184273    6
TGGTATC   62835  0.0     28.928839    7
CGCAGAG   62325  0.0     28.765127    16
AGTGGTA   64075  0.0     28.479       5
AGAGTAC   62585  0.0     27.881844    19
AGCAGTG   66855  0.0     27.856436    2
TATCAAC   66335  0.0     27.61608     10
GAGTACT   37615  0.0     27.058495    20
CAGAGTA   65210  0.0     26.927107    18
GCAGAGT   66530  0.0     26.528896    17
TATGCCG   3760   0.0     25.315134    3
CAGTGGT   72645  0.0     25.28579     4
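
Most of the enriched K-mers look like overlapping windows of the first 25 bases of the most abundant overrepresented sequence. The sketch below checks this by locating each K-mer in that sequence and comparing the 1-based start with the reported Max Obs/Exp Position. The sequence and K-mers come from this report; the positions are as read from the flattened K-mer table, and `first_position` is a hypothetical helper name.

```python
# Most abundant overrepresented sequence in this report: a 25-base prefix
# followed by a poly-T run (assumed 25 T's, giving the 50 bp read length).
OVERREPRESENTED = "AAGCAGTGGTATCAACGCAGAGTAC" + "T" * 25

# (k-mer, reported "Max Obs/Exp Position") as read from the K-mer table.
KMERS = [
    ("AAGCAGT", 1), ("TCAACGC", 12), ("ATCAACG", 11), ("CAACGCA", 13),
    ("GGTATCA", 8), ("AACGCAG", 14), ("GTATCAA", 9), ("ACGCAGA", 15),
    ("GTGGTAT", 6), ("TGGTATC", 7), ("CGCAGAG", 16), ("AGTGGTA", 5),
    ("AGAGTAC", 19), ("AGCAGTG", 2), ("TATCAAC", 10), ("GAGTACT", 20),
    ("CAGAGTA", 18), ("GCAGAGT", 17), ("TATGCCG", 3), ("CAGTGGT", 4),
]


def first_position(kmer, sequence):
    """Return the 1-based start of the first occurrence, or None if absent."""
    idx = sequence.find(kmer)
    return idx + 1 if idx >= 0 else None


for kmer, reported in KMERS:
    found = first_position(kmer, OVERREPRESENTED)
    status = "match" if found == reported else "no match"
    print(f"{kmer}  reported={reported:>2}  found={found}  {status}")
```

Every K-mer except TATGCCG occurs in the sequence at its reported position, which suggests that the Kmer Content failure and the top overrepresented sequence reflect the same 5' sequence.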