Basic Statistics
| Measure | Value |
|---|---|
| Filename | ENCFF001HPI_trimmed.fq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 16988516 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 20-36 |
| %GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage (%) | Possible Source |
|---|---|---|---|
| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 49533 | 0.2915675506913023 | No Hit |
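
The Percentage column appears to be the sequence's count expressed as a share of Total Sequences from Basic Statistics. A minimal check of that reading, assuming the column is simply count / total × 100 (the function name `overrepresented_percentage` is illustrative and not part of FastQC):

```python
def overrepresented_percentage(count, total_sequences):
    """Share of all reads accounted for by one sequence, in percent."""
    return count / total_sequences * 100

TOTAL_SEQUENCES = 16988516  # "Total Sequences" in Basic Statistics
POLY_A_COUNT = 49533        # "Count" for the poly-A sequence above

percentage = overrepresented_percentage(POLY_A_COUNT, TOTAL_SEQUENCES)
print(f"{percentage:.4f}%")  # prints 0.2916%
```

Under that assumption the result agrees with the reported value of 0.2915675506913023, so the poly-A sequence accounts for roughly 0.29% of reads.
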
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ATGCCGA | 895 | 0.0 | 12.8270645 | 14 |
| GCGGTTC | 935 | 0.0 | 10.548308 | 1 |
| CGGTTCA | 1620 | 0.0 | 10.086201 | 1 |
| GATCGGA | 405 | 0.0 | 9.473841 | 21 |
| TGCCGAG | 1210 | 0.0 | 8.760588 | 15 |
| AGATCGG | 490 | 0.0 | 8.43139 | 20 |
| ATCGGTT | 360 | 1.714188E-4 | 7.8001056 | 30 |
| AATGCCG | 1730 | 0.0 | 7.741962 | 14 |
| AGCGGTT | 970 | 0.0 | 7.4784517 | 30 |
| GAGCGGT | 805 | 0.0 | 7.3131285 | 6 |
| CGGAAGA | 920 | 0.0 | 6.5601788 | 1 |
| AGAGCGG | 970 | 0.0 | 6.3726935 | 5 |
| CCCGATT | 480 | 6.5239455E-4 | 6.337586 | 30 |
| ATCGGAA | 445 | 1.4791858E-6 | 6.3028536 | 22 |
| GATCGGT | 325 | 0.0026993307 | 6.124565 | 29 |
| ACCGATC | 290 | 0.002780506 | 6.1083913 | 22 |
| AAGAGCG | 1120 | 0.0 | 5.6507344 | 4 |
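
Several of the 7-mers above overlap by six bases, which suggests they may be fragments of a smaller number of longer enriched sequences. The sketch below is one way to explore that: it follows unambiguous 6-base overlaps from a starting k-mer and merges them into one string. The helper `extend_chain` is hypothetical and is not how FastQC itself reports k-mers; it stops at a dead end, a branch, or a repeat.

```python
# 7-mers copied from the Kmer Content table, in table order.
KMERS = [
    "ATGCCGA", "GCGGTTC", "CGGTTCA", "GATCGGA", "TGCCGAG", "AGATCGG",
    "ATCGGTT", "AATGCCG", "AGCGGTT", "GAGCGGT", "CGGAAGA", "AGAGCGG",
    "CCCGATT", "ATCGGAA", "GATCGGT", "ACCGATC", "AAGAGCG",
]

def extend_chain(start, kmers):
    """Merge k-mers by following unique (k-1)-base overlaps from `start`."""
    merged = start
    current = start
    seen = {start}
    while True:
        # A successor shares its first k-1 bases with the last k-1 of `current`.
        successors = [k for k in kmers
                      if k not in seen and k[:-1] == current[1:]]
        if len(successors) != 1:
            return merged  # dead end or ambiguous branch
        current = successors[0]
        seen.add(current)
        merged += current[-1]

print(extend_chain("AAGAGCG", KMERS))  # AAGAGCGGTTCA
print(extend_chain("AATGCCG", KMERS))  # AATGCCGAG
print(extend_chain("AGATCGG", KMERS))  # AGATCGG (two possible successors)
```

The merged strings only show that the listed k-mers are mutually consistent with longer shared sequences; identifying their source would need a comparison against known adapter or contaminant sequences.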