capstone_kmer_compression

This repo contains the code for the capstone project of Rob Hazell, John Partee, and Anjli Solsi, with significant contributions and advisement by Dr. John Santerre.

Gist

K-mer analysis has proven effective and highly accurate for classifying bacteria. However, the feature space grows exponentially with k-mer size, and larger k-mers are needed for higher-accuracy analysis. In this project, we explore tools and compression methods that make the analysis more space- and time-efficient, with the goal of lowering the barrier to entry for k-mer analysis and ultimately releasing our tools as a Python package.
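To make the exponential growth concrete, here is a minimal sketch, not taken from this repo's code. It assumes a four-letter DNA alphabet (A, C, G, T), and the helper names `count_kmers` and `feature_space_size` are hypothetical, chosen only for illustration.

```python
from collections import Counter


def count_kmers(sequence: str, k: int) -> Counter:
    """Count every overlapping k-mer (substring of length k) in a sequence."""
    sequence = sequence.upper()
    # A sequence of length n contains n - k + 1 overlapping k-mers.
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


def feature_space_size(k: int, alphabet_size: int = 4) -> int:
    """Number of distinct possible k-mers, assuming a 4-letter DNA alphabet."""
    return alphabet_size ** k


if __name__ == "__main__":
    toy_sequence = "ACGTACGTGA"  # made-up example sequence
    print(count_kmers(toy_sequence, 3))
    for k in (3, 6, 10):
        print(f"k={k}: {feature_space_size(k)} possible k-mers")
```

Under this assumption, each increase of k by one multiplies the number of possible k-mers by four, which is why a dense count table quickly becomes impractical and compression becomes attractive.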

Future plans

At this stage the methodology has been mostly proven. We are now exploring ways to speed up the trials so that we can test larger k and token sizes.

Files

.gitattributes
0.3 vis.ipynb
0.3.csv
0.4 vis.ipynb
0.4.csv
0.5 vis.ipynb
0.5.csv
README.md
kmer_compression_trials_with_col_comp.py
runner1.sh
submit.sh
