-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Project idea: Create package with master classes for population genetic data #8
Comments
Are you thinking kind of like a conversion engine, along the lines of PGDspider (http://www.cmpg.unibe.ch/software/PGDSpider/#Introduction) or Create (https://bcrc.bio.umass.edu/pedigreesoftware/node/2), but of course within R? I think that would be great and I think others would be interested too. Allan Strand and I have talked a bit about this before. |
I was thinking more along the lines of the data representation within R as opposed to flat file format (That's not to say different file format handlers in R are not needed). Generally along the lines of what Bioconductor emphasizes for future package development:
By creating a core set of classes that can be built upon, future developers can ensure interoperability within R. For example, I have created a class that contains the genind object from adegenet. These are still valid genind objects, so all of the methods associated with genind objects are also associated with the genclone objects and I didn't have to re-invent the wheel in terms of creating new methods to compute things like expected heterozygosity. Additionally, Bioconductor has a long presentation discussing the use of S4 classes and methods. |
I think it is essential indeed to reuse existing class wherever possible. Some classes have been around and used for a while, meaning they roughly do On Mon, Mar 9, 2015 at 8:23 PM, Zhian N. Kamvar [email protected]
|
My example actually does what you suggest and adds new slots onto the genind class (the @hierarchy slot is used to set up the data and feed it into the @pop slot). Admittedly, my initial suggestion is a bit of a lofty goal and, pragmatically, building off of the existing classes would be the thing to do (besides, adegenet already contains the modular virtual classes: gen, popinfo, and indinfo). Perhaps an alternative would be to construct a short tutorial for future developers that outlines the following:
The goal for either direction is to encourage future developers to contribute while maintaining interoperability between the packages and lowering the activation energy needed to do so. Thoughts? |
Hi Everyone, I feel it is absolutely essential to provide a 'referece' R object class to In 2004, the I and the other members of R-Genetics project ( One of my desires for the Hackathon is to revive, update, and extend The source code for all of the R-Genetics packages is available in the -Greg On Mon, Mar 9, 2015 at 3:09 PM, Sean [email protected] wrote:
"Whereas true religion and good morals are the only solid foundations of |
Hi there, On Tue, Mar 10, 2015 at 5:11 PM, Zhian N. Kamvar [email protected]
|
Would it be worth converting that to Git? |
Hi, |
I agree with all the great posts. Synthesis of available tools in a primer/wiki and move to github would be great. |
+1! |
Yes, absolutely. I'm very time constrained this week because of family health issue, so it would be a great help of someone could assist with doing this. Actually, the most recent Code for these packages is probably in the BioConductor svn tree. Change your thoughts and you change the world.
|
As mentioned in #4, we have a lot of different classes for handling population genetic data and they are all useful in their own right. One thing to think about is the fact that while the representation of the actual genetic data might be different (frequencies vs. values vs. bitwise), there are basic forms of metadata that is common between all of them: population assignment, individual assignment, etc.
Thinking along the same lines as modular synthesizers, I propose to create a package that defines a class or set of classes and methods that will formally define metadata that is often needed for population genetic analysis. This would allow for easier construction of conversion functions between future classes and result in a more consistent workflow between packages.
I realize that this might fall under the problem of proliferation of standards, but I believe that if we design these to be modular, it should not be an issue.
The text was updated successfully, but these errors were encountered: