The gsc-weighting package
In their 1994 paper "Volume Changes in Protein Evolution", Gerstein, Sonnhammer and Chothia developed a weighting procedure for protein sequences to avoid over-represented sequences in the appendix "A Method to Weight Protein Sequences to Correct for Unequal Representation". Although their method was developed for protein sequences, it is readily extended to work on any measurable set.
This package calculates GSC weights for any reasonable
dendrogram. If you want to recreate their algorithm, then just
UPGMA as linkage and residue identity as distance
function when creating the dendrogram.
|Versions||0.1, 0.1.0.1, 0.1.0.2, 0.1.1, 0.1.1.1, 0.2, 0.2.1, 0.2.2|
|Dependencies||base (==4.*), hierarchical-clustering (==0.*) [details]|
|Author||Felipe Almeida Lessa|
|Uploaded||Tue Aug 3 12:12:37 UTC 2010 by FelipeLessa|
|Downloads||1801 total (163 in the last 30 days)|
|Rating||(no votes yet) [estimated by rule of succession]|
|Status||Docs uploaded by user
Build status unknown [no reports yet]
Hackage Matrix CI
For package maintainers and hackage trustees