gsc-weighting: Generic implementation of Gerstein/Sonnhammer/Chothia weighting.
In their 1994 paper "Volume Changes in Protein Evolution", Gerstein, Sonnhammer and Chothia developed a weighting procedure for protein sequences to avoid over-represented sequences in the appendix "A Method to Weight Protein Sequences to Correct for Unequal Representation". Although their method was developed for protein sequences, it is readily extended to work on any measurable set.
This package calculates GSC weights for any reasonable
dendrogram. If you want to recreate their algorithm, then just
use UPGMA
as linkage and residue identity as distance
function when creating the dendrogram.
Changes in version 0.2:
Updated to work with
hierarchical-clustering
0.4.
Changes in version 0.1.1.1:
Use an stricter upper bound on
hierachical-clustering
.
Changes in version 0.1.1:
Now works even when some (or all) branches have distance zero (i.e. elements below that branches are all equal with respect to distance metric that was used to create the dendrogram).
Downloads
- gsc-weighting-0.2.tar.gz [browse] (Cabal source package)
- Package description (as included in the package)
Maintainer's Corner
For package maintainers and hackage trustees
Candidates
- No Candidates
Versions [RSS] | 0.1, 0.1.0.1, 0.1.0.2, 0.1.1, 0.1.1.1, 0.2, 0.2.1, 0.2.2 |
---|---|
Dependencies | base (>=4 && <5), hierarchical-clustering (>=0.4 && <0.5) [details] |
License | BSD-3-Clause |
Author | Felipe Almeida Lessa |
Maintainer | felipe.lessa@gmail.com |
Category | Clustering |
Source repo | head: darcs get http://patch-tag.com/r/felipe/gsc-weighting |
Uploaded | by FelipeLessa at 2011-10-30T18:46:13Z |
Distributions | |
Reverse Dependencies | 1 direct, 0 indirect [details] |
Downloads | 5768 total (19 in the last 30 days) |
Rating | (no votes yet) [estimated by Bayesian average] |
Your Rating | |
Status | Docs uploaded by user Build status unknown [no reports yet] |