Skip to content
Snippets Groups Projects
Commit 599a358b authored by Alexis  CRISCUOLO's avatar Alexis CRISCUOLO :black_circle:
Browse files

0.1

parent 8d995a81
No related branches found
No related tags found
No related merge requests found
...@@ -82,8 +82,8 @@ Launch _MSTclust_ without option to read the following documentation: ...@@ -82,8 +82,8 @@ Launch _MSTclust_ without option to read the following documentation:
* Specific rows can be selected using option `-r`. * Specific rows can be selected using option `-r`.
* A [minimum spanning tree](https://en.wikipedia.org/wiki/Minimum_spanning_tree) and a [single-linkage hierarchical classification tree](https://en.wikipedia.org/wiki/Single-linkage_clustering) can be written in [GraphML](http://graphml.graphdrawing.org/index.html) and [Newick](http://evolution.genetics.washington.edu/phylip/newicktree.html) output files using options `-m` and `-h`, respectively. These two options require important running times when considering large datasets (e.g. > 5,000 rows). * A [minimum spanning tree](https://en.wikipedia.org/wiki/Minimum_spanning_tree) and a [single-linkage hierarchical classification tree](https://en.wikipedia.org/wiki/Single-linkage_clustering) can be written in [GraphML](http://graphml.graphdrawing.org/index.html) and [Newick](http://evolution.genetics.washington.edu/phylip/newicktree.html) output files using options `-m` and `-h`, respectively. These two options require important running times when considering large datasets (e.g. > 5,000 rows).
* By definition, setting small cutoff values yield clustering with a large number of classes with fast running times. * By definition, setting small cutoff values yield clustering with a large number of classes with fast running times.
* Profile length is required to carry out data perturbation analyses (option `-L`). * Profile length is required to carry out data perturbation analyses (option `-L`). For each statistics, _MSTclust_ will compute the average value and the 95% CI based on different replicates (option `-R`).
* Data subsampling analyses progressively subsample raw data with rate values _b_/(_B_+1) where _b_ = 1 ... _B_ and _B_ > 1 is the number of bins specified using option `-B`. At least _B_ = 9 bins (i.e. rates = 10%, 20% ... 90%) are recommended. * Data subsampling analyses progressively subsample raw data with rate values _b_/(_B_+1) where _b_ = 1 ... _B_ and _B_ > 1 is the number of bins specified using option `-B`. At least _B_ = 9 bins (i.e. rates = 10%, 20% ... 90%) are recommended. For each statistics, _MSTclust_ will compute the average value and the 95% CI based on different replicates (option `-R`).
* For more details on the profile distance computation, clustering algorithm and descriptive statistics, see the technical notes pdf file. * For more details on the profile distance computation, clustering algorithm and descriptive statistics, see the technical notes pdf file.
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment