One of the recurrent – and vexing – problems of ecology is the decision regarding the most adequate number of groups for clustering multivariate data. I already provided some functions (making use of heatmaps and networks) to facilitate this process in the EcotoneFinder package.
Here I present another set of analyses that may be used to the same effect, using different partition indices and, particularly, their evolution when the data are subjected to segmentation by increasing number of groups.
The initial idea for these functions came from a figure in a publication by Pavão et.al, 2019 [1] – which I intended to reproduce for my own data – and the extension of this protocol from k-means clustering to the fuzzy-c-means clustering I was using at the time.
Three functions are currently in the repository:
cascadeFCM: and extension of thevegan::cascadeKMfor fuzzy-c-means clustering.KMeans_indices_testto produce the data needed to draw a plot similar to the one in Pavão et. al, 2019, with k-means clustering.FCM_indices_testto produce the data needed to draw a plot similar to the one in Pavão et. al, 2019, with fuzzy-c-means clustering.
All these functions might eventually be integrated in future versions of the EcotoneFinderpackage.
Considering the artificial data presented bellow, and provided in this repository:
The associated heatmap and networks (qgraph, running a spinglass algorithm to determine statistical communities in the network) both highlight three main groups, either as "squares" of more closely related species along the diagonal of the heatmap, or as groups of related nodes. This correctly describes the three main communities in the artificial data.
Now – running the KMeans_indices_test on the same data – we obtain an optimum at
The use of fuzzy indices (and thus, fuzzy clusters) – using the FCM_indices_test – now finds back the
[1] Pavão DC, Elias RB, Silva L (2019) Comparison of discrete and continuum community models: Insights from numerical ecology and Bayesian methods applied to Azorean plant communities. Ecological Modelling 402:93–106 doi:10.1016/j.ecolmodel.2019.03.021




