Fuzzy C-Means Clustering for Functional Data

Performs fuzzy c-means clustering on functional data, where each curve has a membership degree to each cluster rather than a hard assignment.

Usage

cluster.fcm(fdataobj, ncl, m = 2, max.iter = 100, tol = 1e-06, seed = NULL)

Arguments

fdataobj: An object of class 'fdata'.
ncl: Number of clusters.
m: Fuzziness parameter (default 2). Must be > 1. Higher values give softer cluster boundaries.
max.iter: Maximum number of iterations (default 100).
tol: Convergence tolerance (default 1e-6).
seed: Optional random seed for reproducibility.

Value

A list of class 'fuzzycmeans.fd' with components:

membership: Matrix of membership degrees (n x ncl). Each row sums to 1.
cluster: Hard cluster assignments (argmax of membership).
centers: An fdata object containing the cluster centers.
objective: Final value of the objective function.
fdataobj: The input functional data object.

Details

Fuzzy c-means minimizes the objective function: $$J = \sum_{i=1}^n \sum_{c=1}^k u_{ic}^m ||X_i - v_c||^2$$ where u_ic is the membership of curve i in cluster c, v_c is the cluster center, and m is the fuzziness parameter.

The membership degrees are updated as: $$u_{ic} = 1 / \sum_{j=1}^k (d_{ic}/d_{ij})^{2/(m-1)}$$

When m approaches 1, FCM becomes equivalent to hard k-means. As m increases, the clusters become softer (more overlap). m = 2 is the most common choice.

Examples