Abalone dataset contains 4177 entries in which each entry records the features of an abalone together with its age as the desired output. Characteristics of this dataset can be listed below.
- Data size: 4177 entries.
- Features: 8 features of an abalone's physical measurements, with no missing data.
- Whole weight
- Shucked weight
- Viscera weight
- Shell weight
- Classes: 28 classes corresponding to the age from 1 to 29 years of abalones.
We can display the data sizes among all classes, as follows:
We can display the feature distributions over different classes, as follows:
(In order not to clotter the plot, we have only shown the distributions among the first 8 classes.)
We can plot the classes w.r.t. each of the features:
We can have a scatter plot after projecting the dataset onto a 2D plane:
We can have another scatter plot after projecting the dataset onto a 3D space, which generates C(8, 3) = 56 subplots, as follows:
Since there are 28 classes, it is hard to observe the distribution of each class in either 2D or 3D projection.
Data Clustering and Pattern Recognition (資料分群與樣式辨認)