Ocena jakości aplikacyjnej odpornego algorytmu analizy skupień TCLUST na przykładzie zbioru danych dotyczących jakości powietrza w Krakowie
Evaluation of the Quality of Robust Clustering Algorithm TCLUST on the Example of Dataset of Air Pollutants Emission in Krakow
Author(s): Ewa Szlachtowska, Daniel Kosiorowski, Dominik MielczarekSubject(s): Economy, Energy and Environmental Studies, ICT Information and Communications Technologies
Published by: Główny Urząd Statystyczny
Keywords: robust cluster analysis; tclust algorithm; air quality testing
Summary/Abstract: Acquisition and data collection is currently a very dynamic processes. In order to obtain from data useful information, when huge quantities of data, the processing of the data is not a trivial task. Cluster analysis is very helpful in this and the result of grouping the result of grouping allows us to comprehend the available information and look at it from a different perspective. In any case, we are not able to show the entire spectrum of issues related to data analysis. Therefore we limit our discussion to the analysis of clusters, then we describe the TCLUST algorithm. The authors of the algorithm are H. Fritz, L. A. García-Escudero, A. Mayo-Iscar (see Fritz et al. 2011, 2012). In the paper we present the pros and cons robust clustering algorithm, and we discuss the available functions in the package tclust. Then on the example of dataset of air pollutants emission in Krakow we try to evaluate the quality of robust clustering algorithm.
Journal: Przegląd Statystyczny. Statistical Review
- Issue Year: 63/2016
- Issue No: 1
- Page Range: 67-80
- Page Count: 14
- Language: Polish