Data Transformation for Clustering Utilization for Feature Detection in Mass Spectrometry

Warning

This publication doesn't include Faculty of Education. It includes Faculty of Science. Official publication website can be found on muni.cz.

Authors	BARTOŇ Vojtěch SKUTKOVA Helena
Year of publication	2022
Type	Article in Proceedings
Conference	Bioinformatics and Biomedical Engineering (IWBBIO 2022) : Lecture Notes in Computer Science, vol 13347
MU Faculty or unit	Faculty of Science
Citation
Web	https://doi.org/10.1007/978-3-031-07802-6_24
Doi	http://dx.doi.org/10.1007/978-3-031-07802-6_24
Keywords	Mass spectrometry; Clustering; Feature identification
Description	Feature detection and peak detection are one of the first steps of mass spectrometry data processing. This data comes in large volumes; thus, the processing needs to be optimized, not overloaded. State-of-the-art clustering algorithms can not perform feature detection for several reasons. First issue is the volume of the data, second is the disparity of the sampling frequency in the MZ and RT axis. Here we show the data transformation to utilize the clustering algorithms without the need to redefine its kernel. Data are first pre-clustered to obtain regions that can be processed independently. Then we transform the data so that the numerical differences between consecutive points should be the same in both space axes. We applied a set of clustering algorithms for each region to find the features, and we compared the result with the Gridmass peak detector. These findings may facilitate better utilization of the 2D clustering method as feature detectors for mass spectra.
Related projects:	CETOCOEN Excellence CETOCOEN Excellence RECETOX RI