Тип публикации: доклад, тезисы доклада, статья из сборника материалов конференций
Конференция: International Workshop “Hybrid methods of modeling and optimization in complex systems” (HMMOCS 2022); Krasnoyarsk; Krasnoyarsk
Год издания: 2022
Идентификатор DOI: 10.15405/epct.23021.47
Ключевые слова: Logical analysis of data, interpretable machine learning, gene expression, cancer prediction
Аннотация: The paper analyzes gene expression data in relation to the diagnosis and prognosis of the development of oncological diseases. The goal is to create a hybrid prediction model based on gene expression data and interpretive machine learning. The experiments were carried out on four publicly available gene expression datasets in relatПоказать полностьюion to the prediction of breast and lung cancers. Data sets contain information about positive and negative observations, described by tens of thousands of attributes with gene expression data. Logical analysis of data is investigated as the main method for building a model. This method is based on combinatorics and optimization. As a result of logical analysis of data, a set of patterns is built, each of which involves only a small number of input attributes (genes). The search for a reference set of attributes, which is a step in the logical analysis of data, yields a small number of genes that have a combinatorial effect on the result. The resulting patterns have a small number of conditions and are understandable to the user. A comparison was made with other machine learning algorithms, including rule based classifiers: RIPPER, decision trees, and others. Logical analysis of data has advantages both in terms of classification accuracy and result interpretability, and therefore provides greater confidence in the recognition result.
Журнал: HYBRID METHODS OF MODELING AND OPTIMIZATION IN COMPLEX SYSTEMS
Номера страниц: 379-386
Место издания: London, United Kingdom
Издатель: European Proceedings