Statistical causality analysis
- Authors: Grusho A.A.1,2, Grusho N.A.1, Zabezhailo M.I.1, Samouylov K.E.2, Timonina E.E.1,2
-
Affiliations:
- Federal Research Center “Computer Sciences and Control” of the Russian Academy of Sciences
- RUDN University
- Issue: Vol 32, No 2 (2024)
- Pages: 213-221
- Section: Articles
- URL: https://journals.rudn.ru/miph/article/view/41389
- DOI: https://doi.org/10.22363/2658-4670-2024-32-2-213-221
- EDN: https://elibrary.ru/CPUADE
Cite item
Full Text
Abstract
The problem of identifying deterministic cause-and-effect relationships, initially hidden in accumulated empirical data, is discussed. Statistical methods were used to identify such relationships. A simple mathematical model of cause-and-effect relationships is proposed, in the framework of which several models of causal dependencies in data are described - for the simplest relationship between cause and effect, for many effects of one cause, as well as for chains of cause-and-effect relationships (so-called transitive causes). Estimates are formulated that allow using the de Moivre-Laplace theorem to determine the parameters of causal dependencies linking events in a polynomial scheme trials. The statements about the unambiguous identification of causeand-effect dependencies that are reconstructed from accumulated data are proved. The possibilities of using such data analysis schemes in medical diagnostics and cybersecurity tasks are discussed.
Full Text
1. Introduction The simplest idea of a causal relationship is given by the functional dependenceAbout the authors
Alexander A. Grusho
Federal Research Center “Computer Sciences and Control” of the Russian Academy of Sciences; RUDN University
Author for correspondence.
Email: grusho@yandex.ru
ORCID iD: 0000-0003-4400-2158
Principal scientist, Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences; professor of Department of Probability Theory and Cyber Security of Peoples’ Friendship University of Russia named after Patrice Lumumba
44 Vavilova St, bldg. 2, Moscow, 119133, Russian Federation; 6 Miklukho-Maklaya St, Moscow, 117198, Russian FederationNikolai A. Grusho
Federal Research Center “Computer Sciences and Control” of the Russian Academy of Sciences
Email: info@itake.ru
ORCID iD: 0000-0002-5005-2744
Candidate of Physical and Mathematical Sciences, Senior scientist, Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences
44 Vavilova St, bldg. 2, Moscow, 119133, Russian FederationMichael I. Zabezhailo
Federal Research Center “Computer Sciences and Control” of the Russian Academy of Sciences
Email: m.zabezhailo@yandex.ru
ORCID iD: 0000-0002-5067-5909
Professor, Doctor of Physical and Mathematical Sciences, Principal scientist, Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences
44 Vavilova St, bldg. 2, Moscow, 119133, Russian FederationKonstantin E. Samouylov
RUDN University
Email: samuylovke@rudn.ru
ORCID iD: 0000-0002-6368-9680
Professor, Doctor of Technical Sciences, Head of the Department of Probability Theory and Cyber Security of Peoples’ Friendship University of Russia named after Patrice Lumumba
6 Miklukho-Maklaya St, Moscow, 117198, Russian FederationElena E. Timonina
Federal Research Center “Computer Sciences and Control” of the Russian Academy of Sciences; RUDN University
Email: eltimon@yandex.ru
ORCID iD: 0000-0002-6493-3622
Professor, Doctor of Technical Sciences, Leading scientist, Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences; professor of Department of Probability Theory and Cyber Security of Peoples’ Friendship University of Russia named after Patrice Lumumba
44 Vavilova St, bldg. 2, Moscow, 119133, Russian Federation; 6 Miklukho-Maklaya St, Moscow, 117198, Russian FederationReferences
- Zhang, X., Hu, W. & Yang, F. Detection of Cause-Effect Relations Based on Information Granulation and Transfer Entropy. Entropy 24, 212. doi: 10.3390/e24020212 (2022).
- Reimer, J., Wang, Y., Laridi, S., Urdich, J., Wilmsmeier, S. & Palmer, G. Identifying cause-andeffect relationships of manufacturing errors using sequence-to-sequence learning. Scientific Reports 12, 22332. doi: 10.1038/s41598-022-26534-y (2022).
- Pearl, J. Causal Inference in Proceedings of Workshop on Causality: Objectives and Assessment at NIPS 2008 (eds Guyon, I., Janzing, D. & Schölkopf, B.) 6 (PMLR, Whistler, Canada, Dec. 2010), 39-58.
- Pearl, J. The mathematics of causal inference in Joint Statistical Meetings Proceedings. ASA (2013), 2515-2529.
- Yao, L., Chu, Z., Li, S., Li, Y., Gao, J. & Zhang, A. A Survey on Causal Inference. ACM Transactions on Knowledge Discovery from Data 15, 1-46. doi: 10.1145/3444944 (2021).
- Höfler, M. Causal inference based on counterfactuals. BMC Medical Research Methodology 5, 1- 28. doi: 10.1186/1471-2288-5-28 (2005).
- Richens, J. G., Lee, C. M. & Johri, S. Improving the accuracy of medical diagnosis with causal machine learning. Nature Communications 11, 3923. doi: 10.1038/s41467-020-17419-7 (2020).
- Grusho, A. A., Grusho, N. A. & Timonina, E. E. Root Cause Anomaly Localization [Lokalizatsiya iskhodnoy prichiny anomalii]. Information Security Problems. Computer Systems. in Russian, 9- 16 (2020).
- Smirnov, D. V. Methodology of problem-oriented Big Data analysis in limited time mode [Metodika problemno-orientirovannogo analiza Big Data v rezhime ogranichennogo vremeni]. International Journal of Open Information Technologies 9. in Russian, 88-94 (2021).
- Grusho, A., Grusho, N., Zabezhailo, M. & Timonina, E. Evaluation of Trust in ComputerComputed Results in Distributed Computer and Communication Network (eds Vishnevskiy, V. M., Samouylov, K. E. & Kozyrev, D. V.) (Springer International Publishing, Cham, 2022), 420-432. doi: 10.1007/978-3-030-97110-6_33.
- Guyatt, G. etal. Evidence-Based Medicine: A New Approach toTeaching the Practice of Medicine. JAMA 268, 2420-2425. doi: 10.1001/jama.1992.03490170092032 (Nov. 1992).
- Shiryaev, A. N. Probability [Veroyatnost’] in Russian. 521 pp. (MTsNMO, Moscow, 2004).
- Grusho, A., Grusho, N. & Timonina, E. Method of Several Information Spaces for Identification of Anomalies in Intelligent Distributed Computing XIII (eds Kotenko, I., Badica, C., Desnitsky, V., El Baz, D. & Ivanovic, M.) (Springer International Publishing, Cham, 2020), 515-520. doi:10. 1007/978-3-030-32258-8_60.