08.00.13 Mathematical and instrumental methods of Economics

BASIC REQUIREMENTS FOR DATA ANALYSIS METHODS (ON THE EXAMPLE OF CLASSIFICATION TASKS)
Description: Classification methods need to be put in order. Doing so will increase their role in solving applied problems, in particular in the diagnosis of materials. The first step is to develop requirements that classification methods must satisfy; an initial formulation of such requirements is the main content of this work. Mathematical classification methods are considered as part of applied statistics. We discuss the natural requirements for data analysis methods and for the presentation of calculation results that follow from the achievements and ideas accumulated by the national probabilistic and statistical scientific school. Concrete recommendations are given on a number of issues, along with criticism of individual errors. In particular, data analysis methods must be invariant with respect to the permissible transformations of the scales in which the data are measured, i.e. methods should be adequate in the sense of measurement theory. A specific statistical method of data analysis always rests on some probabilistic model. That model should be clearly described and its premises justified, either from theoretical considerations or experimentally. Data processing methods intended for real-world problems should be investigated for stability with respect to admissible deviations of the initial data and model premises. The accuracy of the solutions given by the method should be indicated. When publishing the results of statistical analysis of real data, it is necessary to indicate their accuracy (confidence intervals). As a quality measure for a classification algorithm, it is recommended to use predictive power instead of the proportion of correct forecasts. Mathematical research methods are divided into "exploratory analysis" and "evidence-based statistics." Specific requirements for data processing methods arise in connection with their "docking" during sequential execution. The article discusses the limits of applicability of probabilistic-statistical methods. Concrete statements of classification problems and typical errors in applying various methods for solving them are also considered.
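The invariance requirement can be illustrated with a minimal sketch. It assumes two small made-up samples measured on an ordinal scale and takes the natural logarithm as one example of a permissible (strictly increasing) transformation. The data, the helper `compare` and the choice of transformation are illustrative assumptions and are not taken from the article.

```python
import math
from statistics import mean, median

def compare(stat, a, b):
    """Return 1 if stat(a) > stat(b), -1 if smaller, 0 if equal."""
    sa, sb = stat(a), stat(b)
    return (sa > sb) - (sa < sb)

# Hypothetical ordinal measurements for two groups.
a = [1, 1, 10]
b = [3, 3, 3]

# A strictly increasing map is a permissible transformation of an ordinal scale.
ga = [math.log(v) for v in a]
gb = [math.log(v) for v in b]

mean_before = compare(mean, a, b)
mean_after = compare(mean, ga, gb)
median_before = compare(median, a, b)
median_after = compare(median, ga, gb)

print("mean comparison:  ", mean_before, "->", mean_after)
print("median comparison:", median_before, "->", median_after)
```

In this example the conclusion drawn from comparing means reverses after the transformation, while the conclusion drawn from comparing medians does not. This is the sense in which a comparison of means is not adequate for ordinal data.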

Description: The article discusses the use of machine learning methods and fuzzy production systems for studying the social and economic development of the urban districts, areas and settlements of the Krasnodar region. The fundamental patterns and their connection with quantitative and qualitative indicators are considered.

SCORING SYSTEM BASED ON INFORMATION-COGNITIVE MODELING
Description: One of the key problems facing a credit institution is the late repayment of loans. Scoring helps with this problem in several ways. First, it provides a deeper analysis: carried out "manually", such an analysis would take not days but weeks. Second, it allows the institution to work with clients much faster. Most importantly, scoring removes the influence of the human factor: an automated system cannot like or dislike an applicant, and its analysis is based only on facts. Scoring benefits everyone. The bank can work faster and reduce the risk of loan defaults, and clients, in turn, can apply for a loan on more favorable terms.

STRATEGIC PLANNING AND MANAGEMENT OF A HOLDING BASED ON INFORMATION AND COGNITIVE TECHNOLOGIES
Description: In this article, we develop a methodology for the strategic planning and management of a holding on the theoretical basis of automated system-cognitive analysis (ASC-analysis). The methodology supports the scientific study of any holding by creating and investigating its model. It includes both the synthesis, adaptation and verification of system-cognitive models of the holding and the use of these models for strategic planning and decision support in managing the holding as a complex, multiparametric, nonlinear system. The research is relevant because of the special role of holdings and other corporate integrated structures both in Russia as a whole and in the Krasnodar region in particular. Despite their obvious systemic advantages, holdings face a wide range of problems related to management efficiency, ensuring sustainable functioning, and so on. The proposed methodology offers ways to solve these problems and can be applied in holdings and other corporate integrated structures of various regions, sizes and areas of activity. The scientific novelty of the research lies in the development of conceptual, theoretical and methodological provisions aimed at managing the development of holdings. The expected result is that holding companies and other corporate integrated structures can apply the methodology to significantly improve the quality of their management.

PROBABILISTIC-STATISTICAL MODELS OF CORRELATION AND REGRESSION
Description: The correlation and determination coefficients are widely used in statistical data analysis. According to measurement theory, Pearson's linear paired correlation coefficient is applicable to variables measured on an interval scale. It cannot be used in the analysis of ordinal data. The nonparametric Spearman and Kendall rank coefficients estimate the relationship between ordinal variables. The critical value for testing whether a correlation coefficient differs significantly from 0 depends on the sample size. Therefore, using the Chaddock scale is incorrect. In a passive experiment, correlation coefficients can reasonably be used for prediction, but not for control. To obtain probabilistic-statistical models intended for control, an active experiment is required. Outliers have a very large effect on the Pearson correlation coefficient. As the number of analyzed sets of predictors grows, the maximum of the corresponding correlation coefficients (indicators of approximation quality) noticeably increases; this is the effect of "inflation" of the correlation coefficient. Four main models of regression analysis are considered. Models of the least squares method with a deterministic independent variable are distinguished. The distribution of deviations is arbitrary; however, to obtain the limit distributions of parameter estimates and regression dependences, we assume that the conditions of the central limit theorem are satisfied. The second type of model is based on a sample of random vectors. The dependence is nonparametric, and the distribution of the two-dimensional vector is arbitrary. The estimate of the variance of the independent variable, as well as the determination coefficient as a quality criterion for the model, can be discussed only in the model based on a sample of random vectors. Time series smoothing is discussed. Methods of restoring dependences in spaces of a general nature are considered. It is shown that the limiting distribution of the natural estimate of the dimensionality of the model is geometric, and that the construction of an informative subset of features encounters the effect of "inflation" of the correlation coefficient. Various approaches to the regression analysis of interval data are discussed. Analysis of the variety of regression analysis models leads to the conclusion that there is no single "standard model".
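The "inflation" effect can be reproduced in a small simulation. The sketch below is an illustration under stated assumptions, not the article's own calculation: the response and every candidate predictor are independent standard normal noise, the sample size of 20 and the candidate counts are arbitrary choices, and the helper names are hypothetical.

```python
import math
import random

def pearson(x, y):
    """Pearson linear correlation coefficient of two equal-length samples."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def max_abs_correlation(y, predictors, k):
    """Largest |r| between y and any of the first k candidate predictors."""
    return max(abs(pearson(p, y)) for p in predictors[:k])

random.seed(0)  # fixed seed so that the run is reproducible
n = 20
y = [random.gauss(0, 1) for _ in range(n)]
# Every candidate is pure noise, so the true correlation with y is 0.
predictors = [[random.gauss(0, 1) for _ in range(n)] for _ in range(1000)]

counts = [1, 10, 100, 1000]
best = [max_abs_correlation(y, predictors, k) for k in counts]
for k, r in zip(counts, best):
    print(f"{k:5d} candidate predictors: max |r| = {r:.3f}")
```

Because each larger candidate set contains the smaller ones, the maximum can only grow. With many candidates the best-looking predictor shows a substantial sample correlation even though no real dependence exists, which is why a correlation selected as the maximum over many candidates cannot be judged against the usual single-test critical value.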

Description: This article is devoted to the rating assessment of the socio-economic situation of the Krasnodar region as presented by such agencies as "RAEKS-Analytics", "Expert RA" and "National Rating Agency". The methodologies used by these agencies were studied, analyzed and compared. As a result, a number of shortcomings were identified, including the lack of a complete methodological model in the public domain. Some agencies do not provide links to the statistics used in their analysis. Using the STATISTICA environment, the article carries out a statistical analysis of data reflecting the socio-economic situation of the Krasnodar region. Based on the work [12], a discriminant model was created for assessing the socio-economic development of the urban districts of the Krasnodar region with a confidence of 85%. The study applied cluster, discriminant, classification (decision tree) and coefficient (proposed by the authors) analyses to data from the Federal State Statistics Service website for the period from 2009 to 2018 for the following urban districts: Krasnodar, Anapa, Armavir, Gelendzhik, Goryachiy Klyuch, Novorossiysk and Sochi. Cluster analysis and classification trees showed poor results, since they are not able to detect latent nonlinear relationships between the studied indicators. Using the constructed discriminant model, we analyzed the socio-economic development of the urban districts of the Krasnodar region for the period 2009-2018, which allows us to identify the leaders and the outsiders.

EXISTENCE OF ASYMPTOTICALLY OPTIMAL PLANS IN DISCRETE PROBLEMS OF DYNAMIC PROGRAMMING
Description: Dynamic programming is designed to solve discrete optimal control problems. According to this method, the optimal solution of a multidimensional problem is found by decomposing it into stages, each of which is a subproblem in one variable. In economic problems, the number of stages is the planning horizon. Choosing a planning horizon is necessary for a rigorous statement of an applied problem in economics and management, but the choice is often difficult to justify. We see a way out in the use of asymptotically optimal plans, for which the values of the optimization criterion differ little from its values for optimal plans for all sufficiently large planning horizons. The main result of the paper is the existence of an asymptotically optimal plan. The proof is carried out in several statements. If the sum of the maxima of the transition functions tends to 0, the existence of an asymptotically optimal plan is obtained in Theorem 1. A special case is discounted models with a discount coefficient less than 1. The main part of the article is devoted to models with a discount coefficient equal to 1. Theorem 2, the highway ("magistral") theorem, is proved for a base set with a finite number of elements. Theorem 3 establishes the approximation of an arbitrary set by a finite one. In the final Theorem 4, the existence of an asymptotically optimal plan is proved in the general case. The term "magistral" is associated with a well-known recommendation to drivers: in order to get from point A to point B, it is advisable to join the highway (magistral) on the initial section of the road, and then leave the highway and proceed to point B. A similar recommendation for choosing the optimal trajectory is obtained using the Pontryagin maximum principle in the model of the optimal distribution of time between acquiring knowledge and developing skills. This fact underlines the methodological proximity of dynamic programming and the Pontryagin maximum principle.
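The discounted special case (discount coefficient less than 1) can be sketched numerically. The example below is only an illustration under stated assumptions: a hypothetical three-state problem in which any state can be reached from any other in one step, a made-up payoff matrix `payoff`, and a discount coefficient `beta = 0.9`. It does not reproduce the article's theorems or their proofs.

```python
def finite_horizon_values(payoff, beta, horizon):
    """Backward induction: values[t][s] is the best discounted total payoff
    obtainable in t stages when starting from state s."""
    n = len(payoff)
    values = [[0.0] * n]  # with zero stages left nothing is earned
    for _ in range(horizon):
        prev = values[-1]
        values.append([
            max(payoff[s][nxt] + beta * prev[nxt] for nxt in range(n))
            for s in range(n)
        ])
    return values

# Hypothetical one-step payoffs: payoff[s][nxt] is earned when moving s -> nxt.
payoff = [
    [1.0, 0.0, 2.0],
    [0.5, 1.5, 0.0],
    [3.0, 0.0, 0.5],
]
beta = 0.9
horizon = 60
max_payoff = max(max(abs(v) for v in row) for row in payoff)

values = finite_horizon_values(payoff, beta, horizon)

# Largest change in the optimal value when the horizon grows from t-1 to t.
gaps = [
    max(abs(a - b) for a, b in zip(values[t], values[t - 1]))
    for t in range(1, horizon + 1)
]
for t in (1, 10, 30, 60):
    print(f"horizon {t:2d}: max change in optimal value = {gaps[t - 1]:.6f}")
```

In this setting the change in the optimal value from horizon t-1 to horizon t is bounded by beta**(t-1) times the largest one-step payoff, so the exact choice of a long planning horizon has little influence on the value of the criterion. The case of a discount coefficient equal to 1, which the article treats as its main subject, is not covered by this simple bound.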

Description: We use an adaptive management system for open systems to assess the impact of investments on the results of the agro-industrial complex.

Description: The article provides a comparative analysis of assessments of the socio-economic development of the Krasnodar region by such well-known rating agencies of the United States of America as Standard & Poor's, Moody's and Fitch Ratings. These ratings are compared with those of "Expert RA", a national agency of the Russian Federation. The article examines the values of the assigned ratings and a number of possible reasons, for example economic and political ones, why the ratings of the United States agencies differ from those of the Russian agency, and then how these ratings affect the investment attractiveness of the Krasnodar region. The article explains the positive and negative aspects of the integrated methodology used by international rating agencies, which combines software and expert opinion, and the level of access to it for study and analysis. We also study another (local) source of information on the investment attractiveness of the Krasnodar region, a state institution: the Department of Investments and Development of Small and Medium Enterprises of the Krasnodar region. Options are proposed for improving the system of statistical data analysis through methods based on a clear mathematical approach, in order to provide an adequate assessment of the region and its municipalities without the influence of subjective expert opinion.

SYSTEM OF MODELS AND METHODS OF TESTING THE HOMOGENEITY OF TWO INDEPENDENT SAMPLES
Description: The new paradigm of mathematical research methods allows us to give a systematic analysis of the various statements of statistical analysis problems and of the methods for solving them, based on the probabilistic-statistical model of data generation accepted by the researcher. Methods for testing the homogeneity of two independent samples are a classic area of mathematical statistics. In the more than 110 years since the publication of Student's fundamental article, various criteria have been developed for testing the statistical hypothesis of homogeneity in various statements, and their properties have been studied. However, there is an urgent need to put the accumulated scientific results in order. It is necessary to analyze the whole variety of problem statements for testing statistical hypotheses of the homogeneity of two independent samples, as well as the corresponding statistical criteria. This article is devoted to such an analysis. It contains a summary of the main results concerning methods for testing the homogeneity of two independent samples and a comparative study of them, allowing a systematic analysis of the diversity of such methods in order to select the most appropriate one for processing specific data. Based on the basic probabilistic-statistical model, the main statements of the problem of testing the homogeneity of two independent samples are formulated. A comparative analysis is given of the Student and Cramér-Welch criteria, which are designed to test the homogeneity of mathematical expectations, and a recommendation for the widespread use of the Cramér-Welch criterion is substantiated. Among nonparametric methods for testing homogeneity, the criteria of Wilcoxon, Smirnov and Lehmann-Rosenblatt are considered. Two myths about the Wilcoxon criterion are debunked. Based on an analysis of the publications of the founders, the term "Kolmogorov-Smirnov criterion" is shown to be incorrect. To verify absolute homogeneity, i.e. coincidence of the distribution functions of the samples, it is recommended to use the Lehmann-Rosenblatt criterion. Current problems in the development and application of nonparametric criteria are discussed, including the difference between nominal and real significance levels, which makes it difficult to compare the power of criteria, and the need to take into account coincidences of sample values (from the point of view of the classical theory of mathematical statistics, the probability of coincidences is 0).
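The difference between the Student and Cramér-Welch criteria can be seen by computing both statistics on the same data. The sketch below assumes the usual textbook forms: a pooled variance estimate for Student, separate variance estimates for Cramér-Welch, and a normal approximation for the Cramér-Welch p-value. The samples are made-up numbers and the function names are hypothetical; the article's exact formulation may differ.

```python
from math import sqrt
from statistics import NormalDist, mean, variance

def student_t(x, y):
    """Two-sample Student statistic with a pooled variance estimate
    (assumes equal variances in the two populations)."""
    m, n = len(x), len(y)
    pooled = ((m - 1) * variance(x) + (n - 1) * variance(y)) / (m + n - 2)
    return (mean(x) - mean(y)) / sqrt(pooled * (1 / m + 1 / n))

def cramer_welch(x, y):
    """Cramér-Welch statistic: each sample keeps its own variance estimate."""
    m, n = len(x), len(y)
    return (mean(x) - mean(y)) / sqrt(variance(x) / m + variance(y) / n)

def two_sided_p_normal(stat):
    """Two-sided p-value from the standard normal approximation."""
    return 2 * (1 - NormalDist().cdf(abs(stat)))

# Hypothetical samples with clearly different spreads and sizes.
x = [10.1, 9.8, 10.3, 10.0, 9.9, 10.2]
y = [11.5, 8.0, 12.9, 9.1, 13.4, 7.7, 12.2, 10.8, 9.5]

t_student = student_t(x, y)
t_welch = cramer_welch(x, y)
p_welch = two_sided_p_normal(t_welch)

print(f"Student statistic:      {t_student:.3f}")
print(f"Cramér-Welch statistic: {t_welch:.3f} (normal-approximation p = {p_welch:.3f})")

# With equal sample sizes the two statistics coincide algebraically.
t_student_equal = student_t(x, y[:6])
t_welch_equal = cramer_welch(x, y[:6])
```

When the sample sizes are equal the two statistics are identical, so the choice between the criteria matters for unequal sizes combined with unequal variances, where the pooled estimate used by Student is no longer justified. The normal approximation is asymptotic and is rough for samples as small as these.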