Algorithm about selection of the characteristic element in a clustering process's set

Authors

  • Trofimov B.I. Kuban State University, Krasnodar, Российская Федерация
  • Koltsov Yu.V. Kuban State University, Krasnodar, Российская Федерация
  • Garnaga V.V. Kuban State University, Krasnodar, Российская Федерация

UDC

004.424.4, 004.93.14, 004.021, 0

Abstract

People use classification for objects organization into groups since ancient times. In one of his articles Robert Sokal notes that classification is high level of intellectual activity and it helps to understand the nature. Clustering is result of software algorithms applying to classification. This approach allows deploying data mining to classified information. The article describes an algorithm for a cluster characteristic element selection and its formal requirements definition. One of areas for the algorithm’s applying is intellectual text search systems. A main purpose of the article is description of an algorithm for characteristic element selection. The algorithm should have less asymptotic estimate operating time than enumeration of all elements. A main idea based on the classical method of branches and borders. An original part of the algorithm is errors estimates comparison for selected characteristic element. Also, the article describes two algorithms for random test data generation. Showed results of these tests illustrate and explain advantages of the main algorithm in comparison with the enumeration algorithm. An empirical assessment of the proposed algorithm convergence demonstrates its better efficiency. We plan to use the article results in intellectual text search area. Clustering and neural networks are main approaches used in this area.

Keywords:

branch and bound method, text search, graph models, Damerau-Lowenstein metric

Acknowledgement

Работа выполнена при поддержке РФФИ (13-01-00807).

Author Infos

Bogdan I. Trofimov

аспирант кафедры информационных технологий Кубанского государственного университета

e-mail: bogdan.i.trofimov@mail.ru

Yuriy V. Koltsov

канд. физ.-мат. наук, заведующий кафедрой информационных технологий Кубанского государственного университета

e-mail: dean@fpm.kubsu.ru

Valeriy V. Garnaga

канд. физ.-мат. наук, доцент кафедры информационных технологий Кубанского государственного университета

e-mail: Garnaga.Valeriy@fpm.kubsu.ru

References

  1. Sokel R.R. Klaster-analiz i klassifikatsiya: predposylki i osnovnye napravleniya [Cluster analysis and classification: the background and the main directions]. In Dzh. Ven Rayzina (Ed.). Klassifikatsiya i klaster [Classification and cluster]. Moscow, Mir Publ., 1980, pp. 7-19. (In Russian)
  2. Levenshteyn V. I. Dvoichnye kody s ispravleniem vypadeniy, vstavok i zameshcheniy simvolov [Binary codes with correction for deletions, insertions and substitutions of characters]. Doklady Akademii nauk SSSR [Rep. of the Academy of Sciences of the USSR], 1965, vol. 163, no. 4, pp. 845-848. (In Russian)
  3. Land A.H., Doig A.G. An automatic method of solving discrete programming problems. Econometrica, 1960, pp. 497-520. doi: 10.1.1.308.7332.
  4. Bard G.V. Spelling-error tolerant, order-independent pass-phrases via the Damerau-Levenshtein string-edit distance metric. Proc. of ACSW '07 Proceedings of the Fifth Australasian symposium on ACSW frontiers, 2007, vol. 68, pp. 117-124.
  5. Garnaga V.V., Kol'tsov Yu.V., Trofimov B.I. Postroenie mekhanizma neyrosetevogo poiska na osnove algoritma rasshiryayushchegosya neyronnogo gaza [The construction of the mechanism of neural network based search algorithm expanding neural gas]. Izvestiya vuzov. Severo-Kavkazskiy region. Tekhnicheskie nauki [Proc. of the universities. North Caucasus region. Technical science], 2014, iss. 6, pp. 12-17. doi: 10.17213/0321-2653-2014-6-12-17

Issue

Pages

69-74

Submitted

2015-10-19

Published

2015-12-28

How to Cite

Trofimov B.I., Koltsov Yu.V., Garnaga V.V. Algorithm about selection of the characteristic element in a clustering process's set. Ecological Bulletin of Research Centers of the Black Sea Economic Cooperation, 2015, no. 4, pp. 69-74. (In Russian)