We represent theoretical foundations for similarity-based machine learning system construction. The key technique is Formal Concept Analysis, a modern branch of Lattice Theory. We introduce bitset encoding algorithms for objects described by both discrete and continuous attributes. Then we discuss Markov chain Monte Carlo method. After presentations of main steps of machine learning we provide a result on sufficient number of hypotheses to generate. We conclude with discussion of results of experimental approbation of our approach with respect to several datasets from UCI Machine Learning repository.
lattice, FCA, JSM-method, bitset, machine learning.
1. Finn, V.K.: J.S. Mill’s inductive methods in artificial intelligence systems. Part I. Sci. Tech.Inf. Proc. 38, 385–402 (2011).
2. Finn, V.K.: J.S. Mill’s inductive methods in artificial intelligence systems. Part II. Sci. Tech.Inf. Proc. 39, 241–260 (2012).
3. Kuznetsov S.O.: On Computing the Size of a Lattice and Related Decision Problems. Order. 18(4), 313-321 (2001).
4. Vinogradov, D.V.: Machine Learning Based on Similarity Operation. In: Kuznetsov S., Osipov G., Stefanuk V. (eds) Artificial Intelligence. RCAI 2018. Communications in Computer and Information Science. 934, 46-59 (2018).
5. Ganter, B., Wille, R.: Formal Concept Analysis. Springer, Berlin (1999).
6. Vinogradov, D.V.: Accidental formal concepts in the presence of counterexamples. In: Proceedings of International Work-shop on Formal Concept Analysis for Knowledge Discovery (FCA4KD 2017): CEUR Workshop Proceedings. 1921, 104-112 (2017).
7. UCI Machine Learning Repository: Mushroom Data Set, https://archive.ics.uci.edu/ml/datasets/Mushroom. Last accessed 20 July 2020.
8. UCI Machine Learning Repository: Wine Quality Data Set, https://archive.ics.uci.edu/ml/datasets/Wine+Quality. Last accessed 20 July 2020.
9. Vinogradov, D.V. On Object Representation by Bit Strings for the VKF-Method // Automatic Documentation and Mathematical Linguistics, Vol. 52, Issue 3, pp. 113-116 (2018).
10. Quinlan, J.R.: C4.5 Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993).