Development of the algorithm of keyword search in the Kazakh language text corpus

doi:10.15587/1729-4061.2019.179036

Open AccessJournal Article10.15587/1729-4061.2019.179036

Development of the algorithm of keyword search in the Kazakh language text corpus

Akerke Akanova, +3 more

- 24 Sep 2019

- Eastern-European Journal of Enterprise T...

- Vol. 5, pp 26-32

3

TL;DR: The developed algorithm involves solving one of the problems of effective semantic analysis of the text in the Kazakh language and is used to develop a neurocomputer system that will automatically check the text works of online learners.

Abstract: The issue of semantic text analysis occupies a special place in computational linguistics. Researchers in this field have an increased interest in developing an algorithm that will improve the quality of text corpus processing and probabilistic determination of text content. The results of the study on the application of methods, approaches, algorithms for semantic text analysis in computational linguistics in international and Kazakhstan science led to the development of an algorithm of keyword search in a Kazakh text. The first step of the algorithm was to compile a reference dictionary of keywords for the Kazakh language text corpus. The solution to this problem was to apply the Porter (stemmer) algorithm for the Kazakh language text corpus. The implementation of the stemmer allowed highlighting unique word stems and getting a reference dictionary, which was subsequently indexed. The next step is to collect learning data from the text corpus. To calculate the degree of semantic proximity between words, each word is assigned a vector of the corresponding word forms of the reference dictionary, which results in a pair of a keyword and a vector. And the last step of the algorithm is neural network learning. During learning, the error backpropagation method is used, which allows a semantic analysis of the text corpus and obtaining a probabilistic number of words close to the expected number of keywords. This process automates the processing of text material by creating digital learning models of keywords. The algorithm is used to develop a neurocomputer system that will automatically check the text works of online learners. The uniqueness of the keyword search algorithm is the use of neural network learning for texts in the Kazakh language. In Kazakhstan, scientists in the field of computational linguistics conducted a number of studies based on morphological analysis, lemmatization and other approaches and implemented linguistic tools (mainly translation dictionaries). The scope of neural network learning for parsing of the Kazakh language remains an open issue in the Kazakhstan science. The developed algorithm involves solving one of the problems of effective semantic analysis of the text in the Kazakh language

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.15587/1729-4061.2022.263421

Development of a thematic and neural network model for data learning

Akerke Аkanova, +4 more

- 31 Aug 2022

- Eastern-European Journal of Enterprise T...

TL;DR: The experimental results showed that the use of deep neural networks gives the expected results of the quality of the LDA model in the processing of the Kazakh language.

...read moreread less

5

•Journal Article•10.21533/PEN.V9I3.2109.G866

The role of artificial intelligence technologies in long-term socio-economic development and integrated security

Sergey Grinyaev, +4 more

- 03 Jul 2021

- Periodicals of Engineering and Natural S...

TL;DR: The object of the paper is to study strategic documents that determine the prospects for the development of artificial intelligence technologies, primarily in the largest economies of the world to determine the contours of global socio-economic and technological development.

...read moreread less

4

•Journal Article•10.21533/PEN.V9I3.2113

Ancient theater architecture as an element of the world cultural landscape

Tatiana V. Portnova

- 03 Jul 2021

- Periodicals of Engineering and Natural S...

TL;DR: In this article, the history of the development of ancient theatre architecture in the context of the environment, which forms a territory that acquires the status of a cultural landscape, was examined.

...read moreread less

1

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

Journal Article•10.1162/NECO.2006.18.7.1527

A fast learning algorithm for deep belief nets

Geoffrey E. Hinton, +2 more

- 01 Jul 2006

- Neural Computation

TL;DR: A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.

...read moreread less

18.3K

Proceedings Article•10.1145/2647868.2654889

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 03 Nov 2014

TL;DR: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

14.9K

•Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014

- arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe as discussed by the authors is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

13.1K

•Journal Article•10.1023/A:1009976227802

Learning Algorithms for Keyphrase Extraction

Peter D. Turney

- 21 May 2000

- Information Retrieval

TL;DR: In this paper, the problem of automatically extracting keyphrases from text is treated as a supervised learning task, where the learning algorithm must learn to classify as positive or negative examples of key phrases.

...read moreread less

946