Details

Title Analysis of User Reviews on Medical Institutions’ Services Aggregated on the Yandex-Maps Website with the Use of Machine Learning Algorithms: выпускная квалификационная работа магистра: направление 45.04.04 «Интеллектуальные системы в гуманитарной среде» ; образовательная программа 45.04.04_01 «Цифровая лингвистика (международная образовательная программа)/Digital Linguistics (International Educational Program)»
Creators Белкин Антон Дмитриевич
Scientific adviser Коган Марина Самуиловна
Organization Санкт-Петербургский политехнический университет Петра Великого. Гуманитарный институт
Imprint Санкт-Петербург, 2024
Collection Выпускные квалификационные работы; Общая коллекция
Subjects sentiment analysis; topic modelling; automatic text processing; computational linguistics; NLP; web scraping; polyclinic patients feedback
Document type Master graduation qualification work
File type PDF
Language Russian
Level of education Master
Speciality code (FGOS) 45.04.04
Speciality group (FGOS) 450000 - Языкознание и литературоведение
DOI 10.18720/SPBPU/3/2024/vr/vr24-5802
Rights Доступ по паролю из сети Интернет (чтение, печать, копирование)
Additionally New arrival
Record key ru\spstu\vkr\33250
Record create date 8/29/2024

Allowed Actions

Action 'Read' will be available if you login or access site from another network

Action 'Download' will be available if you login or access site from another network

Group Anonymous
Network Internet

The target of the research are user reviews on medical institutions’ services aggregated on the “Yandex-Maps” website. The scope of the research are methods for classifying natural language texts by sentiment and methods for modeling major topics of these texts. The objective of the study is to collect data to train a chosen sentiment analysis algorithm and model the main topics of user reviews. The tasks of the research: 1) To collect and analyze the scientific literature and technical documentation on the master thesis topic; 2) To develop and implement a program for extracting user reviews aggregated on the Yandex-Map website; 3) To build a corpus of user reviews on medical institutions’ services, to analyze it and preprocess it; 4) To train the ML models on the compiled corpus and identify the most effective one for the task of sentiment analysis; 5) To train the topic modelling algorithm on the compiled corpus; 6) To analyze the results of topic modelling and obtained key topics and the results of sentiment analysis. The research methodologies include descriptive method, comparative method, statistical method, experiment, and machine learning method. As a result of the study a program for automatic collection of user reviews from the “Yandex-Maps” website was created, and a corpus of reviews to medical facilities in Russian was collected. Machine learning algorithms for sentiment analysis and algorithms for topic modelling were trained and compared. Analysis of user reviews provides an opportunity to identify the main disadvantages and advantages concerning a product or service in any domain.

Network User group Action
ILC SPbPU Local Network All
Read Print Download
Internet Authorized users SPbPU
Read Print Download
Internet Anonymous

Access count: 0 
Last 30 days: 0

Detailed usage statistics