Details
Title | Analysis of User Reviews on Medical Institutions’ Services Aggregated on the Yandex-Maps Website with the Use of Machine Learning Algorithms: master's graduation qualification work: field of study 45.04.04 “Intelligent Systems in the Humanities”; educational program 45.04.04_01 “Digital Linguistics (International Educational Program)” |
---|---|
Creators | Белкин Антон Дмитриевич |
Scientific adviser | Коган Марина Самуиловна |
Organization | Peter the Great St. Petersburg Polytechnic University. Institute of Humanities |
Imprint | Saint Petersburg, 2024 |
Collection | Graduation qualification works; General collection |
Subjects | sentiment analysis; topic modelling; automatic text processing; computational linguistics; NLP; web scraping; polyclinic patients feedback |
Document type | Master graduation qualification work |
File type | |
Language | Russian |
Level of education | Master |
Speciality code (FGOS) | 45.04.04 |
Speciality group (FGOS) | 450000 - Linguistics and Literary Studies |
DOI | 10.18720/SPBPU/3/2024/vr/vr24-5802 |
Rights | Password-protected access from the Internet (reading, printing, copying) |
Additionally | New arrival |
Record key | ru\spstu\vkr\33250 |
Record create date | 8/29/2024 |
The target of the research is user reviews of medical institutions’ services aggregated on the “Yandex-Maps” website. The scope of the research covers methods for classifying natural language texts by sentiment and methods for modelling the major topics of these texts. The objective of the study is to collect data for training a chosen sentiment analysis algorithm and for modelling the main topics of user reviews. The research tasks are: 1) to collect and analyze scientific literature and technical documentation on the thesis topic; 2) to develop and implement a program for extracting user reviews aggregated on the “Yandex-Maps” website; 3) to build, analyze, and preprocess a corpus of user reviews of medical institutions’ services; 4) to train ML models on the compiled corpus and identify the most effective one for sentiment analysis; 5) to train a topic modelling algorithm on the compiled corpus; 6) to analyze the results of sentiment analysis, the results of topic modelling, and the key topics obtained. The research methods include the descriptive, comparative, and statistical methods, experiment, and machine learning. As a result of the study, a program for automatically collecting user reviews from the “Yandex-Maps” website was created, and a corpus of Russian-language reviews of medical facilities was collected. Machine learning algorithms for sentiment analysis and algorithms for topic modelling were trained and compared. Analysis of user reviews makes it possible to identify the main advantages and disadvantages of a product or service in any domain.
Network | User group | Action |
---|---|---|
ILC SPbPU Local Network | All | |
Internet | Authorized users SPbPU | |
Internet | Anonymous | |