Skip navigation

An improved double channel long short-term memory model for medical text classification

An improved double channel long short-term memory model for medical text classification

Liang, Shengbin, Chen, Xinan ORCID logoORCID: https://orcid.org/0000-0001-7641-3897, Ma, Jixin ORCID logoORCID: https://orcid.org/0000-0001-7458-7412, Du, Wencai ORCID logoORCID: https://orcid.org/0000-0003-0428-0057, Ma, Huawei ORCID logoORCID: https://orcid.org/0000-0002-7432-5583 and Li, Xingwang (2021) An improved double channel long short-term memory model for medical text classification. Journal of Healthcare Engineering, 2021:6664893. pp. 1-8. ISSN 2040-2295 (Print), 2040-2309 (Online) (doi:10.1155/2021/6664893)

[thumbnail of Publisher VoR]
Preview
PDF (Publisher VoR)
36270_MA_An_improved_double_channel.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview

Abstract

There are a large number of symptom consultation texts in medical and healthcare Internet communities, and Chinese health segmentation is more complex, which leads to the low accuracy of the existing algorithms for medical text classification. The deep learning model has advantages in extracting abstract features of text effectively. However, for a large number of samples of complex text data, especially for words with ambiguous meanings in the field of Chinese medical diagnosis, the word-level neural network model is insufficient. Therefore, in order to solve the triage and precise treatment of patients, we present an improved Double Channel (DC) mechanism as a significant enhancement to Long Short-Term Memory (LSTM). In this DC mechanism, two channels are used to receive word-level and char-level embedding, respectively, at the same time. Hybrid attention is proposed to combine the current time output with the current time unit state and then using attention to calculate the weight. By calculating the probability distribution of each timestep input data weight, the weight score is obtained, and then weighted summation is performed. At last, the data input by each timestep is subjected to trade-off learning to improve the generalization ability of the model learning. Moreover, we conduct an extensive performance evaluation on two different datasets: cMedQA and Sentiment140. The experimental results show that the DC-LSTM model proposed in this paper has significantly superior accuracy and ROC compared with the basic CNN-LSTM model.

Item Type: Article
Additional Information: Special issue on "Artificial Intelligence in E-Healthcare and M-Healthcare."
Uncontrolled Keywords: medical text classification
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
R Medicine > RC Internal medicine
Faculty / School / Research Centre / Research Group: Faculty of Engineering & Science
Faculty of Engineering & Science > School of Computing & Mathematical Sciences (CMS)
Related URLs:
Last Modified: 17 May 2022 08:34
URI: http://gala.gre.ac.uk/id/eprint/36270

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics