Background and Aim
Recently, multimodal representation learning, which integrates images with other modalities such as numerical or language data, has gained much attention. The aim of the current study was to analyze the diagnostic performance of a deep multimodal representation model that integrates tumor images, patient background, and blood biomarkers for the differentiation of liver tumors observed using B‐mode ultrasonography (US).
Method
First, we applied supervised learning with a convolutional neural network (CNN) to 972 liver nodules in the training and development sets to develop a predictive model using segmented B‐mode tumor images. We then applied a deep multimodal representation model to integrate information about patient background and blood biomarkers with the B‐mode images. Finally, we evaluated the performance of the models in an independent test set of 108 liver nodules.
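The fusion step described above can be sketched as follows. This is an illustrative toy example, not the authors' implementation: the image encoder, embedding dimension, feature values, and classifier weights are all arbitrary placeholders. It shows the common pattern of concatenating a CNN-style image embedding with tabular clinical features (here, the six variables named in the abstract) before a final classification layer.

```python
# Toy sketch of multimodal fusion: image embedding + tabular clinical
# features, concatenated and fed to a linear classifier with a sigmoid.
# All weights and inputs are random placeholders, not study data.
import numpy as np

rng = np.random.default_rng(0)

def image_embedding(image):
    """Stand-in for a CNN encoder: reduce a 64x64 B-mode image to a
    fixed-size vector. A real model would use learned convolutions;
    here we simply average-pool 8x8 patches into a 64-dim embedding."""
    pooled = image.reshape(8, 8, 8, 8).mean(axis=(1, 3))
    return pooled.ravel()

def fuse_and_classify(image, tabular, w, b):
    """Concatenate the image embedding with the tabular features, then
    apply a linear layer and a sigmoid to get a class probability."""
    z = np.concatenate([image_embedding(image), tabular])
    logit = z @ w + b
    return 1.0 / (1.0 + np.exp(-logit))

image = rng.random((64, 64))  # segmented B-mode tumor image (toy)
# age, sex (0/1), AST, ALT, platelet count, albumin -- placeholder values
tabular = np.array([62.0, 1.0, 45.0, 38.0, 180.0, 3.9])
w = rng.normal(scale=0.05, size=64 + 6)  # 64 image dims + 6 tabular dims
prob = fuse_and_classify(image, tabular, w, 0.0)
```

In practice the image and tabular branches would each pass through learned layers before fusion, and the whole network would be trained end to end; concatenation is only the simplest of several possible fusion strategies.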
Results
Using only the segmented B‐mode images, the diagnostic accuracy and area under the curve (AUC) were 68.52% and 0.721, respectively. As information about patient background and blood biomarkers was integrated, the diagnostic performance increased in a stepwise manner. The diagnostic accuracy and AUC of the full multimodal deep learning model (which integrated the B‐mode tumor image with patient age, sex, aspartate aminotransferase, alanine aminotransferase, platelet count, and albumin data) reached 96.30% and 0.994, respectively.
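The two metrics reported above can be computed from predicted probabilities as sketched below. The labels and scores are made-up toy values, not the study's data; the AUC is computed via the Mann-Whitney U formulation rather than a library call, to keep the example self-contained.

```python
# Diagnostic accuracy and AUC from predicted probabilities (toy data).
import numpy as np

def accuracy(y_true, y_prob, threshold=0.5):
    """Fraction of cases where thresholded probability matches the label."""
    return np.mean((y_prob >= threshold) == y_true)

def auc(y_true, y_prob):
    """Mann-Whitney U formulation of AUC: the probability that a randomly
    chosen positive scores higher than a randomly chosen negative
    (ties count half)."""
    pos = y_prob[y_true == 1]
    neg = y_prob[y_true == 0]
    greater = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (greater + 0.5 * ties) / (len(pos) * len(neg))

y_true = np.array([1, 1, 1, 0, 0, 0])          # toy labels
y_prob = np.array([0.9, 0.8, 0.4, 0.6, 0.2, 0.1])  # toy model outputs
print(accuracy(y_true, y_prob))  # → 0.666...
print(auc(y_true, y_prob))       # → 0.888...
```

An equivalent result comes from `sklearn.metrics.roc_auc_score`; the hand-rolled version above is only meant to make the definition explicit.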
Conclusion
Integration of patient background and blood biomarkers with US images using multimodal representation learning outperformed the CNN model trained on US images alone. We expect that the deep multimodal representation model could be a feasible and acceptable tool for the definitive diagnosis of liver tumors using B‐mode US.