测绘学报(英文版) ›› 2022, Vol. 5 ›› Issue (2): 38-48.doi: 10.11947/j.JGGS.2022.0205

• • 上一篇    下一篇

  

  • 接受日期:2022-04-27 出版日期:2022-06-20 发布日期:2022-07-22

Data Mining and Spatial Analysis of Social Media Text Based on the BERT-CNN Model to Achieve Situational Awareness: a Case Study of COVID-19

Jiawei ZHANG1,2(),Hua QI1()   

  1. 1. Faculty of Geosciences and Environmental Engineering, Southwest Jiaotong University, Chengdu 611756, China
    2. China Railway First Survey and Design Institute Group Co., Ltd., Xi’an 710043, China
  • Accepted:2022-04-27 Online:2022-06-20 Published:2022-07-22
  • Contact: Hua QI E-mail:zjw_giser@126.com;qi-3dgis@126.com
  • About author:Jiawei ZHANG (1996—), male, master, majors in GIS. E-mail: zjw_giser@126.com
  • Supported by:
    Science & Technology Department of Sichuan Province(21ZDYF2090)

Abstract:

In response to the COVID-19, social media big data has played an important role in epidemic warning, tracking the source of infection, and public opinion monitoring, providing strong technical support for China’s epidemic prevention and control work. The paper used Sina Weibo posts related to COVID-19 hashtags as the data source, and built a BERT-CNN deep learning model to perform fine-grained and high-precision topic classificationon massive social media posts. Taking Shenzhen as a region of interest, we mined the “epidemic data bulletin” and “daily life impact” posts during the epidemic for spatial analysis. The results show that the confirmed communities and designated hospitals in Shenzhen as a whole present the characteristics of “sparse east and dense west”, and there is a strong positive spatial correlation between the number of confirmed cases and social media response. Specifically, Nanshan District, Futian District and Luohu District have more confirmed cases due to large population movements and dense transportation networks, and social media has responded more violently, and people’s lives have been greatly affected. However, Yantian District, Pingshan District and Dapeng New District showed opposite characteristics. The case study results further show that using deep learning methods to mine text information in social media is scientifically feasible for improving situational awareness and decision support during the COVID-19.

Key words: COVID-19, Sina Weibo, BERT-CNN, topic classification, situational awareness