Detecting Lung Cancer Trends by Leveraging Real-World and Internet-Based Data: Infodemiology Study (Preprint)

2019 ◽  
Author(s):  
Chenjie Xu ◽  
Hongxi Yang ◽  
Li Sun ◽  
Xinxi Cao ◽  
Yabing Hou ◽  
...  

BACKGROUND Internet search data on health-related terms can reflect people’s concerns about their health status in near real time, and hence serve as a supplementary metric of disease characteristics. However, studies using internet search data to monitor and predict chronic diseases at a geographically finer state-level scale are sparse. OBJECTIVE The aim of this study was to explore the associations of internet search volumes for lung cancer with published cancer incidence and mortality data in the United States. METHODS We used Google relative search volumes, which represent the search frequency of specific search terms in Google. We performed cross-sectional analyses of the original and disease metrics at both national and state levels. A smoothed time series of relative search volumes was created to eliminate the effects of irregular changes on the search frequencies and obtain the long-term trends of search volumes for lung cancer at both the national and state levels. We also performed analyses of decomposed Google relative search volume data and disease metrics at the national and state levels. RESULTS The monthly trends of lung cancer-related internet hits were consistent with the trends of reported lung cancer rates at the national level. Ohio had the highest frequency for lung cancer-related search terms. At the state level, the relative search volume was significantly correlated with lung cancer incidence rates in 42 states, with correlation coefficients ranging from 0.58 in Virginia to 0.94 in Oregon. Relative search volume was also significantly correlated with mortality in 47 states, with correlation coefficients ranging from 0.58 in Oklahoma to 0.94 in North Carolina. Both the incidence and mortality rates of lung cancer were correlated with decomposed relative search volumes in all states excluding Vermont. CONCLUSIONS Internet search behaviors could reflect public awareness of lung cancer. 
Research on internet search behaviors could be a novel and timely approach to monitor and estimate the prevalence, incidence, and mortality rates of a broader range of cancers and even more health issues.
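The smoothing and correlation steps described in the Methods can be sketched as follows. This is a minimal illustration, not the authors' procedure: the abstract does not name the smoothing method, so a centered moving average is assumed here, and the function names and sample values are hypothetical.

```python
from math import sqrt


def moving_average(values, window):
    """Centered moving average; positions where the window does not fit are dropped."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    return [
        sum(values[i - half:i + half + 1]) / window
        for i in range(half, len(values) - half)
    ]


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Hypothetical values, for illustration only (not data from the study).
yearly_rsv = [62, 70, 58, 66, 54, 60, 50]
yearly_incidence = [61.0, 60.2, 59.1, 58.3, 57.0, 56.1, 55.2]

smoothed_rsv = moving_average(yearly_rsv, window=3)
# Trim the incidence series so both series cover the same years.
aligned_incidence = yearly_incidence[1:-1]
print(round(pearson_r(smoothed_rsv, aligned_incidence), 3))
```

Dropping the end points keeps the example simple; a real analysis would need to decide how to handle the edges of the series and would also report a significance test for each coefficient.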

10.2196/16184 ◽  
2020 ◽  
Vol 22 (3) ◽  
pp. e16184 ◽  
Author(s):  
Chenjie Xu ◽  
Hongxi Yang ◽  
Li Sun ◽  
Xinxi Cao ◽  
Yabing Hou ◽  
...  



2021 ◽  
Author(s):  
Yingchao Yang ◽  
Xinyi Li ◽  
Qiang Ma ◽  
Zhihui Fu ◽  
Kaiming Su

Abstract Purpose: This study aimed to verify that adenoid hypertrophy (AH) and rhinosinusitis share similar epidemiologic patterns and that AH and allergic rhinitis (AR) are not related. Methods: Internet search engine query data from January 2011 to December 2019 were retrieved from the Baidu index. Monthly search volume was obtained in China for the following search terms in Chinese: “adenoid hypertrophy,” “rhinosinusitis,” and “allergic rhinitis”; the data obtained were then presented as percentages. Pearson’s and Spearman’s correlation coefficients were used to detect the correlation among the search volumes of AH, rhinosinusitis, and AR. We also collected search data from the first 5 months of 2020, when isolation measures were implemented in China due to the coronavirus disease 2019 epidemic. We then compared these data with those from the same period in 2019 to assess the differing effects of these measures on AH and AR. Results: A statistically significant correlation was found between the search variations of AH and rhinosinusitis during 2011–2019 (R=0.643, P<0.05). However, search variations of AH and AR were negatively related (R=-0.239, P<0.05). The average monthly search volume of AH and rhinosinusitis correlated well (R=0.836, P<0.01), but no correlation was found between AH and AR. The search volume of AH and rhinosinusitis during the first 5 months in 2020 decreased, whereas that of AR increased during January–February. Conclusions: AH and rhinosinusitis are epidemiologically related, whereas AH and AR are not correlated with each other.


Author(s):  
Lei Liu ◽  
Peng Wang ◽  
Su-Qin Jiang ◽  
Zi-Rong Zhong ◽  
Ting-Zheng Zhan ◽  
...  

Abstract Background This study aims to understand whether there is a seasonal change in internet search interest for Toxoplasma by using data derived from Google Trends (GT). Methods The present study retrieved the relative search volume (RSV) for the search term ‘Toxoplasma’ in GT within six major English-speaking countries (Australia and New Zealand [Southern Hemisphere]; Canada, Ireland, the UK and the USA [Northern Hemisphere]) from 1 January 2004 to 31 December 2019, utilizing the category of ‘health’. The RSV data for Toxoplasma were then analysed in R software using the ‘season’ package. Results There were significant seasonal patterns for the RSV of the search term ‘Toxoplasma’ in five countries (all p<0.05), the exception being the UK. A peak in December–March and a trough in July–September (Canada, Ireland, the UK and the USA) were observed, while a peak in June/August and a trough in December/February (Australia, New Zealand) were also found. Moreover, seasonal patterns in the RSV for ‘Toxoplasma’ were found in both the Southern and Northern Hemispheres (both p<0.05), with the meteorological months reversed. Conclusions Overall, our study revealed seasonal variation in search interest for Toxoplasma using internet search data from GT, providing additional evidence on seasonal patterns in Toxoplasma.


10.2196/18998 ◽  
2020 ◽  
Vol 22 (11) ◽  
pp. e18998
Author(s):  
Chenjie Xu ◽  
Zhi Cao ◽  
Hongxi Yang ◽  
Ying Gao ◽  
Li Sun ◽  
...  

Background As human society enters an era of vast and easily accessible social media, a growing number of people are using the internet to search for and exchange medical information. Because internet search data could reflect population interest in particular health topics, they provide a new way of understanding health concerns regarding noncommunicable diseases (NCDs) and the role they play in their prevention. Objective We aimed to explore the association of internet search data for NCDs with published disease incidence and mortality rates in the United States and to understand health concerns about NCDs. Methods We tracked NCDs by examining the correlations among the incidence rates, mortality rates, and internet searches in the United States from 2004 to 2017, and we established forecast models based on the relationship between the disease rates and internet searches. Results Incidence and mortality rates of 29 diseases in the United States were statistically significantly correlated with the relative search volumes (RSVs) of their search terms (P<.05). From the perspective of the goodness of fit of the multiple regression prediction models, the results were closest to 1 for diabetes mellitus, stroke, atrial fibrillation and flutter, Hodgkin lymphoma, and testicular cancer; the coefficients of determination of their linear regression models for predicting incidence were 80%, 88%, 96%, 80%, and 78%, respectively. Meanwhile, the coefficients of determination of their linear regression models for predicting mortality were 82%, 62%, 94%, 78%, and 62%, respectively. Conclusions An advanced understanding of search behaviors could augment traditional epidemiologic surveillance and could be used as a reference to aid in disease prediction and prevention.
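The coefficient of determination reported in the Results can be illustrated with a short sketch. It assumes a single predictor (one RSV series) and ordinary least squares, which is a simplification of the multiple regression models mentioned in the abstract; the function names and the numbers are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxx = sum((a - mean_x) ** 2 for a in x)
    if sxx == 0:
        raise ValueError("x must not be constant")
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """Share of the variance in y explained by the fitted line."""
    mean_y = sum(y) / len(y)
    ss_res = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    ss_tot = sum((b - mean_y) ** 2 for b in y)
    if ss_tot == 0:
        raise ValueError("y must not be constant")
    return 1 - ss_res / ss_tot


# Hypothetical yearly RSV and incidence per 100,000, for illustration only.
rsv = [40.0, 45.0, 52.0, 58.0, 63.0]
incidence = [210.0, 218.0, 231.0, 240.0, 251.0]

slope, intercept = fit_line(rsv, incidence)
print(f"R^2 = {r_squared(rsv, incidence, slope, intercept):.2f}")
```

A value of 0.80 here corresponds to the "80%" style of reporting used in the abstract.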


2008 ◽  
Vol 61 (1-2) ◽  
pp. 16-21 ◽  
Author(s):  
Natasa Maksimovic ◽  
Kyriakos Spanopoulos

Introduction. Lung cancer is the most common malignant tumour among men, and appears more and more frequently among women in many countries worldwide. The aims of this descriptive epidemiological study were to evaluate the mortality trends of all malignant tumours and lung cancer in Central Serbia from 1990 to 1999, and to estimate the incidence, mortality and the basic demographic characteristics of lung cancer in Central Serbia in 1999. Material and methods. The source of data concerning cancer cases in 1999 was the Cancer Registry of Central Serbia, while data of the Republic Statistics Institute were used for the analysis of mortality trends for the period 1990-1999. All rates were standardized by the direct method to the world standard population. Confidence intervals for mortality rates were calculated at the 95% level. The linear regression coefficient was assessed by Fisher's test. Results. The mortality rates showed rising tendencies for both lung cancer (y=-1876.26+0.96x, p=0.028 for men; y=654.78+0.33x, p=0.001 for women) and all malignant tumours (y=-4139.88+2.15x, p=0.163 for men; y=3649.68+1.88x, p=0.016 for women), with a statistically significant increase observed for all trends except all malignant tumours among men. In 1999, lung cancer ranked first among men and third among women, with 29.2% and 10.3% of cancer mortality, respectively. The age-specific mortality rates were much higher in men in all age groups. Mortality increased with age, and the highest rates were found in the age group 70-74 for both sexes. The highest incidence and mortality rates were reported in the Belgrade, Moravicki and Sumadijski districts.
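Direct standardization, as mentioned in the methods, weights each age-specific rate by the share of that age group in a standard population. The sketch below assumes rates expressed per 100,000 and uses made-up age groups and weights; it does not reproduce the world standard population or any figures from the study.

```python
def directly_standardized_rate(age_specific_rates, standard_population):
    """Weighted mean of age-specific rates, weighted by the standard population."""
    if len(age_specific_rates) != len(standard_population):
        raise ValueError("one rate is needed for every standard-population group")
    total = sum(standard_population)
    if total <= 0:
        raise ValueError("standard population must have a positive total")
    weighted = sum(r * p for r, p in zip(age_specific_rates, standard_population))
    return weighted / total


# Hypothetical example with three age groups (not real registry data).
rates_per_100k = [2.0, 40.0, 250.0]        # deaths per 100,000 in each age group
standard_weights = [60000, 30000, 10000]   # hypothetical standard population

print(directly_standardized_rate(rates_per_100k, standard_weights))
```

Because the same weights are applied to every population being compared, differences in the standardized rates are not driven by differences in age structure.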


2020 ◽  
pp. postgradmedj-2019-137407
Author(s):  
Yong-Jun Mei ◽  
Yan-Mei Mao ◽  
Fan Cao ◽  
Tao Wang ◽  
Zhi-Jun Li

Objective This study explored the changes of global public interest in internet search of ankylosing spondylitis (AS) based on Google Trends (GT) data, in order to reflect the characteristics of AS itself. Methods GT was used to obtain the search popularity scores of the term ‘AS’ on a global scale, between January 2004 and December 2018, under the ‘health’ classification. Based on the global search data of AS provided by GT, the cosinor analysis was used to test whether there was seasonality in AS. Results In general, AS related search volume demonstrated a decreasing trend from January 2004 to December 2014 and then remained stable from January 2015 to December 2018. No obvious seasonal variations were detected in AS related search volume (amplitude=1.54; phase: month=3.9; low point: month=9.9; p>0.025), which peaked in April and bottomed out in October. The top 17 rising topics were adalimumab, spondylolisthesis, Morbus, Vladimir Mikhailovich Bekhterev, autoimmune disease, rheumatoid arthritis, ankylosis, HLA-B27 positive, Crohn’s disease, rheumatology, spondylosis, arthritis, uveitis, rheumatism, sacroiliac, psoriatic arthritis and spondylitis. Conclusions Globally, there is no significant seasonal variation in GT for AS. The top fast-growing topics related to AS may be beneficial for doctors to provide targeted health education of the disease to patients and their families.
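The amplitude and phase reported above come from a cosinor model, which fits a single cosine wave with a 12-month period to the series. The sketch below is a simplified illustration under two stated assumptions: the series is monthly, evenly spaced, and covers a whole number of years (so the closed-form estimates below equal the least-squares fit), and no significance test is performed. The function name and the synthetic series are hypothetical.

```python
from math import atan2, cos, hypot, pi, sin


def cosinor_monthly(values):
    """Fit y = mesor + amplitude * cos(2*pi*(t - peak)/12) to monthly data.

    t is the 0-based month index of each observation. Returns
    (mesor, amplitude, peak), where peak is in [0, 12) on the same month scale.
    """
    n = len(values)
    if n == 0 or n % 12 != 0:
        raise ValueError("series must cover a whole number of years")
    omega = 2 * pi / 12
    mesor = sum(values) / n
    # With complete cycles the cosine and sine regressors are orthogonal,
    # so each coefficient has a closed form.
    beta = 2 / n * sum(v * cos(omega * t) for t, v in enumerate(values))
    gamma = 2 / n * sum(v * sin(omega * t) for t, v in enumerate(values))
    amplitude = hypot(beta, gamma)
    peak = (atan2(gamma, beta) / omega) % 12
    return mesor, amplitude, peak


# Synthetic two-year series that peaks at month index 3, for illustration only.
series = [50 + 10 * cos(2 * pi * (t - 3) / 12) for t in range(24)]
mesor, amplitude, peak = cosinor_monthly(series)
print(round(mesor, 2), round(amplitude, 2), round(peak, 2))
```

The low point of the fitted wave lies six months after the peak, which matches the relationship between the phase (3.9) and low point (9.9) quoted in the abstract.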


Author(s):  
Ourania S. Kotsiou ◽  
Vaios S. Kotsios ◽  
Konstantinos I. Gourgoulianis

Background: The Greek National Health System (NHS) has been profoundly affected by the synergy of the economic and refugee crises. We aimed at evaluating the public interest regarding refugee and healthcare issues in Greece. Methods: Google Trends was employed to normalize traffic data on a scale from 0 to 100, presented as monthly relative search volume (RSV) for the search terms “refugees”, “health”, “diseases”, “hospital”, and “economic crisis” in Greece, for the period from 2008 to 2020. Cross-country comparisons in selected European countries were made. Results: The analysis of RSV data showed an upward trend for the keyword “refugee” in Greece in the last five years, with two remarkable peaks from 2015 to 2016 and from 2019 to the present. Interest regarding refugees was more prevalent in the Aegean islands compared to the mainland. The mass influx of refugees has been linked to disease-related concerns. The search terms “hospital” and “health” have been the most popular and consistently searched topics since the beginning of the economic crisis in Greece in 2009. Similar trends existed across Europe. Conclusion: There is an urgent need for effective public awareness of current politico-ethical and social-economic conditions. The patterns of public interest can formulate public policy.
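The 0 to 100 scale mentioned in the Methods can be illustrated with a small rescaling sketch. This only shows the final step, in which the highest point of a series is set to 100; Google Trends also adjusts for total search volume and works from sampled data, which is not modelled here. The function name and input values are hypothetical.

```python
def to_relative_scale(interest):
    """Rescale a series so that its maximum becomes 100 (rounded to integers)."""
    peak = max(interest)
    if peak <= 0:
        raise ValueError("series must contain at least one positive value")
    return [round(100 * value / peak) for value in interest]


# Hypothetical monthly search-interest shares, for illustration only.
monthly_interest = [0.002, 0.004, 0.008, 0.006]
print(to_relative_scale(monthly_interest))
```

Because each series is rescaled to its own maximum, RSV values describe relative interest over time and cannot be read as absolute search counts.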

