bims-librar Biomed News
on Biomedical librarianship
Issue of 2020‒06‒28
twenty-two papers selected by
Thomas Krichel
Open Library Society


  1. J Med Internet Res. 2020 Jun 26.
    Eysenbach G.
      UNSTRUCTURED: In this issue of the Journal of Medical Internet Research, the World Health Organization (WHO) is presenting a framework for managing the coronavirus disease (COVID-19) infodemic. Infodemiology is now acknowledged by public health organizations and the WHO as an important emerging scientific field and critical area of practice during a pandemic. From the perspective of being the first "infodemiolgist" who originally coined the term almost two decades ago, I am positing four pillars of infodemic management: (1) information monitoring (infoveillance); (2) building eHealth Literacy and science literacy capacity; (3) encouraging knowledge refinement and quality improvement processes such as fact checking and peer-review; and (4) accurate and timely knowledge translation, minimizing distorting factors such as political or commercial influences. In the current COVID-19 pandemic, the United Nations has advocated that facts and science should be promoted and that these constitute the antidote to the current infodemic. This is in stark contrast to the realities of infodemic mismanagement and misguided upstream filtering, where social media platforms such as Twitter have advertising policies that sideline science organizations and science publishers, treating peer-reviewed science as "inappropriate content."
    DOI:  https://doi.org/10.2196/21820
  2. . 2020 Sep;46(5): 102187
    Teixeira da Silva JA.
      The COVID-19 pandemic, which has led to a flood of papers and preprints, has placed multiple challenges on academic publishing, the most obvious one being sustained integrity under the pressure to publish quickly. There are risks of this high volume-to-speed ratio. Many letters, editorials, and supposedly "peer reviewed" papers in ranked and indexed journals were published in a matter of days, suggesting that peer review was either fleeting or non-existential, or that papers were rapidly approved by editors based on their perceived interest and topicality, rather than on their intrinsic academic value. In academic publishing circles, the claim of "peer review", when in fact it has not been conducted, is a core characteristic of "predatory publishing", and is also a "fake" element that may undermine efforts in recent years to build trust in science's budding serials crisis. While the world is still centrally focused on COVID-19, the issue of "predatory publishing" is being ignored, or not being given sufficient attention. The risks to the scholarly community, academic publishing and ultimately public health are at stake when exploitative and predatory publishing are left unchallenged.
    Keywords:  Editorial oversight; Fake; Integrity; Misinformation; Peer review; Post-publication peer review
    DOI:  https://doi.org/10.1016/j.acalib.2020.102187
  3. Stud Health Technol Inform. 2020 Jun 16. 270 448-452
    Ammar N, Bailey JE, Davis RL, Shaban-Nejad A.
      Traditionally, health data management has been EMR-based and mostly handled by health care providers. Mechanisms are needed to give patients more control over their health conditions. Personal Health Libraries (PHLs) provide a single point of secure access to patients' digital health information that can help empower patients to make better-informed decisions about their health care. This paper reports a work-in-progress on leveraging tools and methods from artificial intelligence and knowledge representation to build a private, decentralized PHL that supports interoperability and, ultimately, true care integration. We demonstrate how a social application querying such a decentralized PHL can deliver a tailored push notification intervention focused on improving self-care behaviors in diabetic adults from medically underserved communities.
    Keywords:  Personal health library; digital health; interoperability; privacy
    DOI:  https://doi.org/10.3233/SHTI200200
  4. Stud Health Technol Inform. 2020 Jun 16. 270 813-817
    Hollis KF, Roberts K, Bedrick S, Hersh WR.
      The Text REtrieval Conference (TREC), co-sponsored by the National Institute of Standards and Technology (NIST) in the US and US Department of Defense, was started in 1992. TREC's purpose is to support research within the information retrieval community by providing the infrastructure necessary for large-scale evaluation of text retrieval methodologies. In 2017, the TREC Precision Medicine (Roberts et al., 2017) track grew from the Clinical Decision Support track and focused on a narrower problem domain of precision oncology. After three years of computer runs being evaluated for relevance by physician readers, we provide a unique perspective of how to evaluate computer-generated articles and clinical trials pulled from PubMed and Clinicaltrials.gov to find relevant information on medical cases.
    Keywords:  Precision medicine; information retrieval
    DOI:  https://doi.org/10.3233/SHTI200274
  5. Health Inf Manag. 2020 Jun 26. 1833358320926422
    McGrath N, Foley B, Hurley C, Ryan M, Flynn R.
      Safe and reliable healthcare depends on access to health information that is accurate, valid, reliable, timely, relevant, legible and complete. National data collections are repositories of health and social care data and play a crucial role in healthcare planning and clinical decision-making. We describe the development of an evidence-informed multi-method quality improvement program aimed to improve the quality of health and social care data in Ireland. Specific components involved: development of guidance to support implementation of health information standards; review program to assess compliance with standards; and educating health information stakeholders about health data and information quality. Observations from implementation of the program indicate enhanced health information stakeholder awareness of, and increased adoption of information management standards. The methodology used in the review program has proved to be a robust approach to identify areas of good practice and opportunities for improvement in information management practices. There has been positive adoption of the program among organisations reviewed and acceptance of the proposed recommendations. Early indications are that this multi-method approach will drive improvements in information management practices, leading to an improvement in health and social care data quality in Ireland. Aspects of this approach may be adapted to meet the needs of other countries.
    Keywords:  data management; health information management; health information systems; health policy; information management; information management standards; quality improvement
    DOI:  https://doi.org/10.1177/1833358320926422
  6. Spine Deform. 2020 Jun 23.
    Truumees D, Duncan A, Mayer EK, Geck M, Singh D, Truumees E.
      BACKGROUND CONTEXT: Patients and their families are increasingly turning to the internet for medical information. Most of these patients believe the information to be accurate and reliable. However, the quality and accuracy of that information on the internet is variable and unregulated. Accurate and applicable information may align patients' expectations and improve satisfaction and overall outcomes.PURPOSE: This study aimed to evaluate the quality and accuracy of scoliosis-related information available on the internet.
    STUDY DESIGN: Cross-sectional analysis.
    METHODS: Independent searches were conducted on the three most commonly accessed search engines (Google, Yahoo and Bing) using the keyword "scoliosis". The top 30 sites from each search engine were reviewed. Each website was categorized as per its authorship and sourcing. Each site was then evaluated for accuracy, readability and with quality-assurance markers such as Health on the Net code (HONcode), DISCERN, Journal of American Medical Association (JAMA) systems. The accuracy of the website was assessed by two fellowship-trained spine specialists. Website accuracy was rated on a scale of 1-4 based on the level of agreement with the information presented. Websites with greater than 75% agreement were rated 4. Finally, the sites were evaluated with a validated website distraction index and assessed for mobile friendliness.
    RESULTS: 42 unique websites were analyzed. 31% of the sites were categorized as academic (13 academic, 9 healthcare system, 12 health news outlets and 8 unspecified) and had the highest rate of physician authorship (54%). Accuracy ranged from less than 25% to more than 75% were recorded with a mean accuracy of 3 signifying 50-75% agreement. Academic sites had the highest scoliosis specific accuracy score (P < 0.05). Overall, average readability was at a 12th grade reading level. More than 90% of the sites were mobile friendly. Approximately 71% of the websites did not have HONcode certification, although health news outlets had the highest rate of certification (67%). There was a significant effect of HONcode certification on DISCERN, JAMA, grade level and reading ease. On average, HONcode certified websites had lower grade level readability with greater reading ease and higher DISCERN and JAMA scores than un-certified sites (p < 0.05). On average, health news outlets had the highest DISCERN, JAMA, and reading ease scores and were written at the lowest grade reading level but had the highest level of distraction (p < 0.05).
    CONCLUSION: For the iGeneration and their care-givers, the internet remains the most popular source of health-related and medical information. Despite the wide number of sources available, the quality, accuracy, pertinence and intelligibility of the information remains highly variable. As clinicians, we should direct patients to verifiable sites with regulated information and, where possible, contribute high quality information to those sites.
    Keywords:  Information; Internet; Scoliosis
    DOI:  https://doi.org/10.1007/s43390-020-00156-8
  7. Front Public Health. 2020 ;8 254
    Olaimat AN, Aolymat I, Shahbaz HM, Holley RA.
      Background: Coronavirus disease 2019 (COVID-19) has rapidly spread worldwide, and it was officially declared to be a pandemic by the World Health Organization (WHO) on March 11, 2020. Most countries over the entire globe have reported some COVID-19 cases. The current study aimed to assess student knowledge about COVID-19 at different Jordanian universities and determine where they sourced their information. Methods:A cross-sectional study was conducted among 2,083 undergraduate or postgraduate students from different governmental and private universities during the initial stage of the disease in Jordan (March 19-21, 2020) using a validated, structured, self-administered, online questionnaire. The survey was structured to assess their knowledge about viral sources, incubation period, mortality rate, transmission, symptoms and complications as well as the source of information about COVID-19. Results:Overall, 56.5% of the respondents showed good knowledge and almost 40.5% showed moderate knowledge. On the other hand, 3.0% of the participants showed poor knowledge about COVID-19. The average knowledge score of students was 80.1%, which is considered to be within the scale of good knowledge. Both the college of study and educational level significantly (P < 0.05) associated with student knowledge. Students who majored in medical sciences showed the highest mean score of 82.8%, with 69.0% displaying a good knowledge level. Postgraduate students had significantly higher knowledge scores compared to undergraduate students. The majority of students used the internet, social media and mass media as sources of information about COVID-19. Scientific websites and articles were used more commonly by medical and postgraduate students. Conclusions:The COVID-19 pandemic is a major challenge to the health of the world population; therefore, these results assessing students' knowledge provide an important baseline for planning required educational interventions such as contact tracing and self-quarantine. These results may also help public health authorities by engaging communities in implementation of protective health measures, including positive hygienic practices such as hand washing to reduce the risk of COVID-19.
    Keywords:  COVID-19; Jordan; SARS-CoV-2 virus; awareness; coronavirus; information source; knowledge; university student
    DOI:  https://doi.org/10.3389/fpubh.2020.00254
  8. Infect Dis Health. 2020 Jun 19. pii: S2468-0451(20)30034-1. [Epub ahead of print]
    Meleo-Erwin Z, Basch CH, Fera J, Garcia P.
      BACKGROUND: Dengue is the fastest spreading viral infection. A number of epidemics have occurred in subtropical and tropical regions of the United States. A future expansion and intensification of dengue has been predicted, partly due to changing environmental conditions. In the absence of treatment, prevention and control are vital to addressing this vector-borne disease. Health education and community engagement are a key part of such efforts. Given that the majority of Americans search for health information online, ensuring the readability of web-based dengue information is an important consideration for health officials. To date, there is a dearth of information on this topic.METHODS: To address this gap, "Dengue fever" was searched using Google Chrome to generate a sample of 100 websites. Using Readable.io, the readability of these 100 sites was then determined using five different tests. The data were analyzed using SPSS.
    RESULTS: The results indicate that the vast majority of online materials on dengue are written at a level far beyond what is recommended. Three of the five tests did not find any of the sampled sites to be appropriate in terms of readability. In general, the websites analyzed tended to be written at a level beyond the 10th grade.
    CONCLUSION: These findings have important implications for future dengue outbreaks in the United States.
    Keywords:  Dengue fever; On-line material; Readability
    DOI:  https://doi.org/10.1016/j.idh.2020.04.005
  9. Stud Health Technol Inform. 2020 Jun 16. 270 1108-1112
    Arief M, Rissanen S, Saranto K.
      UN have recommended the adoption of Web Content Accessibility Guidelines (WCAG) version 2.0 from W3C to guarantee that web content is more accessible to everyone. This study aims to evaluate the effectiveness of WCAG 2.0 implementation by comparing the country with and without WCAG2.0 policy adoption. The objective is to compare the impact of adopting the WCAG 2.0 standard in health information provision by analyzing the differences between countries that adopted the WCAG 2.0 standard and those that did not, specifically for health information related to the elderly. To this end, searches were performed on the Google search engine for online health websites with the keyword "Alzheimer" and the specific country settings for Indonesia and UK, in the local language of each country. Website evaluations were performed for ten websites found with this search by using the WCAG 2.0 measurement tool Axe. Statistical analysis using descriptive and Mann-Whitney analysis to measure the impact of the WCAG 2.0 showed a predominance of low violation occurrences in the UK with 40% from selected websites compared to 80% medium violation occurrences and no low violation occurrences in Indonesia. Although the country with WCAG 2.0 implementation had a lower frequency of violation, no significant differences were found between countries and the media type in WCAG 2.0 evaluation, implying the need to improve the effectiveness of policy implementation.
    Keywords:  Alzheimer; Information; Online; Policy; Web Accessibility
    DOI:  https://doi.org/10.3233/SHTI200334
  10. Australas Emerg Care. 2020 Jun 17. pii: S2588-994X(20)30045-2. [Epub ahead of print]
    Yilmaz Ferhatoglu S, Kudsioglu T.
      BACKGROUND: The internet is a widely used source for obtaining medical information both by patients and physicians. YouTube is an essential source of information that may be a useful tool to inform the public and may improve the learning experience if used adequately. In this study, we aimed to estimate the quality and accuracy of videos about cardiopulmonary resuscitation videos aired on YouTube, which is the most popular video platform of the online world.METHODS: We included the first 100 videos, in English, returned by the YouTube search engine in response to "cardiopulmonary resuscitation" keyword query to the study. The popularity of the videos was evaluated with an index called the video power index. The educational quality, accuracy, and transparency of the visual content were measured using the DISCERN questionnaire (DISCERN), Journal of American Medical Association (JAMA-BC) benchmark criteria, and Global Quality Score (GQS). The technical quality was measured by the Cardiopulmonary Resuscitation Scoring System (CPR-SS), which was utilized by three bariatric surgeons.
    RESULTS: The source in 7% of the videos was medical doctors. The content in 89% of the videos was cardiopulmonary resuscitation technique. According to sources, videos uploaded by physicians had significantly higher quality, transparency, educational and technical value. Unlike, videos uploaded by physicians had a lower video power index than videos uploaded by non-medical professionals. Cardiopulmonary resuscitation technique videos had significantly higher quality, transparency, educational and technical value. Cardiopulmonary resuscitation technique videos also had higher popularity scores than "information or lecture" videos. Also, negative correlations were found between the (popularity index and transparency, education and technical quality scores.
    CONCLUSIONS: Online information on cardiopulmonary resuscitation is of low quality, and its contents are of unknown source and accuracy. However, the educational potential of the online video platform, YouTube, cannot be neglected.
    Keywords:  CPR; Cardiopulmonary resucitation; Online; Social media; YouTube
    DOI:  https://doi.org/10.1016/j.auec.2020.05.005
  11. J Neurosurg Spine. 2020 Jun 26. pii: 2020.4.SPINE191539. [Epub ahead of print] 1-6
    White MD, Latour K, Giordano M, Taylor T, Agarwal N.
      OBJECTIVE: There is an increasing trend among patients and their families to seek medical knowledge on the internet. Patients undergoing surgical interventions, including lateral lumbar interbody fusion (LLIF), often rely on online videos as a first source of knowledge to familiarize themselves with the procedure. In this study the authors sought to investigate the reliability and quality of LLIF-related online videos.METHODS: In December 2018, the authors searched the YouTube platform using 3 search terms: lateral lumbar interbody fusion, LLIF surgery, and LLIF. The relevance-based ranking search option was used, and results from the first 3 pages were investigated. Only videos from universities, hospitals, and academic associations were included for final evaluation. By means of the DISCERN instrument, a validated measure of reliability and quality for online patient education resources, 3 authors of the present study independently evaluated the quality of information.
    RESULTS: In total, 296 videos were identified by using the 3 search terms. Ten videos met inclusion criteria and were further evaluated. The average (± SD) DISCERN video quality assessment score for these 10 videos was 3.42 ± 0.16. Two videos (20%) had an average score above 4, corresponding to a high-quality source of information. Of the remaining 8 videos, 6 (60%) scored moderately, in the range of 3-4, indicating that the publication is reliable but important information is missing. The final 2 videos (20%) had a low average score (2 or below), indicating that they are unlikely to be of any benefit and should not be used. Videos with intraoperative clips were significantly more popular, as indicated by the numbers of likes and views (p = 0.01). There was no correlation between video popularity and DISCERN score (p = 0.104). In August 2019, the total number of views for the 10 videos in the final analysis was 537,785.
    CONCLUSIONS: The findings of this study demonstrate that patients who seek to access information about LLIF by using the YouTube platform will be presented with an overall moderate quality of educational content on this procedure. Moreover, compared with videos that provide patient information on treatments used in other medical fields, videos providing information on LLIF surgery are still exiguous. In view of the increasing trend to seek medical knowledge on the YouTube platform, and in order to support and optimize patient education on LLIF surgery, the authors encourage academic neurosurgery institutions in the United States and worldwide to implement the release of reliable video educational content.
    Keywords:  LLIF = lateral lumbar interbody fusion; lateral lumbar interbody fusion; minimally invasive spine surgery; neurosurgery; patient education
    DOI:  https://doi.org/10.3171/2020.4.SPINE191539
  12. Int J Environ Res Public Health. 2020 06 18. pii: E4377. [Epub ahead of print]17(12):
    Le HT, Nguyen DN, Beydoun AS, Le XTT, Nguyen TT, Pham QT, Ta NTK, Nguyen QT, Nguyen AN, Hoang MT, Vu LG, Tran BX, Latkin CA, Ho CSH, Ho RCM.
      Due to the rapid spread of coronavirus, Vietnam introduced its first national partial lockdown on April 1st, 2020. The public relied on online sources, whether through official websites or phone-based applications, to acquire up-to-date health information, provide accurate instructions, and limit misinformation. This study aims to provide insight regarding the current level of awareness of the pandemic, and to identify associated factors in Vietnamese participants to recommend necessary interventions. A cross-sectional study was conducted using a web-based survey during the first week of the lockdown period. There were 341 observations collected using a snowball sampling technique. A Tobit multivariable regression model was used to identify factors associated with the demand for each category of health information. The most requested information was the latest updated news on the epidemic, followed by information about disease symptoms and updated news on the outbreak. The prevalence of diverse socioeconomic, demographic, and ethnic factors in Vietnam requires consideration of the specific health information needs of unique groups. Identifying group-specific demands would be helpful to provide proper information to fulfill each population group's needs.
    Keywords:  COVID-19; Vietnam; health information; news on the pandemic; risk communication
    DOI:  https://doi.org/10.3390/ijerph17124377
  13. Dent Traumatol. 2020 Jun 23.
    Bamashmous N, Cunningham SJ, Parekh S.
      BACKGROUND AND AIM: Dental trauma is a common public health issue that can be associated with high costs and life-long treatment. Children and parents should have the correct information regarding its management to reduce the potential long-term sequelae. Understanding their information seeking behaviour is therefore important to ensure that they are able to access, understand and retain the relevant information. The aim of this study was to investigate (i) the different ways that patients undertaking treatment for dental trauma, and their parents, look for relevant information, (ii) their information needs, and (iii) their preferred format of information.MATERIAL AND METHODS: This was a two-phase study: Phase I involved in-depth interviews with 10 patients who were currently undergoing treatment for dental trauma, and 11 parents. In phase II, the themes and subthemes identified from the interviews were used to develop patient and parent questionnaires, which were distributed to all trauma patients and parents who fulfilled the inclusion criteria and who were attending the dental clinics from March to May 2014.
    RESULTS: It was found that patients and parents had different concerns immediately following the trauma, during treatment and in the long-term, and they wanted different forms of information to answer their concerns. Reliability was the main issue about trusting the information available in the internet, social media and other online sources. Therefore, patients preferred to obtain information verbally from their dentist, while parents preferred written information.
    CONCLUSION: Patients and parents have different information needs and these also vary at different stages of treatment. It is therefore important to customise the way information is provided for each group accordingly.
    Keywords:  Information seeking behaviour; children; dental trauma; in depth-interviews; information provision; knowledge
    DOI:  https://doi.org/10.1111/edt.12581
  14. BMC Pregnancy Childbirth. 2020 Jun 26. 20(1): 377
    Snyder A, Neufeld HT, Forbes L.
      BACKGROUND: The objective of this study was to describe women's processes for finding pregnancy-related nutrition information, their experiences seeking this information online and their ideas for improving internet sources of this information.METHODS: In total, 97 pregnant women completed an online quantitative questionnaire and 10 primiparous pregnant women completed semi-structured telephone interviews. Questionnaires and interviews asked participants to describe sources of pregnancy-related nutrition information; time of seeking; processes of searching online; experiences searching online; ideas for improving information found online. Survey data were analyzed using descriptive statistics and Chi square tests; interview data were analyzed using thematic analysis.
    RESULTS: Nearly all (96%) survey participants sought nutrition information online. Information was most commonly sought during the first trimester of pregnancy. Motivators for using the internet included convenience and lack of support from health care providers. Barriers to using online information included lack of trust, difficulty finding information and worry. Women adapted the information they found online to meet their needs and reported making positive changes to their diets.
    CONCLUSIONS: The internet is a key source of prenatal nutrition information that women report using to make positive dietary changes. Women would benefit from improved access to trustworthy internet sources, increased availability of information on different diets and health conditions, and increased support from health care providers.
    Keywords:  Diet, food and nutrition; Health education; Internet; Pregnancy
    DOI:  https://doi.org/10.1186/s12884-020-03065-w
  15. Am J Health Promot. 2020 Jun 26. 890117120934609
    Fareed N, Swoboda CM, Jonnalagadda P, Walker DM, Huerta RT.
      PURPOSE: Assessed racial disparities in health information-seeking behavior and trust of information sources from 2007 to 2017.DESIGN: Pooled cross-sectional survey data.
    SETTING: Health Information National Trends Survey (HINTS).
    PARTICIPATION: Data included 6 iterations of HINTS (pooled: N = 19 496; 2007: n = 3593; 2011: n = 3959; 2013: n = 3185; Food and Drug Administration [FDA] 2015: n = 3738; 2017: n = 3285; and FDA 2017: n = 1736).
    MEASURES: Outcome variables were health information seeking, high confidence, and high trust of health information from several sources. Independent variable was race group, controlling for other sociodemographic and socioeconomic variables.
    ANALYSIS: Weighted descriptive and multivariate logistic regression for the pooled sample assessed associations by race. Fully interacted models with race-survey year interactions compared differences in outcomes between years.
    RESULTS: Black respondents, relative to white, had greater odds of having high confidence in their ability to attain health information, trust of health information from newspapers and magazines, radio, internet, television, government, charitable organizations, and religious organizations. Hispanic respondents, relative to white, had lower odds of seeking health information and trusting health information from doctors. They had higher odds of trusting health information from the radio, the internet, television, charitable organizations, and religious organizations.
    CONCLUSION: Disparities between races in trust of information sources remained across time. Understanding optimal information media, their reach, and credibility among racial groups could enable more targeted approaches to developing interventions. Our analytical approach minimized limitations present in the HINTS.
    Keywords:  confidence; disparities; health communication; health information; health promotion; information sources; race; trust
    DOI:  https://doi.org/10.1177/0890117120934609
  16. Sci Data. 2020 Jun 26. 7(1): 205
    Xu J, Kim S, Song M, Jeong M, Kim D, Kang J, Rousseau JF, Li X, Xu W, Torvik VI, Bu Y, Chen C, Ebeid IA, Li D, Ding Y.
      PubMed® is an essential resource for the medical domain, but useful concepts are either difficult to extract or are ambiguous, which has significantly hindered knowledge discovery. To address this issue, we constructed a PubMed knowledge graph (PKG) by extracting bio-entities from 29 million PubMed abstracts, disambiguating author names, integrating funding data through the National Institutes of Health (NIH) ExPORTER, collecting affiliation history and educational background of authors from ORCID®, and identifying fine-grained affiliation data from MapAffil. Through the integration of these credible multi-source data, we could create connections among the bio-entities, authors, articles, affiliations, and funding. Data validation revealed that the BioBERT deep learning method of bio-entity extraction significantly outperformed the state-of-the-art models based on the F1 score (by 0.51%), with the author name disambiguation (AND) achieving an F1 score of 98.09%. PKG can trigger broader innovations, not only enabling us to measure scholarly impact, knowledge usage, and knowledge transfer, but also assisting us in profiling authors and organizations based on their connections with bio-entities.
    DOI:  https://doi.org/10.1038/s41597-020-0543-2
  17. Stud Health Technol Inform. 2020 Jun 16. 270 347-351
    Villena F, Eisenmann U, Knaup P, Dunstan J, Ganzinger M.
      The amount of digital data derived from healthcare processes have increased tremendously in the last years. This applies especially to unstructured data, which are often hard to analyze due to the lack of available tools to process and extract information. Natural language processing is often used in medicine, but the majority of tools used by researchers are developed primarily for the English language. For developing and testing natural language processing methods, it is important to have a suitable corpus, specific to the medical domain that covers the intended target language. To improve the potential of natural language processing research, we developed tools to derive language specific medical corpora from publicly available text sources. n order to extract medicine-specific unstructured text data, openly available pub-lications from biomedical journals were used in a four-step process: (1) medical journal databases were scraped to download the articles, (2) the articles were parsed and consolidated into a single repository, (3) the content of the repository was de-scribed, and (4) the text data and the codes were released. In total, 93 969 articles were retrieved, with a word count of 83 868 501 in three different languages (German, English, and Spanish) from two medical journal databases Our results show that unstructured text data extraction from openly available medical journal databases for the construction of unified corpora of medical text data can be achieved through web scraping techniques.
    Keywords:  Data Mining; Information Storage and Retrieval; Linguistics; Medical Informatics; Natural Language Processing
    DOI:  https://doi.org/10.3233/SHTI200180
  18. Stud Health Technol Inform. 2020 Jun 16. 270 208-212
    Robert A, Damachi F, Bjelogrlic M, Goldman JP, Lovis C.
      This paper presents five document retrieval systems for a small (few thousands) and domain specific corpora (weekly peer-reviewed medical journals published in French) as well as an evaluation methodology to quantify the models performance. The proposed methodology does not rely on external annotations and therefore can be used as an ad hoc evaluation procedure for most document retrieval tasks. Statistical models and vector space models are empirically compared on a synthetic document retrieval task. For our dataset size and specificities the statistical approaches consistently performed better than its vector space counterparts.
    Keywords:  Document Retrieval; Medical Information Retrieval; Natural Language Processing; Word Embedding
    DOI:  https://doi.org/10.3233/SHTI200152
  19. Stud Health Technol Inform. 2020 Jun 16. 270 1026-1030
    Monkman H, Schmidt T, Nøhr C.
      Many people take prescription medications and need information about the risks and benefits associated with taking them. Citizens are increasingly turning to the internet for health information and medication information is no exception. There are a variety of websites that offer Online Medication Information for Citizens (OMIC). This study compared six such websites using the Health Literacy Online (HLO) Checklist as a framework. Additionally, we conducted a detailed analysis of the individual content in each OMIC for three different medications. We identified several strengths and weaknesses of the different websites in terms of how they were designed and written and their appropriateness for users with limited eHealth literacy.
    Keywords:  Consumer health informatics; eHealth literacy; health literacy; medication information
    DOI:  https://doi.org/10.3233/SHTI200317
  20. Zhonghua Liu Xing Bing Xue Za Zhi. 2020 Jun 10. 41(6): 952-956
    Zhou JJ, Wang SF, Li LM.
      Python web crawler technology, which automatically and massively getting information from the Internet by mimicking net users' browsing behavior, is a basic supporting technique to extract and integrate multi-source heterogeneous data in the field of Infodemiology. There are two types of Python web crawler: simple and massive-scale, both collect information simultaneously from the database establishment. Advantages of this technique are characterized as: being simple syntax, in high flexibility and low cost in learning and maintenance. Contents of the current application scenarios include surveillance, implementation and evaluation of health intervention programs on public health issues, as well as on smart doctor seeking. For the last two years, the Chinese government started to encourage the integration and utilization of multi-source heterogeneous data including internet information. Hence, the number of application scenarios for Python web crawler technology are bound to increase in the foreseeable future. Corresponding matched talent cultivations and technical innovations are suggested to add to the current education and research systems on public health issues.
    Keywords:  Health intervention; Infodemiology; Public health surveillance; Python web crawler technology; Smart doctor seeking
    DOI:  https://doi.org/10.3760/cma.j.cn112338-20190901-00643
  21. Stud Health Technol Inform. 2020 Jun 16. 270 884-888
    Caucheteur D, Gobeill J, Mottaz A, Pasche E, Michel PA, Mottin L, Stekhoven DJ, Barbié V, Ruch P.
      The Swiss Variant Interpretation Platform for Oncology is a centralized, joint and curated database for clinical somatic variants piloted by a board of Swiss healthcare institutions and operated by the SIB Swiss Institute of Bioinformatics. To support this effort, SIB Text Mining designed a set of text analytics services. This report focuses on three of those services. First, the automatic annotations of the literature with a set of terminologies have been performed, resulting in a large annotated version of MEDLINE and PMC. Second, a generator of variant synonyms for single nucleotide variants has been developed using publicly available data resources, as well as patterns of non-standard formats, often found in the literature. Third, a literature ranking service enables to retrieve a ranked set of MEDLINE abstracts given a variant and optionally a diagnosis. The annotation of MEDLINE and PMC resulted in a total of respectively 785,181,199 and 1,156,060,212 annotations, which means an average of 26 and 425 annotations per abstract and full-text article. The generator of variant synonyms enables to retrieve up to 42 synonyms for a variant. The literature ranking service reaches a precision (P10) of 63%, which means that almost two-thirds of the top-10 returned abstracts are judged relevant. Further services will be implemented to complete this set of services, such as a service to retrieve relevant clinical trials for a patient and a literature ranking service for full-text articles.
    Keywords:  Precision medicine; literature; terminology; text-mining; variant
    DOI:  https://doi.org/10.3233/SHTI200288