Skip to Main Content

Text Mining for Search Strategy Development

Discover text mining resources and applications for search strategy development in Health Sciences and Medicine

PubVenn


PubVenn allows you to explore PubMed visually by generating Venn diagrams. You can enter any multi-term search to generate a Venn diagram that allows you to view the size of the citation set for each term as well as how those sets interact. The tool provides an example PubMed search strategy with links to relevant publications. Selecting the 'Expanded subjects' checkbox will include other relevant terms from PubMed. 

Availability: Free online

Data Source: PubMed

Import Formats: Free text

Export Formats: Save as .png

URL: https://pubvenn.appspot.com/

Also try Search Workbench: https://searchworkbench.info/

PubMed PubReMiner


Enter a free text search into the PubReMiner tool and it will query PubMed for relevant results. The tool analyses these results and provides tables that rank the frequency of words in the title and abstract of the articles and also relevant MeSH headings. Other ranked tables include journals in which your query is published the most and authors which are most active in related fields. 

Availability: Free online

Data Source: PubMed

Tool Type: Text frequency

Import Formats: Free text

Export Formats: Save results as a .txt file

URL: http://hgserver2.amc.nl/cgi-bin/miner/miner2.cgi

Coremine


Coremine Medical is a product of the PubGene Company designed to be used by anyone seeking information on health, medicine and biology. It is ideal for those seeking an overview of a complex subject while allowing the possibility to "drill down" to specific details. Search results are presented in a dashboard format comprised of panels containing various categories of information ranging from introductory sources to the latest scientific articles. Coremine presents search results as a graphic network that describes relationships discovered through text-mining. 

Availability: Register for free account

Data Source: PubMed

Tool Type: Visualisation, clustering, text frequency, relationship networks

Features: File upload, Hyperbrowser, search history, alerts

Import Formats: Free text, tab delimited file can be uploaded

Export Formats: None

URL: https://www.coremine.com/medical/

Instructions: https://www.coremine.com/medical/help.html

MeSH on Demand


The MeSH on Demand tool uses the NLM Medical Text Indexer to identifiy MeSH vocabulary in submitted text (paste up to five pages). The results are displayed in a list as well as highlighted in the pasted text (also defines term frequency within the text). Links to similar PubMed related citations are included. 

Availability: Free online

Data Source: PubMed

Tool Type: Text analyser

Import Formats: Free text

Export Formats: Text file

Known Limitations: Non English text needs to be translated first

URL: https://meshb.nlm.nih.gov/MeSHonDemand

Yale MeSH Analyser


Yale MeSH Analyzer is a tool that retrieves article metadata from Medline records and presents the indexing data in an easy to scan grid format. This allows comparison of MeSH headings across publications. 

Availability: Free online

Data Source: PubMed

Tool Type: Comparison

Features: Easy to view comparison of article indexing

Import Formats: Free text (type or paste)

Export Formats: Excel, HTML table

Known Limitations: Query with PubMed IDs only - maximum 20

URL: http://mesh.med.yale.edu

Carrot2


Carrot2 is an Open Source Results Clustering Engine that can automatically organise search results into topics. Carrot2 can query PubMed and allows boolean searching.  

Availability: Free online, download

Data Source: PubMed

Tool Type: Clustering

Features: Simple online interface

Import Formats: Free text

Export Formats: View on screen

Known Limitations: No export options

URL: https://search.carrot2.org/#/search/web


Library Instagram

Library Blogs

Library Contacts