Healthcare Data: Medical Transcriptions Analysis

Case Studies

Home
/
Case Studies
/
Data Mining and Visualization
/
Featured
/
Recommended
/
Text Mining and Topic Modelling
/
Healthcare Data: Medical Transcriptions Analysis

Posted by Nodus Labs | April 7, 2022

Healthcare Data: Medical Transcriptions Analysis

In this case study, we will demonstrate how text network analysis can be used to better understand medical transcriptions data. This approach can be used to better understand the patients’ complaints, classify them into categories, and also reveal some insights about the treatment provided. As a data source, we will use freely available, anonymized medical records from Kaggle. For data analysis, we will be using InfraNodus visual text network analysis tool.

Preparing the Data

As the first step, we will inspect the medical records in order to see what kind of data is contained within. The CSV table provided has several columns that may be interesting for our analysis:

1) description of the problem (written by a doctor, from the patient’s words)
2) medical specialty (e.g. gastroenterology, neurology)
3) description of the treatment the patient obtained

Based on this information, we will create two separate graphs. The first graph will be based on the description of the problem field. The other graph will be based on the description of the treatment. Both graphs will use the “medical specialty” field to categorize the entries, so we can sample the data based on the typology provided.

Visualizing the Data as a Graph

To visualize the data, we will use the InfraNodus CSV import functionality. As InfraNodus takes maximum 3 Mb at once on the small accounts, we will split the original file into 6 parts, 3 Mb each. We will then take one of those files containing the information on neurology, oncology, general treatment, and gastroenterology types — quite diverse. We will then choose the “description” field as the column to import:

As the next step we choose the “medical specialty” field as the categorization field and import the data as a graph:

The logic behind this graph is simple: the words are converted into lemmas, which are represented as nodes in the graph. The co-occurrence of those words are represented as the connections. The resulting graph is then analyzed using various algorithms from network science, which identify the most influential nodes (bigger on the graph) and the topical clusters: the groups of words that tend to co-occur in the same context (shown with the same color and in the Analytics panel on the right).

Most of the common stopwords (such as “a”, “the”, “is”, etc.) are removed and we also removed some more words that were frequently used but do not carry a specific meaning, such as “male”, “female”, “left”, “right”, “history”, “patient” — these are quite generic to the context (at the top right).

Try InfraNodus Text Network Visualization Tool developed by Nodus Labs. You can use it to make sense of disjointed bits and pieces of information, get visual summaries for text documents, and generate insight for your research process: www.infranodus.com

As a result, we see a graph of the most common symptoms that the patients have, according to the medical records:

a) pain in the upper region also connected to nausea (meaning those appear in the same complaints often),
b) chronic hypertension (also connected to the word renal),
c) brain and spine problems

Cutting the Data by Categories

Next, let’s cut the data by one of the categories (medical specialty), to see what kind of problems occur in the case of, for example, nephrology (the study of kindeys). In order to do that, we use the top filter panel in the graph, to show only the statements and the parts of the graph that belong to the “nephrology” category:

We will see that the most common symptoms in this category are renal problems, hypertension, chest pain, and nausea.

Interestingly, these results once again demonstrate how interconnected the body is and that kidney problems are related to hypertension.

To be continued....

Post Views: 1,012

On the internet people come and go, but we would like to stay in touch. If you like what you're reading, please, consider connecting to Nodus Labs on Facebook, Twitter and Patreon, so we can inform you about the latest updates and engage in a dialogue.
- - Tags »
  - health
  - medicine
- ← Sentiment Analysis: AFINN vs Bert AI Algorithms (using the Twitter and Amazon examples)
- AI Writing Tool: GPT-3 Text Generator of Research Questions →

Try InfraNodus — Text Network Visualization Tool Learn More

Nodus Labs

Exploring society and cognition through the framework of network science. Contact Us

By using this site you agree to our Terms of Use and Privacy Policy.
Stay in Touch
Recent Posts
- Reveal Blind Spots in Content with AI-Powered InfraNodus Browser Extension January 24, 2024
- How to Measure Heart Rate Variability (HRV) using Fractals October 1, 2023
- AI Tools for Writing June 18, 2023
- How to Increase the Duration of Deep Sleep with Daily Activities April 10, 2023
- Personal Journal and Diary App with AI Text Analysis Features April 9, 2023
- Using AI for Introspection and Psychology of Self April 7, 2023
- Competitive Intelligence and Market Research with GPT-3 AI and Networks April 2, 2023
- Knowledge Base Text Analysis with NLP January 30, 2023
- How to Improve ChatGPT Generated Text December 15, 2022
- Network Thinking and Mindmapping for Ideation and Brainstorming November 14, 2022
Our Publications

Below you will find a list of our scientific publications. You can also check our page on Google Scholar.

Paranyushkin, D (2019). InfraNodus: Generating Insight Using Text Network Analysis

Paranyushkin, D (2018). Direct Visual Feedback on the Process of Ideation using Text Network Graphs Encourages a more Coherent Expression of Ideas, Nodus Labs

Paranyushkin, D (2013). Addresses to the Federal Assembly of the Russian Federation by Russian presidents, 2008–2012: comparative analysis, Russian Journal of Communication, Volume 5, Issue 3

Paranyushkin, D (2012). Metastability of Cognition in Body-Mind-Environment Network, Nodus Labs

Paranyushkin, D (2012). Informational Epidemics and Synchronized Viral Social Contagion. Nodus Labs.

Paranyushkin, D (2012). Visualization of Text’s Polysingularity Using Network Analysis. Nodus Labs.

Paranyushkin, D. (2011). Identifying the Pathways for Meaning Circulation Using Text Network Analyis. Nodus Labs.

Paranyushkin, D (2011). Inclusive Exclusivity: How to Build Open and Innovative Cultural Networks. Nodus Labs.

Case Studies

Posted by Nodus Labs | April 7, 2022

Healthcare Data: Medical Transcriptions Analysis

Preparing the Data

Visualizing the Data as a Graph

Cutting the Data by Categories

Doing a Research?

Connect on Twitter:

Connect on Facebook:

RSS Feeds:

Nodus Labs

Stay in Touch

Recent Posts

Our Publications

Case Studies

Posted by Nodus Labs | April 7, 2022

Healthcare Data: Medical Transcriptions Analysis

Preparing the Data

Visualizing the Data as a Graph

Cutting the Data by Categories

Doing a Research?

Explore Nodus Labs:

Connect on Twitter:

Connect on Facebook:

RSS Feeds:

Nodus Labs

Stay in Touch

Recent Posts

Our Publications