Differentiating Viral and Bacterial Infections: A Machine Learning Model Based on Routine Blood Test Values

Read original: arXiv:2305.07877 - Published 4/24/2024 by Gregor Gunv{c}ar, Matjav{z} Kukar, Tim Smole, Sav{s}o Mov{s}kon, Tomav{z} Vovko, Simon Podnar, Peter v{C}ernelv{c}, Miran Brvar, Mateja Notar, Manca Koster and 2 others

📈

Overview

This study developed a machine learning model to distinguish between bacterial and viral infections using routine blood test results, C-reactive protein (CRP) levels, biological sex, and age.
The model achieved high accuracy, sensitivity, and specificity in differentiating between bacterial and viral infections, outperforming a CRP-based decision rule.
The model particularly improved accuracy within the CRP range of 10-40 mg/L, where CRP alone is less informative.
The findings highlight the potential of integrating multiple blood parameters for enhanced diagnostic capabilities using machine learning.

Plain English Explanation

When people get sick, it's important for doctors to know if the infection is caused by bacteria or a virus. This helps them decide if the patient needs antibiotics or not. Antibiotics are medications that can kill bacteria, but they don't work against viruses.

In this study, the researchers developed a machine learning model to help make this distinction. The model looked at 16 different measurements from a patient's routine blood tests, as well as their C-reactive protein (CRP) level, biological sex, and age. CRP is a protein that increases when there is inflammation in the body, and it can sometimes help doctors tell if an infection is caused by bacteria or a virus.

The researchers tested the model on a large dataset of over 44,000 cases from a single medical center. The model was able to correctly identify whether an infection was bacterial or viral 82.2% of the time. It was also very good at correctly identifying viral infections (79.7% sensitivity) and bacterial infections (84.5% specificity).

Importantly, the model was particularly helpful in the CRP range of 10-40 mg/L, where CRP alone is less useful for making the distinction. This shows the advantage of using multiple blood test results together, rather than relying on just one marker like CRP.

Overall, this machine learning model has the potential to be a valuable tool for doctors, helping them make more accurate decisions about when to prescribe antibiotics. This could help prevent the overuse of antibiotics, which is a growing problem and can lead to the development of antibiotic-resistant bacteria.

Technical Explanation

The researchers developed a "Virus vs. Bacteria" machine learning model to differentiate between bacterial and viral infections using routine blood test results, C-reactive protein (CRP) concentration, biological sex, and age. They trained and tested the model on a dataset of 44,120 cases from a single medical center.

The model achieved an overall accuracy of 82.2%, with a sensitivity of 79.7% for viral infections and a specificity of 84.5% for bacterial infections. It also had a Brier score of 0.129 and an area under the ROC curve (AUC) of 0.905, indicating strong predictive performance.

Notably, the machine learning model outperformed a CRP-based decision rule. It was particularly effective within the CRP range of 10-40 mg/L, where CRP alone is less informative for distinguishing between bacterial and viral infections. This suggests that integrating multiple blood parameters can enhance diagnostic accuracy compared to using a single marker like CRP.

The researchers posit that this Virus vs. Bacteria model could serve as the foundation for advanced diagnostic tools that leverage machine learning to optimize infection management, especially in the context of the growing threat of antibiotic resistance.

Critical Analysis

The study provides a compelling demonstration of how machine learning can be used to improve diagnostic accuracy in distinguishing between bacterial and viral infections. The large dataset, rigorous evaluation metrics, and comparative analysis against a CRP-based approach lend credibility to the findings.

However, the study is limited to a single medical center, and further validation across different populations and healthcare settings would be valuable to assess the model's generalizability. Additionally, the reliance on routine blood tests, while convenient, may not be feasible in all clinical scenarios, such as in resource-limited or emergency settings.

It would also be interesting to explore the model's performance in specific disease contexts, such as differentiating between viral and bacterial pneumonia, or its ability to identify co-infections. Incorporating additional clinical data, such as symptoms, imaging, or genomic markers, could potentially further improve the model's diagnostic capabilities.

Overall, the study represents a valuable step forward in leveraging machine learning to enhance infection management, but ongoing research and validation will be necessary to fully realize the potential of this approach.

Conclusion

This study developed a machine learning model that can accurately differentiate between bacterial and viral infections using routine blood test results, CRP levels, biological sex, and age. The model outperformed a CRP-based decision rule, particularly in the CRP range where it is less informative alone.

These findings highlight the power of integrating multiple biomarkers, rather than relying on a single indicator, to improve diagnostic accuracy. The Virus vs. Bacteria model paves the way for advanced, machine learning-powered diagnostic tools that can optimize infection management and help address the growing threat of antibiotic resistance.

[As the field of predictive analytics in healthcare continues to evolve, this research demonstrates the potential for data-driven approaches to enhance clinical decision-making and improve patient outcomes.](https://aimodels.fyi/papers/arxiv/early-detection-disease-outbreaks-non-outbreaks-using)

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📈

Differentiating Viral and Bacterial Infections: A Machine Learning Model Based on Routine Blood Test Values

Gregor Gunv{c}ar, Matjav{z} Kukar, Tim Smole, Sav{s}o Mov{s}kon, Tomav{z} Vovko, Simon Podnar, Peter v{C}ernelv{c}, Miran Brvar, Mateja Notar, Manca Koster, Marjeta Tuv{s}ek Jelenc, Marko Notar

The growing threat of antibiotic resistance necessitates accurate differentiation between bacterial and viral infections for proper antibiotic administration. In this study, a Virus vs. Bacteria machine learning model was developed to distinguish between these infection types using 16 routine blood test results, C-reactive protein concentration (CRP), biological sex, and age. With a dataset of 44,120 cases from a single medical center, the model achieved an accuracy of 82.2 %, a sensitivity of 79.7 %, a specificity of 84.5 %, a Brier score of 0.129, and an area under the ROC curve (AUC) of 0.905, outperforming a CRP-based decision rule. Notably, the machine learning model enhanced accuracy within the CRP range of 10-40 mg/L, a range where CRP alone is less informative. These results highlight the advantage of integrating multiple blood parameters in diagnostics. The Virus vs. Bacteria model paves the way for advanced diagnostic tools, leveraging machine learning to optimize infection management.

4/24/2024

Development of Machine Learning Classifiers for Blood-based Diagnosis and Prognosis of Suspected Acute Infections and Sepsis

Ljubomir Buturovic, Michael Mayhew, Roland Luethy, Kirindi Choi, Uros Midic, Nandita Damaraju, Yehudit Hasin-Brumshtein, Amitesh Pratap, Rhys M. Adams, Joao Fonseca, Ambika Srinath, Paul Fleming, Claudia Pereira, Oliver Liesenfeld, Purvesh Khatri, Timothy Sweeney

We applied machine learning to the unmet medical need of rapid and accurate diagnosis and prognosis of acute infections and sepsis in emergency departments. Our solution consists of a Myrna (TM) Instrument and embedded TriVerity (TM) classifiers. The instrument measures abundances of 29 messenger RNAs in patient's blood, subsequently used as features for machine learning. The classifiers convert the input features to an intuitive test report comprising the separate likelihoods of (1) a bacterial infection (2) a viral infection, and (3) severity (need for Intensive Care Unit-level care). In internal validation, the system achieved AUROC = 0.83 on the three-class disease diagnosis (bacterial, viral, or non-infected) and AUROC = 0.77 on binary prognosis of disease severity. The Myrna, TriVerity system was granted breakthrough device designation by the United States Food and Drug Administration (FDA). This engineering manuscript teaches the standard and novel machine learning methods used to translate an academic research concept to a clinical product aimed at improving patient care, and discusses lessons learned.

7/4/2024

COVID-19 Detection Based on Blood Test Parameters using Various Artificial Intelligence Methods

Kavian Khanjani, Seyed Rasoul Hosseini, Hamid Taheri, Shahrzad Shashaani, Mohammad Teshnehlab

In 2019, the world faced a new challenge: a COVID-19 disease caused by the novel coronavirus, SARS-CoV-2. The virus rapidly spread across the globe, leading to a high rate of mortality, which prompted health organizations to take measures to control its transmission. Early disease detection is crucial in the treatment process, and computer-based automatic detection systems have been developed to aid in this effort. These systems often rely on artificial intelligence (AI) approaches such as machine learning, neural networks, fuzzy systems, and deep learning to classify diseases. This study aimed to differentiate COVID-19 patients from others using self-categorizing classifiers and employing various AI methods. This study used two datasets: the blood test samples and radiography images. The best results for the blood test samples obtained from San Raphael Hospital, which include two classes of individuals, those with COVID-19 and those with non-COVID diseases, were achieved through the use of the Ensemble method (a combination of a neural network and two machines learning methods). The results showed that this approach for COVID-19 diagnosis is cost-effective and provides results in a shorter amount of time than other methods. The proposed model achieved an accuracy of 94.09% on the dataset used. Secondly, the radiographic images were divided into four classes: normal, viral pneumonia, ground glass opacity, and COVID-19 infection. These were used for segmentation and classification. The lung lobes were extracted from the images and then categorized into specific classes. We achieved an accuracy of 91.1% on the image dataset. Generally, this study highlights the potential of AI in detecting and managing COVID-19 and underscores the importance of continued research and development in this field.

8/9/2024

Machine learning augmented diagnostic testing to identify sources of variability in test performance

Christopher J. Banks, Aeron Sanchez, Vicki Stewart, Kate Bowen, Graham Smith, Rowland R. Kao

Diagnostic tests which can detect pre-clinical or sub-clinical infection, are one of the most powerful tools in our armoury of weapons to control infectious diseases. Considerable effort has been therefore paid to improving diagnostic testing for human, plant and animal diseases, including strategies for targeting the use of diagnostic tests towards individuals who are more likely to be infected. Here, we follow other recent proposals to further refine this concept, by using machine learning to assess the situational risk under which a diagnostic test is applied to augment its interpretation . We develop this to predict the occurrence of breakdowns of cattle herds due to bovine tuberculosis, exploiting the availability of exceptionally detailed testing records. We show that, without compromising test specificity, test sensitivity can be improved so that the proportion of infected herds detected by the skin test, improves by over 16 percentage points. While many risk factors are associated with increased risk of becoming infected, of note are several factors which suggest that, in some herds there is a higher risk of infection going undetected, including effects that are correlated to the veterinary practice conducting the test, and number of livestock moved off the herd.

4/8/2024