New AI technique significantly boosts Medicare fraud detection: Florida Atlantic University
2024 FEB 13 (NewsRx) -- By a
Traditionally, to detect Medicare fraud, a limited number of auditors, or investigators, are responsible for manually inspecting thousands of claims, but only have enough time to look for very specific patterns indicating suspicious behaviors. Moreover, there are not enough investigators to keep up with the various Medicare fraud schemes.
Utilizing big data, such as from patient records and provider payments, often is considered the best way to produce effective machine learning models to detect fraud. However, in the domain of Medicare insurance fraud detection, handling imbalanced big data and high dimensionality - data in which the number of features is staggeringly high so that calculations become extremely difficult - remains a significant challenge.
New research from the
For the study, researchers systematically tested two imbalanced big Medicare datasets, Part B and Part
Researchers delved deep into the influence of Random Undersampling (RUS), a straightforward, yet potent data sampling technique, and their novel ensemble supervised feature selection technique. RUS works by randomly removing samples from the majority class until a specific balance between the minority and majority classes is met.
The experimental design investigated various scenarios, ranging from using each technique in isolation to employing them in combination. Following analyses of the individual scenarios, researchers again selected the techniques that yielded the best results and performed an analysis of results between all scenarios.
Results of the study, published in the
Consequently, in the classification of either dataset, researchers discovered that a technique with the largest amount of data reduction also yields the best performance, which is the technique of performing feature selection, then applying RUS. Reduction in the number of features leads to more explainable models and performance is significantly better than using all features.
“The performance of a classifier or algorithm can be swayed by multiple effects,” said Taghi Khoshgoftaar, Ph.D., senior author and Motorola Professor,
For feature selection, researchers incorporated a supervised feature selection method based on feature ranking lists. Subsequently, through the implementation of an innovative approach, these lists were combined to yield a conclusive feature ranking. To furnish a benchmark, models also were built utilizing all features of the datasets. Upon the derivation of this consolidated ranking, features were selected based on their position in the list.
“Our systematic approach provided a greater comprehension regarding the interplay between feature selection and model robustness within the context of multiple learning algorithms,” said
For both Medicare Part B and Part D datasets, researchers conducted experiments in five scenarios that exhausted the possible ways to utilize, or omit, the RUS and feature selection data reduction techniques. For both datasets, researchers found that data reduction techniques also improve classification results.
“Given the enormous financial implications of Medicare fraud, findings from this important study not only offer computational advantages but also significantly enhance the effectiveness of fraud detection systems,” said
Study co-authors are Huanjing Wang, Ph.D., a professor of computer science,
- FAU -
About FAU’s
About
(Our reports deliver fact-based news of research and discoveries from around the world.)
Mercury General: Q4 Earnings Snapshot
Research on Agricultural Insurance Reported by Researchers at Czech University of Life Sciences Prague (The impact of digital technologies on agricultural insurance): Agriculture – Agricultural Insurance
Advisor News
Annuity News
Health/Employee Benefits News
Life Insurance News