Patent Issued for Data Platform For Automated Data Extraction, Transformation, And/Or Loading (USPTO 10,810,223)
2020 OCT 30 (NewsRx) -- By a
Patent number 10,810,223 is assigned to
The following quote was obtained by the news editors from the background information supplied by the inventors: βAs more and more industries become digitized, it is not uncommon for different kinds of information to be exchanged electronically. The healthcare industry is one in which an Electronic Data Interchange (EDI) plays a central role in facilitating the electronic communication and exchange of healthcare related data, including, for example, data pertaining to health insurance claims, health insurance enrollment, eligibility data, claims settlement, medical records, and/or the like. The Health insurance Portability and Accountability Act (HIPAA) has led to standardized claims administration and automation in the healthcare industry by employing EDI messages to exchange data.β
In addition to the background information obtained for this patent, NewsRx journalists also obtained the inventorsβ summary information for this patent: βAccording to some possible implementations, a method may include receiving, by a computing resource of a cloud computing environment, a plurality of data files from a healthcare electronic data interchange (EDI). The plurality of data files may be received in a plurality of different data formats, and the plurality of data files may include data elements associated with healthcare data. The method may include converting, by a computing resource of the cloud computing environment, the plurality of data files received in the plurality of different data formats to a common data format. The method may include extracting, by a computing resource of the cloud computing environment, data elements from the plurality of data files converted to the common data format. The method may include assigning, by a computing resource of the cloud computing environment, the data elements extracted from the plurality of data files to file identifiers that identify from which of the plurality of data files the data elements were extracted. The method may include assigning, by a computing resource of the cloud computing environment, the data elements extracted from the plurality of data files to attribute identifiers that identify types of healthcare data represented by the data elements. The method may include aggregating, by a computing resource of the cloud computing environment, the data elements based on the file identifiers and the attribute identifiers to create a standardized data set. The method may include mapping, by a computing resource of the cloud computing environment, the data elements in the standardized data set to a plurality of functions contained in at least one function library based on a mapping between the attribute identifiers and the plurality of functions. The method may include generating, by a computing resource of the cloud computing environment, a plurality of values based on mapping the data elements to the plurality of functions. The method may include determining, by a computing resource of the cloud computing environment, a healthcare metric based on combining the plurality of values according to a healthcare metric definition. The method may include posting, by a computing resource of the cloud computing environment, the healthcare metric to the healthcare EDI for consumption by healthcare data clients.
βAccording to some possible implementations, a device may include one or more memories, and one or more processors, communicatively coupled to the one or more memories, to receive a plurality of data files. The plurality of data files may be received in a plurality of different data formats, and the plurality of data files may include data elements associated with healthcare data. The one or more processors may convert the plurality of data files received in the plurality of different data formats to a common data format, extract data elements from the plurality of data files converted to the common data format, assign the data elements extracted from the plurality of data files to file identifiers that identify from which of the plurality of data files the data elements were extracted, and assign the data elements extracted from the plurality of data files to attribute identifiers that identify types of healthcare data represented by the data elements. The one or more processors may aggregate the data elements based on the file identifiers and the attribute identifiers to create a standardized data set, examine the standardized data set to identify the attribute identifiers present in the standardized data set, and determine, using a data model, a list of healthcare metrics that are derivable from the standardized data set based on the attribute identifiers present in the standardized data set. The one or more processors may map the data elements in the standardized data set to a plurality of functions based on a mapping between the attribute identifiers and the plurality of functions. The plurality of functions may be configured to generate a healthcare metric included in the list of healthcare metrics. The one or more processors may generate a plurality of values based on processing the data elements using the plurality of functions, derive the healthcare metric based on combining the plurality of values according to a healthcare metric definition, and post the healthcare metric to a healthcare electronic data interchange (EDI) for consumption by healthcare data client.
βAccording to some possible implementations, a non-transitory computer-readable medium may store one or more instructions that, when executed by one or more processors, cause the one or more processors to receive a plurality of data files. The plurality of data files may be received in a plurality of different data formats. The plurality of data files may include data elements. The one or more instructions, when executed by the one or more processors, may cause the one or more processors to convert the plurality of data files received in the plurality of different data formats to a common data format, and extract data elements from the plurality of data files converted to the common data format. The one or more instructions, when executed by the one or more processors, may cause the one or more processors to assign the data elements extracted from the plurality of data files to file identifiers that identify from which of the plurality of data files the data elements were extracted, and assign the data elements extracted from the plurality of data files to attribute identifiers that identify types of data represented by the data elements. The one or more instructions, when executed by the one or more processors, may cause the one or more processors to examine a data file of the plurality of data files to identify a combination of data elements present in the data file, and determine, using a first machine learning model, a first score for a data element in the data file based on the combination of data elements present in the data file. The first score may predict a type of data represented by the data element based on the combination of data elements present in the data file. The one or more instructions, when executed by the one or more processors, may cause the one or more processors to assign the data element to an attribute identifier based on the first score, aggregate the data elements based on the file identifiers and the attribute identifiers to create a standardized data set, and map the data elements in the standardized data set to a plurality of functions contained in at least one function library based on a mapping between the attribute identifiers and the plurality of functions. The one or more instructions, when executed by the one or more processors, may cause the one or more processors to generate a plurality of values based on mapping the data elements to the plurality of functions, derive a metric based on combining the plurality of values according to a metric definition, and post the metric to an electronic data interchange (EDI) for consumption by data clients.β
The claims supplied by the inventors are:
βWhat is claimed is:
β1. A method, comprising: receiving, by a computing resource of a cloud computing environment, a plurality of data files from a healthcare electronic data interchange (EDI), wherein the plurality of data files are received in a plurality of different data formats, and wherein the plurality of data files include data elements associated with healthcare data; converting, by the computing resource of the cloud computing environment, the plurality of data files received in the plurality of different data formats to a common data format; extracting, by the computing resource of the cloud computing environment, data elements from the plurality of data files converted to the common data format; extracting, by the computing resource of the cloud computing environment, a first data element, of the data elements, from a first data file of the plurality of data files; extracting, by the computing resource of the cloud computing environment, a second data element, of the data elements from a second data file of the plurality of data files; determining, by the computing resource of the cloud computing environment, that the first data element and the second data element require a data transformation; mapping, by the computing resource of the cloud computing environment, the first data element and the second data element to a common transformation algorithm; transforming, by the computing resource of the cloud computing environment, the first data element, using the common transformation algorithm, into a modified first data element; transforming, by the computing resource of the cloud computing environment, the second data element, using the common transformation algorithm, into a modified second data element; assigning, by the computing resource of the cloud computing environment, the data elements extracted from the plurality of data files to file identifiers that identify from which of the plurality of data files the data elements were extracted; assigning, by the computing resource of the cloud computing environment, the data elements extracted from the plurality of data files to attribute identifiers that identify types of healthcare data represented by the data elements; assigning, by the computing resource of the cloud computing environment, the modified first data element and the modified second data element to a first attribute identifier of the attribute identifiers; aggregating, by the computing resource of the cloud computing environment, the data elements based on the file identifiers and the attribute identifiers to create a standardized data set; mapping, by the computing resource of the cloud computing environment, the data elements in the standardized data set to a plurality of functions contained in at least one function library based on a mapping between the attribute identifiers and the plurality of functions; generating, by the computing resource of the cloud computing environment, a plurality of values based on mapping the data elements to the plurality of functions; determining, by the computing resource of the cloud computing environment, a healthcare metric based on combining the plurality of values according to a healthcare metric definition; and posting, by the computing resource of the cloud computing environment, the healthcare metric to the healthcare EDI for consumption by healthcare data clients.
β2. The method of claim 1, wherein assigning the data elements extracted from the plurality of data files to the attribute identifiers comprises: examining a data file, of the plurality of data files, to identify a combination of data elements present in the data file; and determining, using a machine learning model, a score for a data element in the data file based on the combination of data elements present in the data file, wherein the score predicts a type of healthcare data represented by the data element based on the combination of data elements present in the data file; and assigning the data element to one of the attribute identifiers based on the score.
β3. The method of claim 1, further comprising: examining a data file, of the plurality of data files, to identify a combination of data elements present in the data file; and determining, using a machine learning model, a score for the data file based on the combination of data elements present in the data file, wherein the score predicts a healthcare subject area associated with the data file based on the combination of data elements present in the data file; and assigning the data file to a healthcare subject area repository based on the score.
β4. The method of claim 1, further comprising: examining the standardized data set to identify the attribute identifiers present in the standardized data set; determining, using a data model, a list of healthcare metrics that are derivable from the standardized data set based on the attribute identifiers present in the standardized data set; and presenting the list of healthcare metrics that are derivable from the standardized data set to one or more healthcare data clients.
β5. The method of claim 1, further comprising validating decimal and integer fields in the plurality of data files converted to the common data format.
β6. The method of claim 1, further comprising: calculating a plurality of key performance indicators (KPIs) based on mapping the data elements in the standardized data set to the plurality of functions contained in the at least one function library; and posting the plurality of KPIs to the healthcare EDI for consumption by healthcare data clients.
β7. The method of claim 1, wherein the plurality of data files are received in two or more data formats including: a HL7 message format, a
β8. A device, comprising: one or more memory devices; and one or more processors, implemented at least partially in hardware and communicatively coupled to the one or more memory devices, to: receive a plurality of data files, wherein the plurality of data files are received in a plurality of different data formats, and wherein the plurality of data files include data elements associated with healthcare data; convert the plurality of data files received in the plurality of different data formats to a common data format; extract data elements from the plurality of data files converted to the common data format; extract a first data element, of the data elements, from a first data file of the plurality of data files; extract a second data element, of the data elements from a second data file of the plurality of data files; determine that the first data element and the second data element require a data transformation; map the first data element and the second data element to a common transformation algorithm; transform the first data element, using the common transformation algorithm, into a modified first data element; transform the second data element, using the common transformation algorithm, into a modified second data element; assign the data elements extracted from the plurality of data files to file identifiers that identify from which of the plurality of data files the data elements were extracted; assign the data elements extracted from the plurality of data files to attribute identifiers that identify types of healthcare data represented by the data elements; assign the modified first data element and the modified second data element to a first attribute identifier of the attribute identifiers; aggregate the data elements based on the file identifiers and the attribute identifiers to create a standardized data set; examine the standardized data set to identify the attribute identifiers present in the standardized data set; determine, using a data model, a list of healthcare metrics that are derivable from the standardized data set based on the attribute identifiers present in the standardized data set; map the data elements in the standardized data set to a plurality of functions based on a mapping between the attribute identifiers and the plurality of functions, wherein the plurality of functions is configured to generate a healthcare metric included in the list of healthcare metrics; generate a plurality of values based on processing the data elements using the plurality of functions; derive the healthcare metric based on combining the plurality of values according to a healthcare metric definition; and post the healthcare metric to a healthcare electronic data interchange (EDI) for consumption by healthcare data clients.
β9. The device of claim 8, wherein the one or more processors are further configured to: examine a data file, of the plurality of data files, to identify a combination of data elements present in the data file; determine, using a machine learning model, a score for the data file based on the combination of data elements present in the data file, wherein the score predicts a healthcare subject area associated with the data file based on the combination of data elements present in the data file; and assign the data file to a healthcare subject area repository based on the score.
β10. The device of claim 8, wherein the attribute identifiers are associated with a health insurance member, a health insurance claim, a healthcare provider, a hospital, or a pharmacy.
β11. The device of claim 8, wherein the plurality of functions is configured to calculate a plurality of key performance indicators (KPIs) associated with a healthcare subject area.
β12. The device of claim 11, wherein the healthcare subject area includes one of: a first subject area relating to a pharmacy, a second subject area relating to a hospital, a third subject area relating to a primary care physician, or a fourth subject area relating to health insurance.
β13. The device of claim 8, wherein the plurality of data files are received from the healthcare EDI.
β14. The device of claim 13, wherein the plurality of data files are received in two or more data formats including: a HL7 message format, a
β15. A non-transitory computer-readable medium storing instructions, the instructions comprising: one or more instructions that, when executed by one or more processors, cause the one or more processors of a device to: receive a plurality of data files, wherein the plurality of data files are received in a plurality of different data formats, and wherein the plurality of data files include data elements; convert the plurality of data files received in the plurality of different data formats to a common data format; extract data elements from the plurality of data files converted to the common data format; extract a first data element, of the data elements, from a first data file of the plurality of data files; extract a second data element, of the data elements from a second data file of the plurality of data files; determine that the first data element and the second data file require a data transformation; map the first data element and the second data element to a common transformation algorithm; transform the first data element, using the common transformation algorithm, into a modified first data element; transform the second data element, using the common transformation algorithm, into a modified second data element; assign the data elements extracted from the plurality of data files to file identifiers that identify from which of the plurality of data files the data elements were extracted; assign the data elements extracted from the plurality of data files to attribute identifiers that identify types of data represented by the data elements, wherein for a data file of the plurality of data files: examine the data file to identify a combination of data elements present in the data file, determine, using a first machine learning model, a first score for a data element in the data file based on the combination of data elements present in the data file, wherein the first score predicts a type of data represented by the data element based on the combination of data elements present in the data file, and assign the data element to an attribute identifier based on the first score; assign the modified first data element and the modified second data element to a first attribute identifier of the attribute identifiers; aggregate the data elements based on the file identifiers and the attribute identifiers to create a standardized data set; map the data elements in the standardized data set to a plurality of functions contained in at least one function library based on a mapping between the attribute identifiers and the plurality of functions; generate a plurality of values based on mapping the data elements to the plurality of functions; derive a metric based on combining the plurality of values according to a metric definition; and post the metric to an electronic data interchange (EDI) for consumption by data clients.
β16. The non-transitory computer-readable medium of claim 15, wherein the one or more instructions, when executed by the one or more processors, further cause the one or more processors to: determine, using a second machine learning model, a second score for the data file based on the combination of data elements present in the data file, wherein the second score predicts a subject area associated with the data file based on the combination of data elements present in the data file; and assign the data file to a subject area repository based on the second score.
β17. The non-transitory computer-readable medium of claim 15, wherein the one or more instructions, when executed by the one or more processors, further cause the one or more processors to: examine the standardized data set to identify the attribute identifiers present in the standardized data set; determine, using a data model, a list of metrics that are derivable from the standardized data set based on the attribute identifiers present in the standardized data set; and present the list of metrics that are derivable from the standardized data set to one or more data clients.
β18. The non-transitory computer-readable medium of claim 15, wherein the one or more instructions, when executed by the one or more processors, further cause the one or more processors to: generate a plurality of key performance indicators (KPIs) based on mapping the data elements in the standardized data set to the plurality of functions contained in the at least one function library; and post the plurality of KPIs to the EDI for consumption by healthcare data clients.
β19. The non-transitory computer-readable medium of claim 15, wherein the one or more instructions, when executed by the one or more processors, further cause the one or more processors to: transmit the metric to one or more data clients.
β20. The method of claim 1, further comprising: receiving the plurality of data files in a streaming manner; and receiving the plurality of data files using application programming interface (API) calls.β
URL and more information on this patent, see: Sundararaman, Arun; Ramamoorthy, Udayakumar; Pargunarajan, Sureshkumar; Appusamy, Sangeetha. Data Platform For Automated Data Extraction, Transformation, And/Or Loading.
(Our reports deliver fact-based news of research and discoveries from around the world.)
UMaine System retirees picket, plan legal action over benefits change
Revisions to Publication Requirements for Community Eligibility Status Information Under the National Flood Insurance Program
Advisor News
Annuity News
Health/Employee Benefits News
Life Insurance News