Patent Application Titled “System and Methods for Processing Large Scale Data” Published Online (USPTO 20170364558)
By a
No assignee has been named for this patent application.
Reporters obtained the following quote from the background information supplied by the inventors: "Companies in various industries manage massive amounts of data. For example, companies in the health insurance industry store large amounts of data pertaining to insureds' personal identifying and health information, insurance coverage, claims, prescriptions, pharmacy information, doctor information, etc. This data is often collected and stored over the span of decades. This does not just occur in the health insurance industry. In every industry sector massive amounts of data are piling up. As the amount of data increases and data analysis demands also increase and evolve, companies not only need a way to ensure proper storage and retrieval, but a way to analyze large amounts of data in little time. For a company managing significant volumes of data, a database query of relatively low complexity may not return a result for hours or even days. The time it takes to perform queries is often unreasonable and detrimental to system users, who typically need answers in real-time.
"Accuracy of data querying is also a concern, since as the data set increases in size and complexity, there is more room for error.
"Historically, data injection methods have been centralized solutions based on a lowest common denominator. Although this has made for a simple process for the user, it is a very slow process.
"Exemplary system and method embodiments described herein solve these and other problems, and additionally offer a novel approach to the manner in which databases can be queried."
In addition to obtaining background information on this patent application, NewsRx editors also obtained the inventors' summary information for this patent application: "Exemplary system and method embodiments described and depicted herein are directed generally to increasing the speed at which large amounts of data can be queried, as well as increasing the accuracy of query results. To this end, exemplary system and method embodiments include improved systems and methods for distributed and interactive cube exploration.
"In one embodiment, systems and methods for importing data into the distributed system are provided through the use of an API that allows for individual inserts, bulk load, and customizable table creation. Data management and manipulation, including data replication, data partitioning, data repair, and data deletion, allow for a system with increased query efficiency and accuracy. Data injection takes advantage of every node in the cluster, and also leverages each underlying storage engine's unique features. Every node contributes to the process and the user is able to select the storage engine to use, further enhancing the process.
"Customization allows for the user to control both the speed and accuracy of a given query. Queries can be performed on a pre-determined sampling percentage, which can be manipulated on a per query basis. Sampling percentages dictate the number of table partitions that are queried, and allow for a subset of table partitions to be queried, decreasing the amount of time it takes to calculate a result.
"Speculation queries can be processed by the system between the receipt of real-time queries. Results from speculation queries may be cached and accessed when a relevant real-time query is received by the system, serving to reduce query response time.
"Certain systems and methods provide for a master application in control of numerous distributed slave applications. The number of slave applications utilized for a query may be dictated by the sample rate set by a user. Results obtained by slave applications are aggregated by the master application and returned to the user. Slave applications also run speculation queries at the direction of the master application. If requested by the master, slave applications may also provide query accuracy information that is eventually passed along to the user. Accuracy information allows a user to appreciate the implications of the sample rate that they have selected."
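The sampling and master/slave flow described in the summary can be illustrated with a short Python sketch. It is not taken from the patent application: every name in it (QueryResult, choose_partitions, worker_sum, Master) is hypothetical, the "slave applications" are modeled as a plain function call rather than distributed processes, and scaling the sampled sum up by the fraction of partitions scanned is an assumption about how a sampled result might be turned into an estimate.

```python
import math
import random
from dataclasses import dataclass, replace
from typing import Dict, List


@dataclass(frozen=True)
class QueryResult:
    estimate: float            # aggregated (and scaled) result
    partitions_scanned: int    # simple accuracy information for the user
    partitions_total: int
    from_cache: bool = False   # True when a speculation result was reused


def choose_partitions(total: int, sample_pct: float, seed: int = 0) -> List[int]:
    """Pick which partition indices to scan for a sampling percentage."""
    if not 0 < sample_pct <= 100:
        raise ValueError("sample_pct must be in (0, 100]")
    count = max(1, math.ceil(total * sample_pct / 100))
    return sorted(random.Random(seed).sample(range(total), count))


def worker_sum(partition: List[float]) -> float:
    """Stand-in for one slave application computing a partial result."""
    return sum(partition)


class Master:
    """Hypothetical master: picks partitions, aggregates partials, caches."""

    def __init__(self, partitions: List[List[float]]) -> None:
        self.partitions = partitions
        self._cache: Dict[float, QueryResult] = {}

    def _run(self, sample_pct: float) -> QueryResult:
        chosen = choose_partitions(len(self.partitions), sample_pct)
        partials = [worker_sum(self.partitions[i]) for i in chosen]
        # Assumption: extrapolate from the sampled partitions to the table.
        scale = len(self.partitions) / len(chosen)
        return QueryResult(sum(partials) * scale, len(chosen), len(self.partitions))

    def speculate(self, sample_pct: float) -> None:
        """Run a query ahead of time and keep its result for later reuse."""
        self._cache[sample_pct] = self._run(sample_pct)

    def query_sum(self, sample_pct: float) -> QueryResult:
        """Answer a real-time query, reusing a speculation result if cached."""
        cached = self._cache.get(sample_pct)
        if cached is not None:
            return replace(cached, from_cache=True)
        return self._run(sample_pct)
```

In this sketch, a lower sampling percentage means fewer partitions are scanned, and the partitions_scanned and partitions_total fields give the caller a rough sense of how much of the table stands behind the estimate. A call to speculate between real-time queries fills the cache so that a later query_sum at the same percentage returns without scanning anything.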
For more information, see this patent application: Thorne, Victor; Liaw, Mac; Zender, Nathan. System and Methods for Processing Large Scale Data. Filed
Keywords for this news article include:
Our reports deliver fact-based news of research and discoveries from around the world. Copyright 2018, NewsRx LLC