Patent Issued for Method of controlling for undesired factors in machine learning models (USPTO 11676217): State Farm Mutual Automobile Insurance Company
2023 JUL 05 (NewsRx) -- By a
Patent number 11676217 is assigned to State Farm Mutual Automobile Insurance Company.
The following quote was obtained by the news editors from the background information supplied by the inventors: “Machine learning models may be trained to analyze information for particular purposes involving identifying correlations and making predictions. During training, the models may learn to include illegitimate, non-useful, irrelevant, misleading, or otherwise undesired factors, especially if such biases are present in the training data sets. In particular, while training with structured data involves limiting the data that a model considers, training with unstructured data allows the model to consider all available data, including background information and other undesired factors. For example, a neural network trained with unstructured data including people’s appearances to make correlations and predictions about those people may consider such undesired factors as age, sex, ethnicity, and/or race in its subsequent analyses.”
In addition to the background information obtained for this patent, NewsRx journalists also obtained the inventors’ summary information for this patent: “Embodiments of the present technology relate to machine learning models that control for consideration of one or more undesired factors which might otherwise be considered by the machine learning model when analyzing new data. For example, one embodiment of the present invention may be configured for training and using a neural network that controls for consideration of one or more undesired factors which might otherwise be considered by the neural network when analyzing new data as part of an underwriting process to determine an appropriate insurance premium.
“In a first aspect, a method of training and using a machine learning model that controls for consideration of one or more undesired factors which might otherwise be considered by the machine learning model may broadly comprise the following. The machine learning model may be trained using a training data set that contains information including the undesired factors. The undesired factors and one or more relevant interaction terms between the undesired factors may be identified. The machine learning model may then be caused to not consider the identified undesired factors when analyzing the new data to control for undesired prejudice or discrimination in machine learning models.
“In a second aspect, a computer-implemented method for training and using a machine learning model to evaluate an insurance applicant as part of an underwriting process to determine an appropriate insurance premium, wherein the machine learning model controls for consideration of one or more undesired factors which might otherwise be considered by the machine learning model, may broadly comprise the following. The machine learning model may be trained to probabilistically correlate an aspect of appearance with a personal and/or health-related characteristic by providing the machine learning model with a training data set of images of individuals having known personal or health-related characteristics, including the undesired factors. The undesired factors and one or more relevant interaction terms between the undesired factors may be identified. An image of the insurance applicant may be received via a communication element. The machine learning model may analyze the image of the insurance applicant to probabilistically determine the personal and/or health-related characteristics for the insurance applicant, wherein such analysis excludes the identified undesired factors. The machine learning model may then suggest the appropriate insurance premium based at least in part on the probabilistically determined personal and/or health-related characteristic but not on the undesired factors.
“Various implementations of these aspects may include any one or more of the following additional features. Identifying the undesired factors and relevant interaction terms may include training a second machine learning model using a second training data set that contains only the undesired factors and the relevant interaction terms. Further, causing the machine learning model to not consider the identified undesired factors when analyzing the new data may include combining the machine learning model and the second machine learning model to eliminate a bias created by the undesired factors from the machine learning model’s consideration prior to employing the machine learning model to analyze the new data. Alternatively or additionally, identifying the undesired factors and relevant interaction terms may include training the machine learning model to identify the undesired factors and the one or more relevant interaction terms. Further, causing the machine learning model to not consider the identified undesired factors when analyzing the new data may include instructing the machine learning model to not consider the identified undesired factors while analyzing the new data. The machine learning model may be a neural network. The second machine learning model may be a linear model. The machine learning model may be trained to analyze the new data as part of an underwriting process to determine an appropriate insurance premium, and the new data may include images of a person applying for life insurance or health insurance or images of a piece of property for which a person is applying for property insurance. The machine learning model may be further trained to analyze the new data as part of the underwriting process to determine one or more appropriate terms of coverage.
“Advantages of these and other embodiments will become more apparent to those skilled in the art from the following description of the exemplary embodiments which have been shown and described by way of illustration. As will be realized, the present embodiments described herein may be capable of other and different embodiments, and their details are capable of modification in various respects. Accordingly, the drawings and description are to be regarded as illustrative in nature and not as restrictive.”
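The summary does not say how the primary model and the second, linear model are combined. One possible reading, offered here only as a minimal sketch and not as the inventors' method, is residualization: fit a linear model on the undesired factors and their pairwise interaction terms, then subtract its fitted contribution from the primary model's scores. The function names (design_matrix, remove_undesired_component) and the synthetic data are hypothetical and chosen for illustration.

```python
import numpy as np


def design_matrix(undesired):
    """Build columns: intercept, each undesired factor, all pairwise interaction terms."""
    n, k = undesired.shape
    cols = [np.ones(n)]
    cols += [undesired[:, i] for i in range(k)]
    cols += [undesired[:, i] * undesired[:, j]
             for i in range(k) for j in range(i + 1, k)]
    return np.column_stack(cols)


def remove_undesired_component(scores, undesired):
    """Assumed combination step: subtract the part of the primary model's
    scores that a linear 'second model' can explain from the undesired
    factors and their interactions, keeping the original mean score."""
    X = design_matrix(undesired)
    coef, *_ = np.linalg.lstsq(X, scores, rcond=None)
    fitted = X @ coef
    return scores - fitted + scores.mean()


# Synthetic illustration: scores leak two undesired factors and their interaction.
rng = np.random.default_rng(0)
undesired = rng.normal(size=(500, 2))
signal = rng.normal(size=500)
scores = (signal
          + 2.0 * undesired[:, 0]
          - 1.0 * undesired[:, 1]
          + 0.5 * undesired[:, 0] * undesired[:, 1])

adjusted = remove_undesired_component(scores, undesired)
```

In this sketch the adjusted scores are linearly uncorrelated with each undesired factor and with the interaction term on the fitting data. It removes only linear dependence, so nonlinear leakage could remain.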
The claims supplied by the inventors are:
“1. A computer-implemented method for training and using a neural network to evaluate an insurance applicant as part of an underwriting process to determine an appropriate insurance premium, wherein the neural network controls for consideration of one or more undesired factors which might otherwise be considered by the neural network, the computer-implemented method comprising, via one or more processors: training the neural network to probabilistically correlate an aspect of appearance with a personal and/or health-related characteristic by providing the neural network with a training data set of images of individuals having known personal and/or health-related characteristics, including the one or more undesired factors; identifying the one or more undesired factors; identifying one or more relevant interaction terms between the one or more undesired factors; receiving via a communication element an image of the insurance applicant; analyzing with the neural network the image of the insurance applicant to probabilistically determine the personal and/or health-related characteristics for the insurance applicant, wherein such analysis excludes the identified one or more undesired factors; and suggesting with the neural network the appropriate insurance premium based at least in part on the probabilistically determined personal and/or health-related characteristics but not on the one or more undesired factors to control for undesired prejudice or discrimination in neural networks.
“2. The computer-implemented method as set forth in claim 1, wherein identifying the one or more undesired factors includes training a second neural network using a second training data set that contains only the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors.
“3. The computer-implemented method as set forth in claim 2, wherein causing the neural network to exclude the identified one or more undesired factors when analyzing the image includes combining the neural network and the second neural network to eliminate a bias created by the one or more undesired factors from the neural network’s consideration prior to employing the neural network to analyze the image.
“4. The computer-implemented method as set forth in claim 1, wherein causing the neural network to exclude the identified one or more undesired factors when analyzing the image includes training the neural network to identify the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors.
“5. The computer-implemented method as set forth in claim 4, wherein causing the neural network to exclude the identified one or more undesired factors when analyzing the image includes instructing the neural network to not consider the identified one or more undesired factors while analyzing the image.
“6. The computer-implemented method as set forth in claim 1, wherein the image of the insurance applicant is a selfie image taken with a smartphone and transmitted via a wireless communications network.
“7. A computer system configured to train and use a neural network to evaluate an insurance applicant as part of an underwriting process to determine an appropriate insurance premium, wherein the neural network controls for consideration of one or more undesired factors which might otherwise be considered by the neural network, the computer system comprising one or more processors configured to: train the neural network to probabilistically correlate an aspect of appearance with a personal and/or health-related characteristic by providing the neural network with a training data set of images of individuals having known personal and/or health-related characteristics, including the one or more undesired factors; identify the one or more undesired factors; identify one or more relevant interaction terms between the one or more undesired factors; receive via a communication element an image of the insurance applicant; analyze with the neural network the image of the insurance applicant to probabilistically determine the personal and/or health-related characteristics for the insurance applicant, wherein such analysis excludes the identified one or more undesired factors; and suggest or recommend with the neural network the appropriate insurance premium based at least in part on the probabilistically determined personal and/or health-related characteristics but not on the one or more undesired factors to control for undesired prejudice or discrimination in machine learning models.
“8. The computer system as set forth in claim 7, wherein identifying the one or more undesired factors includes the one or more processors training a second neural network using a second training data set that contains only the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors.
“9. The computer system as set forth in claim 8, wherein causing the neural network to exclude the identified one or more undesired factors when analyzing the image includes the one or more processors combining the neural network and the second neural network to eliminate a bias created by the one or more undesired factors from the neural network’s consideration prior to employing the neural network to analyze the image.
“10. The computer system as set forth in claim 7, wherein causing the neural network to exclude the identified one or more undesired factors when analyzing the image includes the one or more processors training the neural network to identify the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors.
“11. The computer system as set forth in claim 7, wherein causing the neural network to exclude the identified one or more undesired factors when analyzing the image includes the one or more processors instructing the neural network to not consider the identified one or more undesired factors while analyzing the image.
“12. The computer system as set forth in claim 7, wherein the image of the insurance applicant is a selfie image taken with a smartphone and transmitted via a wireless communications network.
“13. A computer-implemented method for training and using a neural network to evaluate an insurance applicant as part of an underwriting process to determine an appropriate insurance premium, wherein the neural network controls for consideration of one or more undesired factors which might otherwise be considered by the neural network, the computer-implemented method comprising, via one or more processors: training the neural network to probabilistically correlate an aspect of appearance with a personal and/or health-related characteristic by providing the neural network with a training data set of images of individuals having known personal and/or health-related characteristics, including the one or more undesired factors; identifying the one or more undesired factors; identifying one or more relevant interaction terms between the one or more undesired factors; receiving via a communication element a selfie image of the insurance applicant taken with a smartphone and transmitted via a wireless communications network; analyzing with the neural network the selfie image of the insurance applicant to probabilistically determine the personal and/or health-related characteristics for the insurance applicant, wherein such analysis excludes the identified one or more undesired factors; and suggesting with the neural network the appropriate insurance premium based at least in part on the probabilistically determined personal and/or health-related characteristics but not on the one or more undesired factors to control for undesired prejudice or discrimination in the neural network.
“14. The computer-implemented method as set forth in claim 13, wherein identifying the one or more undesired factors includes training a linear machine learning model using a second training data set that contains only the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors.
“15. The computer-implemented method as set forth in claim 13, wherein causing the neural network to exclude the identified one or more undesired factors when analyzing the selfie image includes combining the neural network and the linear machine learning model to eliminate a bias created by the one or more undesired factors from consideration by the neural network prior to employing the neural network to analyze the selfie image.
“16. The computer-implemented method as set forth in claim 13, wherein causing the neural network to exclude the identified one or more undesired factors when analyzing the selfie image includes instructing the neural network to not consider the identified one or more undesired factors while analyzing the selfie image.”
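The claims also describe an alternative in which the neural network itself identifies the undesired factors and is then instructed not to consider them, without stating a mechanism. As a hedged illustration only, one technique that could play this role is projecting image-derived feature vectors onto the subspace orthogonal to linear directions that predict the undesired factors. This swaps in a generic linear projection approach, not anything confirmed by the patent, and the names undesired_directions and project_out, along with the synthetic features, are hypothetical.

```python
import numpy as np


def undesired_directions(features, undesired):
    """Fit least-squares linear probes mapping features to the undesired factors.
    Returns a (d, k) matrix whose columns are the probe directions."""
    centered = features - features.mean(axis=0)
    targets = undesired - undesired.mean(axis=0)
    W, *_ = np.linalg.lstsq(centered, targets, rcond=None)
    return W


def project_out(features, W):
    """Remove the component of each feature vector lying in the span of W."""
    Q, _ = np.linalg.qr(W)  # orthonormal basis for the probe directions
    return features - (features @ Q) @ Q.T


# Synthetic stand-in for features a network might extract from images.
rng = np.random.default_rng(1)
features = rng.normal(size=(300, 8))
mixing = rng.normal(size=(8, 2))
undesired = features @ mixing + 0.1 * rng.normal(size=(300, 2))

W = undesired_directions(features, undesired)
cleaned = project_out(features, W)
```

After the projection, the fitted probes return zero on the cleaned features. A freshly trained probe might still recover some information, so a practical system would likely need to repeat or strengthen this step.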
URL and more information on this patent, see: Bernico,