Patent Issued for System and method to represent conversational flows as graph embeddings and to conduct classification and clustering based on such embeddings (USPTO 11797768): MuyVentive LLC
2023 NOV 13 (NewsRx) -- By a
The patent’s assignee for patent number 11797768 is
News editors obtained the following quote from the background information supplied by the inventors: “Conversational data is generated by interacting with natural language interfaces such as voice interfaces and chatbots. A designer of the natural language interface cannot easily determine how well a natural language interface will handle a conversation.
“Users of the natural language interfaces have meta data that characterize them, such as their location, age, gender, etc. Further, the natural language interface may collect other conversational meta data, such as an outcome variable for the conversation that characterizes and/or describes the conversation in some way. For example, the outcome variable may be one of: a categorical variable that describes whether the conversation was successful or not, based on some metric; a numerical variable indicative of a length of the conversation, such as a number of times the user interacted with the natural language interface; and an ordinal variable indicative of the user’s indicated satisfaction ranking of the conversation based on rating in the range of one (unsatisfied) to five (very satisfied).”
As a supplement to the background information on this patent, NewsRx correspondents also obtained the inventors’ summary information for this patent: “One aspect of the present embodiments includes the realization that there is a nascent but growing space called conversational analytics with a need for tools that facilitate development of natural language interfaces. The present embodiments solve this problem by providing a conversational analytics toolset that generates reports of summary statistics of popular intents and entities appearing in conversational transcripts and control flow diagrams that are generated to describe the conversations in a graphical representation. Advantageously, the conversational analytics toolset processes conversational data and generates summary statistics reports and graphical representations that allow the developer to see problems with the intents used by the natural language interface and learn how to adjust the intents to improve the quality of the natural language interface.
“Another aspect of the present embodiments includes the realization that a natural language interface could steer a current conversation towards a positive outcome if it knew that the current conversation was likely to have a negative outcome. The present embodiments solve this problem by using graph embedding to identify a previous conversation that is similar to the current conversation and then determining whether that previous conversation had a negative outcome. When the previous conversation had a negative outcome, the natural language interface may be controlled to steer the current conversation towards a more positive outcome.
“Another aspect of the present embodiments includes the realization that when sharing conversational datasets (potentially for research and development analysis purposes with other practitioners or researchers or to the public, for transparency purposes when possible) there are a few challenges that can arise: privacy and anonymization: textual content of conversations typically includes personally identifying information (PII), which is sensitive private information that a person or small group of people may not wish to disclose, including information that may lead to the person or group of people being identifiable, information about a nature of the conversation, and other sensitive information about the person or group. Advantageously, the embodiments described herein solve this problem by grouping such conversations and ensuring k-anonymity.
“Another aspect of the present embodiments includes the realization that there are increasing regulatory requirements for privacy and personal information disclosure. Existing regulations include the General Data Protection Regulation (GDPR), which is a European regulation implemented in 2018 to enhance EU citizens’ control over the personal data that companies can legally hold, Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule implemented on
“In one embodiment, a method develops a natural language interface. Conversational data including user utterances is received for a plurality of conversations from a natural language interface. Each of the conversations is classified to determine one or more intents for each of the user utterances, and, for each of the conversations, a control flow diagram showing the intents and sequential flow of the conversation is generated. Each of the control flow diagrams is processed to generate a graph embedding representative of the conversation.”
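The summary above can be illustrated with a minimal sketch. It is not the patented method: the summary does not say which graph embedding technique is used, so the sketch assumes a toy embedding that counts the transitions between consecutive intents, and it assumes cosine similarity for finding the most similar previous conversation. All function names, intent labels, and outcome labels are hypothetical.

```python
from collections import Counter
from math import sqrt


def transition_counts(intents):
    """Count directed edges between consecutive intents in one conversation.

    The resulting edge counts act as a simple control flow graph.
    """
    return Counter(zip(intents, intents[1:]))


def embed(intents, edge_vocab):
    """Toy graph embedding (an assumption): edge counts in a fixed edge order."""
    counts = transition_counts(intents)
    return [float(counts.get(edge, 0)) for edge in edge_vocab]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def most_similar(current, history, edge_vocab):
    """Return the (intents, outcome) pair in history closest to the current conversation."""
    target = embed(current, edge_vocab)
    return max(history, key=lambda item: cosine(target, embed(item[0], edge_vocab)))


# Hypothetical previous conversations with their outcome variables.
history = [
    (["greet", "ask_balance", "goodbye"], "positive"),
    (["greet", "fallback", "fallback", "goodbye"], "negative"),
]
edge_vocab = sorted({edge for intents, _ in history for edge in transition_counts(intents)})

# A conversation in progress that keeps hitting the fallback intent.
current = ["greet", "fallback", "fallback"]
match = most_similar(current, history, edge_vocab)
if match[1] == "negative":
    print("Similar past conversation ended badly; consider steering this one.")
```

In this sketch the in-progress conversation shares its transitions only with the second stored conversation, so the lookup returns the one with the negative outcome. A production system would likely use a learned graph embedding in place of raw edge counts.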
The claims supplied by the inventors are:
“1. A method for ensuring k-anonymity in shared conversation datasets, comprising: generating graph embeddings for each of a plurality of conversations from conversational data for N different users; determining at least one cluster of the graph embeddings using a clustering algorithm; determining number K of points in the at least one cluster; sharing at least part of the conversational data corresponding to the at least one cluster when K is greater than or equal to N; extracting at least one representative conversation corresponding to at least one graph embedding within the at least one cluster to form the at least part of the conversational data; storing the at least one representative conversation in a cache for fast access; and storing the conversational data in secondary storage having slower access than the cache.
“2. The method of claim 1, the at least one graph embedding corresponding to a centroid of the at least one cluster.
“3. The method of claim 1, the clustering algorithm implementing one or both of k-means and k-medoids.
“4. A method for ensuring k-anonymity in shared conversation datasets, comprising: generating graph embeddings for each of a plurality of conversations from conversational data for N different users; determining at least one cluster of the graph embeddings using a clustering algorithm; determining number K of points in the at least one cluster; sharing at least part of the conversational data corresponding to the at least one cluster when K is greater than or equal to N; determining results by filtering the conversational data based on at least one of a metadata dimension and an outcome variable; and processing the results to generate filtered graph embeddings related to each of the at least one of a metadata dimension and an outcome variable.
“5. The method of claim 4, further comprising: clustering the filtered graph embeddings using the clustering algorithm; identifying clusters having fewer than a threshold value T of graph embeddings; and indicating that the identified clusters require collection of more conversational data corresponding to the at least one of the metadata dimension and/or the outcome variable.
“6. The method of claim 5, further comprising generating an alert to indicate that the corresponding types of conversation are under-represented in conversational data.
“7. The method of claim 1, further comprising displaying, for each cluster, a histogram indicative of at least one attribute of the cluster.
“8. The method of claim 7, the attribute being age of a person having the conversation.
“9. A method for efficient searching of conversations, comprising: generating graph embeddings for each of a plurality of conversations from conversational data for different users; determining at least one cluster of the graph embeddings using a clustering algorithm; determining a representative conversation of the at least one cluster, wherein the representative conversation removes repetitive information; storing the representative conversation in a cache; and searching the cache to find the representative conversation based on input parameters.
“10. A method for efficient searching of conversations, comprising: generating graph embeddings for each of a plurality of conversations from conversational data for different users; determining at least one cluster of the graph embeddings using a clustering algorithm; determining a representative conversation of the at least one cluster; storing the representative conversation in a cache; and searching the cache to find the representative conversation based on input parameters and to allow representative searches to be performed rapidly.”
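The cluster-size checks described in claims 1, 4 and 5 can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: it assumes the cluster labels were already produced by some clustering algorithm (claim 3 names k-means and k-medoids), and the function names and the `min_size` and `threshold_t` parameters are hypothetical. The claims compare the cluster point count K with N; the sketch leaves that comparison value to the caller.

```python
from collections import Counter


def shareable_clusters(labels, min_size):
    """Return cluster ids whose point count K is at least min_size.

    Only conversations from these clusters would be candidates for sharing.
    """
    sizes = Counter(labels)
    return sorted(cluster for cluster, k in sizes.items() if k >= min_size)


def underrepresented_clusters(labels, threshold_t):
    """Return cluster ids holding fewer than threshold_t graph embeddings.

    These clusters would be flagged as needing more conversational data.
    """
    sizes = Counter(labels)
    return sorted(cluster for cluster, k in sizes.items() if k < threshold_t)


# Hypothetical cluster labels, one per conversation embedding.
labels = [0, 0, 0, 1, 1, 2, 0, 1, 1]

print(shareable_clusters(labels, min_size=3))
print(underrepresented_clusters(labels, threshold_t=2))
```

With these labels, clusters 0 and 1 each hold four conversations and cluster 2 holds one, so only clusters 0 and 1 pass a minimum size of 3, and cluster 2 is flagged as under-represented for a threshold of 2.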
For additional information on this patent, see: Topol, Zvi. System and method to represent conversational flows as graph embeddings and to conduct classification and clustering based on such embeddings.
(Our reports deliver fact-based news of research and discoveries from around the world.)

