The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jan. 07, 2025
Filed:
Jun. 05, 2024
Roku, Inc., San Jose, CA (US);
Fei Xiao, San Jose, CA (US);
Amit Verma, Sunnyvale, CA (US);
Rohit Mahto, San Jose, CA (US);
Rameen Mahdavi, San Jose, CA (US);
Nam Vo, San Jose, CA (US);
Zidong Wang, San Jose, CA (US);
Lian Liu, Rancho Palos Verdes, CA (US);
Jose Sanchez, San Jose, CA (US);
Pulkit Aggarwal, San Jose, CA (US);
Atishay Jain, San Bruno, CA (US);
Abhishek Bambha, Burlingame, CA (US);
Ronica Jethwa, Mountain View, CA (US);
Roku, Inc., San Jose, CA (US);
Abstract
Disclosed herein are system, method and/or computer program product embodiments, and/or combinations thereof, for training a conversational recommendation system. An embodiment generates a probabilistic pseudo-user neural network model based on at least one interest probability distribution corresponding to a pseudo-user profile. The embodiment trains, using the pseudo-user neural network model, the conversational recommendation system to learn a recommendation policy, where the conversational recommendation system includes an interest-exploration engine and a prompt-decision engine. The training includes performing an iterative learning process that includes selecting an interest-exploration strategy based on one or more of the following: an interest-exploration policy, an earlier pseudo-user response generated by the pseudo-user neural network model, content data, and pseudo-user interaction history. The embodiment then generates, using the trained conversational recommendation system, a real-time recommendation having high play probability based on the minimal number of iterations of conversation between a user and the trained conversational recommendation system.