The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Apr. 29, 2025
Filed:
Oct. 31, 2023
Microsoft Technology Licensing, Llc, Redmond, WA (US);
Sayan Dev Pathak, Kirkland, WA (US);
Osama Abuelsorour, Menlo Park, CA (US);
Christopher Hakan Basoglu, Everett, WA (US);
Harini Kesavamoorthy, Bellevue, WA (US);
Girish Milind Mahajan, Redmond, WA (US);
Salman Mohammad Quazi, Mountain View, CA (US);
Valeriy Viktorovich Kirshin, Kirkland, WA (US);
Microsoft Technology Licensing, LLC, Redmond, WA (US);
Abstract
A technique partitions a user's original query into plural smaller component queries, each of which has a common part and an instance-specific part. The technique distributes the component queries to plural processor instances of a processor. The plural processor instances transform the respective component queries into query-component responses by acting in parallel, independent of each other. The technique generates a final response based on the query-component responses, e.g., by assembling the component-query responses into the final response. The technique reduces latency because the processor instances work on parts of the user's original query at the same time, rather than as a single stream of consecutive tokens. The plural processor instances have access to a shared cache memory, and utilize relevant data that has been computed in response to previous queries.