The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Nov. 07, 2023
Filed:
Apr. 12, 2021
Applicant:
Meta Platforms, Inc., Menlo Park, CA (US);
Inventors:
Kristen Lorraine Grauman, Austin, TX (US);
Senthil Purushwalkam Shiva Prakash, Pittsburgh, PA (US);
Sebastia Vicenc Amengual Gari, Redmond, WA (US);
Vamsi Krishna Ithapu, Kirkland, WA (US);
Carl Schissler, Redmond, WA (US);
Philip Robinson, Seattle, WA (US);
Abhinav Gupta, Pittsburgh, PA (US);
Assignee:
Meta Platforms, Inc., Menlo Park, CA (US);
Abstract
The present disclosure relates to systems, non-transitory computer-readable media, and methods for utilizing multiple modalities to generate accurate two-dimensional floorplans based on sparse digital videos depicting three-dimensional space. In particular, in one or more embodiments, the disclosed systems extract both visual and audio information from sparse digital video coverage of portions of a three-dimensional space and utilize the extracted visual and audio information to generate a two-dimensional floorplan representing both viewed and unviewed portions of the three-dimensional space. For example, the disclosed systems utilize self-attention layers of a specialized machine learning model to maintain and leverage bi-directional relationships among sequences of visual and audio features to generate floorplan predictions associated with the three-dimensional space. The disclosed systems then combine the predictions to generate the two-dimensional floorplan including a geometric layout and one or more semantic room labels.
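The bi-directional self-attention idea in the abstract can be illustrated with a minimal sketch. It is not the patented model: the abstract does not specify the architecture, so the identity query/key/value projections, the concatenation-based fusion, and the average-and-threshold combination step below are all assumptions made for illustration, and the function names are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(sequence):
    """Unmasked scaled dot-product self-attention with identity projections.

    Each position attends to every other position, earlier and later,
    which is what makes the attention bi-directional. A trained model
    would use learned query/key/value projections (assumption: omitted here).
    """
    dim = len(sequence[0])
    scale = math.sqrt(dim)
    output = []
    for query in sequence:
        scores = [sum(q * k for q, k in zip(query, key)) / scale
                  for key in sequence]
        weights = softmax(scores)
        mixed = [sum(w * value[i] for w, value in zip(weights, sequence))
                 for i in range(dim)]
        output.append(mixed)
    return output


def fuse_modalities(visual_features, audio_features):
    """Hypothetical fusion: concatenate both sequences, then attend jointly."""
    return self_attention(visual_features + audio_features)


def combine_predictions(occupancy_maps, threshold=0.5):
    """Hypothetical combination: average per-step maps, then binarise."""
    count = len(occupancy_maps)
    rows, cols = len(occupancy_maps[0]), len(occupancy_maps[0][0])
    return [[1 if sum(m[r][c] for m in occupancy_maps) / count >= threshold else 0
             for c in range(cols)]
            for r in range(rows)]


# Toy inputs: two visual feature vectors and one audio feature vector.
visual = [[1.0, 0.0], [0.0, 1.0]]
audio = [[0.5, 0.5]]
fused = fuse_modalities(visual, audio)

# Two per-step 1x2 occupancy predictions merged into one binary floorplan.
floorplan = combine_predictions([[[0.9, 0.2]], [[0.7, 0.4]]])
```

Because no causal mask is applied, each fused vector is a weighted mix of all visual and audio steps. This is one simple way to let features from viewed regions inform predictions about unviewed ones.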