The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jul. 08, 2025
Filed:
Jan. 17, 2025
Beijing University of Chemical Technology, Beijing, CN;
Zhiwei Li, Beijing, CN;
Tingzhen Zhang, Beijing, CN;
Haohan Wu, Beijing, CN;
Weizheng Zhang, Beijing, CN;
Weiye Xiao, Beijing, CN;
Kunfeng Wang, Beijing, CN;
Wei Zhang, Beijing, CN;
Tianyu Shen, Beijing, CN;
Li Wang, Beijing, CN;
Qifan Tan, Beijing, CN;
Beijing University of Chemical Technology, Beijing, CN;
Abstract
A multimodal perception decision-making method for autonomous driving based on a large language model includes: acquiring an RGB image and an infrared image of a target area at current time; processing the RGB image using a target detection model to obtain a predicted bounding box and a corresponding target detection category; processing the infrared image and the predicted bounding box and the corresponding target detection categories by using a segmentation model to obtain a target mask image; fusing the RGB image, the target mask image and the infrared image using a fusion model to obtain a fused feature map; performing fusion processing on first prompt information representing a user intent, second prompt information representing target detection category priorities, and the fused feature map, using a large Vision-Language Model to obtain textual information; and processing the textual information using a large natural language model to obtain a perception decision-making result.