Years Active: 2019-2023
Title: Innovations of Zhiheng Huang
Introduction
Zhiheng Huang is an inventor based in Sunnyvale, CA, known for his contributions to artificial intelligence and image processing. He holds five patents, which reflect his work on multimodal systems.
Latest Patents
One of his latest patents focuses on intelligent image captioning. The invention presents embodiments of a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. The model directly estimates the probability distribution of the next word given the previous words and an image. It consists of two sub-networks: a deep recurrent neural network for sentences and a deep convolutional network for images. These sub-networks interact in a multimodal layer to form the complete m-RNN model. The model was validated on four benchmark datasets, where it outperformed state-of-the-art methods. The m-RNN model can also be applied to retrieval tasks for images or captions.
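The description above can be illustrated with a minimal sketch of how a multimodal layer might combine a word embedding, a recurrent state, and an image feature vector into a next-word distribution. It is not the patented implementation: the vocabulary, the layer sizes, the random weights, and names such as `next_word_distribution` are assumptions made for illustration, a plain tanh recurrence stands in for the deep recurrent network, and a random vector stands in for the convolutional network's output.

```python
import math
import random

# Hypothetical toy vocabulary and layer sizes (not from the patent).
VOCAB = ["<start>", "a", "dog", "runs", "<end>"]
EMBED_DIM, HIDDEN_DIM, IMAGE_DIM, MULTI_DIM = 4, 5, 6, 7

rng = random.Random(0)  # fixed seed so the sketch is reproducible


def rand_matrix(rows, cols):
    """Return a rows x cols matrix of small random weights (untrained)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def add(*vectors):
    """Element-wise sum of equally sized vectors."""
    return [sum(values) for values in zip(*vectors)]


def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Randomly initialised parameters; a real model would learn these.
EMBED = rand_matrix(len(VOCAB), EMBED_DIM)    # one embedding row per word
W_IN = rand_matrix(HIDDEN_DIM, EMBED_DIM)     # word -> recurrent state
W_REC = rand_matrix(HIDDEN_DIM, HIDDEN_DIM)   # previous state -> state
V_WORD = rand_matrix(MULTI_DIM, EMBED_DIM)    # word -> multimodal layer
V_REC = rand_matrix(MULTI_DIM, HIDDEN_DIM)    # state -> multimodal layer
V_IMG = rand_matrix(MULTI_DIM, IMAGE_DIM)     # image -> multimodal layer
W_OUT = rand_matrix(len(VOCAB), MULTI_DIM)    # multimodal layer -> word scores


def next_word_distribution(previous_words, image_features):
    """Estimate P(next word | previous words, image) over VOCAB."""
    hidden = [0.0] * HIDDEN_DIM
    word_vec = [0.0] * EMBED_DIM
    for word in previous_words:
        word_vec = EMBED[VOCAB.index(word)]
        hidden = [math.tanh(x)
                  for x in add(matvec(W_IN, word_vec), matvec(W_REC, hidden))]
    # Multimodal layer: project the three inputs into one space and sum them.
    multimodal = [math.tanh(x) for x in add(matvec(V_WORD, word_vec),
                                            matvec(V_REC, hidden),
                                            matvec(V_IMG, image_features))]
    return softmax(matvec(W_OUT, multimodal))


# Stand-in for the feature vector a convolutional network would produce.
image_features = [rng.uniform(0, 1) for _ in range(IMAGE_DIM)]
probs = next_word_distribution(["<start>", "a"], image_features)
for word, p in zip(VOCAB, probs):
    print(f"{word}: {p:.3f}")
```

Because the weights are untrained, the printed probabilities carry no meaning; the sketch only shows the data flow from words and image to a distribution over the vocabulary. Sampling from or taking the maximum of that distribution, one word at a time, would yield a caption.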
Another significant patent relates to multilingual image question answering. The invention presents a multimodal question answering (mQA) system that answers questions about the content of images. The model comprises four components: a Long Short-Term Memory (LSTM) component for question representation, a Convolutional Neural Network (CNN) for visual representation, another LSTM for storing the linguistic context of an answer, and a fusing component that combines information from the first three components to generate the answer. A Freestyle Multilingual Image Question Answering (FM-IQA) dataset was constructed to train and evaluate the mQA model, and human judges assessed the quality of the generated answers through a Turing Test.
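The four-component structure described above can be sketched as a data flow. This is an illustration under stated assumptions, not the patented system: `ToyRecurrentEncoder` is a plain tanh recurrence standing in for each LSTM, a random vector stands in for the CNN output, the vocabularies and sizes are invented, and greedy word selection is one possible decoding choice.

```python
import math
import random

rng = random.Random(1)  # fixed seed so the sketch is reproducible
DIM = 4                 # hypothetical size shared by all representations
QUESTION_VOCAB = ["what", "is", "the", "dog", "holding", "?"]
ANSWER_VOCAB = ["<start>", "a", "red", "ball", "<end>"]


def rand_matrix(rows, cols):
    """Return a rows x cols matrix of small random weights (untrained)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class ToyRecurrentEncoder:
    """Plain tanh recurrence used here as a simplified stand-in for an LSTM."""

    def __init__(self, vocab):
        self.vocab = vocab
        self.embed = rand_matrix(len(vocab), DIM)
        self.w_in = rand_matrix(DIM, DIM)
        self.w_rec = rand_matrix(DIM, DIM)

    def encode(self, words):
        """Fold a word sequence into a single state vector."""
        state = [0.0] * DIM
        for word in words:
            x = self.embed[self.vocab.index(word)]
            from_input = matvec(self.w_in, x)
            from_state = matvec(self.w_rec, state)
            state = [math.tanh(a + b) for a, b in zip(from_input, from_state)]
        return state


question_encoder = ToyRecurrentEncoder(QUESTION_VOCAB)     # component 1
image_features = [rng.uniform(0, 1) for _ in range(DIM)]   # component 2 (CNN stand-in)
answer_encoder = ToyRecurrentEncoder(ANSWER_VOCAB)         # component 3
FUSE = [rand_matrix(DIM, DIM) for _ in range(3)]           # component 4 weights
W_OUT = rand_matrix(len(ANSWER_VOCAB), DIM)


def next_answer_word(question, image, answer_so_far):
    """Fuse the three representations and pick the highest-scoring word."""
    parts = [question_encoder.encode(question),
             image,
             answer_encoder.encode(answer_so_far)]
    projected = [matvec(m, p) for m, p in zip(FUSE, parts)]
    fused = [math.tanh(sum(values)) for values in zip(*projected)]
    scores = matvec(W_OUT, fused)
    return ANSWER_VOCAB[scores.index(max(scores))]


def answer(question, image, max_words=5):
    """Generate an answer word by word until <end> or the length limit."""
    words = ["<start>"]
    while len(words) <= max_words:
        word = next_answer_word(question, image, words)
        if word == "<end>":
            break
        words.append(word)
    return words[1:]  # drop the <start> marker


generated = answer(["what", "is", "the", "dog", "holding", "?"], image_features)
print(generated)
```

With untrained weights the generated words are arbitrary. The point of the sketch is the wiring: the question, the image, and the answer generated so far are each encoded separately, and only the fusing step sees all three.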
Career Highlights
Zhiheng Huang is currently employed at Baidu USA LLC, where he continues to develop innovative technologies in artificial intelligence. His work has significantly impacted the way machines understand and interpret visual data.
Collaborations
He has collaborated with colleagues including Wei Xu and Jiang Wang.
Conclusion
Zhiheng Huang's innovative patents in intelligent image captioning and multilingual image question answering highlight his significant contributions to artificial intelligence. His work continues to influence the development of advanced multimodal systems.