18466773. POINT CLOUD SEARCH USING MULTI-MODAL EMBEDDINGS (GM Cruise Holdings LLC)
Organization Name
GM Cruise Holdings LLC
Inventor(s)
Carden Bagwell of San Francisco CA (US)
This abstract first appeared for US patent application 18466773 titled 'POINT CLOUD SEARCH USING MULTI-MODAL EMBEDDINGS'.
Original Abstract Submitted
Aspects of the disclosed technology provide solutions for searching point cloud data, such as Light Detection and Ranging (LiDAR) data, and in particular for using multi-modal embeddings to search for objects within a LiDAR data set. A process of the disclosed technology can include steps for receiving road data, wherein the road data represents a real-world environment encountered by an autonomous vehicle (AV) and comprises point cloud data representing a plurality of objects, and generating, for each of the plurality of objects, a corresponding set of first embeddings. The process can further include steps for receiving a text string corresponding to a searched object, generating a second embedding corresponding to the searched object, and identifying a matching object among the plurality of objects based on a comparison of the set of first embeddings and the second embedding. Systems and machine-readable media are also provided.
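The comparison step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how embeddings are produced or compared, so everything here is an assumption made for illustration: the embeddings are hand-written toy vectors standing in for the output of some point cloud encoder and text encoder that share an embedding space, cosine similarity is used as the comparison, and an object's score is taken as the best similarity across its set of embeddings. The names `cosine_similarity`, `find_matching_object`, and the object identifiers are hypothetical.

```python
import math
from typing import Dict, List, Optional, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_matching_object(
    object_embeddings: Dict[str, List[Vector]],
    query_embedding: Vector,
) -> Optional[str]:
    """Return the id of the object whose embedding set best matches the query.

    Assumption: an object's score is the maximum cosine similarity between
    the query embedding and any embedding in that object's set.
    """
    best_id: Optional[str] = None
    best_score = -math.inf
    for object_id, embeddings in object_embeddings.items():
        # Objects with an empty embedding set can never match.
        score = max(
            (cosine_similarity(e, query_embedding) for e in embeddings),
            default=-math.inf,
        )
        if score > best_score:
            best_id, best_score = object_id, score
    return best_id


# Toy "first embeddings": one set per object detected in the point cloud.
# Real values would come from a point cloud encoder (not shown, hypothetical).
object_embeddings = {
    "object_0": [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]],
    "object_1": [[0.0, 0.9, 0.3]],
    "object_2": [[0.1, 0.0, 0.95]],
}

# Toy "second embedding": stands in for the encoded search text string.
query_embedding = [0.05, 0.85, 0.35]

match = find_matching_object(object_embeddings, query_embedding)
print(match)
```

With these toy vectors the query lies closest to the single embedding of `object_1`, so that identifier is returned. A production system would more likely use a vectorized or approximate nearest-neighbor search over many objects; the linear scan above is only meant to make the comparison concrete.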