US Patent Application 18233657. UNSTRUCTURED TEXT CLASSIFICATION simplified abstract
UNSTRUCTURED TEXT CLASSIFICATION
Organization Name
Microsoft Technology Licensing, LLC
Inventor(s)
- Arunkumar Gururajan of Redmond WA (US)
- Jack Wilson Stokes, III of Redmond WA (US)
- Farid Tajaddodianfar of Redmond WA (US)
UNSTRUCTURED TEXT CLASSIFICATION - A simplified explanation of the abstract
This abstract first appeared for US patent application 18233657, titled 'UNSTRUCTURED TEXT CLASSIFICATION'.
Simplified Explanation
- The patent application describes a classifier that identifies malicious URLs quickly and accurately.
- The technology is designed for use as a real-time URL security analysis tool.
- It processes a URL quickly and issues a warning when the URL is identified as malicious.
- The fast processing speed comes, in part, from using a single input signal: the URL itself.
- The high accuracy comes from analyzing the unstructured text of the URL at two levels, character by character and word by word, so that both character-level and word-level information from the incoming URL is used.
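As a rough illustration of what "character-level" and "word-level" views of a URL could look like, the following Python sketch derives both from a single URL string. The abstract does not specify a tokenization scheme or a model, so the function names (char_tokens, word_tokens, url_views) and the rule of splitting words on non-alphanumeric characters are assumptions made purely for illustration, not the method claimed in the application.

```python
import re
from typing import Dict, List


def char_tokens(url: str) -> List[str]:
    """Character-level view: every character of the URL, in order."""
    return list(url)


def word_tokens(url: str) -> List[str]:
    """Word-level view: runs of letters and digits.

    Splitting on non-alphanumeric characters is an assumed rule for this
    sketch; the abstract does not say how words are delimited.
    """
    return [tok for tok in re.split(r"[^A-Za-z0-9]+", url) if tok]


def url_views(url: str) -> Dict[str, List[str]]:
    """Return both views of the one input signal, the URL itself."""
    return {"chars": char_tokens(url), "words": word_tokens(url)}


if __name__ == "__main__":
    # Hypothetical example URL, not taken from the application.
    views = url_views("http://login.example.com/verify?id=42")
    print(views["words"])
    print(len(views["chars"]))
```

In a classifier of the kind the abstract describes, both token sequences would presumably be fed to a model that combines them. That step is omitted here because the abstract gives no detail about it.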
Original Abstract Submitted
The technology described herein identifies malicious URLs using a classifier that is both accurate and fast. Aspects of the technology are particularly well adapted for use as a real-time URL security analysis tool because the technology is able to quickly process a URL and produce a warning when a malicious URL is identified. The rapid processing speed of the technology described herein is produced, in part, by use of only a single input signal, which is the URL itself. The high accuracy produced by the technology described herein is achieved by analyzing the unstructured text on both a character-by-character level and a word-by-word level. The technology described herein uses both character-level and word-level information from the incoming URL.