US Patent Application 18233657. UNSTRUCTURED TEXT CLASSIFICATION simplified abstract

From WikiPatents
Jump to navigation Jump to search

UNSTRUCTURED TEXT CLASSIFICATION

Organization Name

Microsoft Technology Licensing, LLC==Inventor(s)==

[[Category:Arunkumar Gururajan of Redmond WA (US)]]

[[Category:Jack Wilson Stokes, Iii of Redmond WA (US)]]

[[Category:Farid Tajaddodianfar of Redmond WA (US)]]

UNSTRUCTURED TEXT CLASSIFICATION - A simplified explanation of the abstract

This abstract first appeared for US patent application 18233657 titled 'UNSTRUCTURED TEXT CLASSIFICATION

Simplified Explanation

- The patent application describes a technology that can quickly and accurately identify malicious URLs. - The technology is designed to be used as a real-time URL security analysis tool. - It can process a URL quickly and issue a warning if it identifies a malicious URL. - The technology achieves fast processing speed by using only the URL as input. - It achieves high accuracy by analyzing the unstructured text of the URL on both a character-by-character and word-by-word level. - The technology utilizes both character-level and word-level information from the incoming URL.


Original Abstract Submitted

The technology described herein identifies malicious URLs using a classifier that is both accurate and fast. Aspects of the technology are particularly well adapted for use as a real-time URL security analysis tool because the technology is able to quickly process a URL and produce a warning when a malicious URL is identified. The rapid processing speed of the technology described herein is produced, in part, by use of only a single input signal, which is the URL itself. The high accuracy produced by the technology described herein is achieved by analyzing the unstructured text on both a character-by-character level and a word-by-word level. The technology described herein uses both character-level and word-level information from the incoming URL.