18209273. AUTOMATED DATA QUALITY DETECTION FOR UNSTRUCTURED DATA (DISH Wireless L.L.C.)

From WikiPatents
Jump to navigation Jump to search

AUTOMATED DATA QUALITY DETECTION FOR UNSTRUCTURED DATA

Organization Name

DISH Wireless L.L.C.

Inventor(s)

Darshit Gandhi of Centennial CO (US)

Sindhu Chowdary Chirumamilla of Englewood CO (US)

AUTOMATED DATA QUALITY DETECTION FOR UNSTRUCTURED DATA

This abstract first appeared for US patent application 18209273 titled 'AUTOMATED DATA QUALITY DETECTION FOR UNSTRUCTURED DATA



Original Abstract Submitted

This disclosure relates to assessment of data quality for unstructured data. In some aspects, a method includes obtaining, by one or more computing devices, metadata of multiple data files; analyzing a graph database representative of the multiple data files and generated using the metadata, to identify unstructured data included in one or more data files, the graph database representing features of the multiple data files, and relationships among the features of the multiple data files; obtaining a set of customized rules for the unstructured data based on context of the unstructured data; determining that the unstructured data fails to satisfy the set of customized rules; and in response to determining that the unstructured data fails to satisfy the set of customized rules, modifying the unstructured data to satisfy the set of customized rules.