Patent No. US11107098 (titled "System And Method For Content Recognition And Data Categorization") was filed by Content Aware Llc on Apr 1, 2020.
’098 is related to the field of content recognition and data categorization. The patent addresses the problem of commercial entities and consumers only leveraging shared content at face value, missing opportunities to extract and structure useful data from it. Current extraction tools capture only minimal data, leaving a vast amount of untapped information within the content. The goal is to provide a system that can discern, catalog, and categorize content in real-time, analyzing demographics, goods, locations, and brands present in the content.
The underlying idea behind ’098 is to create a system that uses machine learning and AI to analyze captured content (photos, videos, audio, text) from various sources. This system identifies objects within the content, extracts relevant details, and catalogs them into a structured database. The system then uses this data to determine relationships between objects and user profiles, providing insights into brand correlation and consumer behavior. The core insight is to go beyond simple content sharing and use AI to mine the content for valuable marketing and business intelligence.
The claims of ’098 focus on a content recognition and data categorization system that includes one or more databases and a computing system. The system maintains a database of user profiles, each associated with a unique user and their selected brand identifiers. It collects captured content from third-party sources, identifies objects within the content using Optical Character Recognition (OCR) , and transforms the extracted data into a standardized format. The system then searches the database to identify potential associated brand identifiers based on user profiles and collected data, determines brand correlations, and presents this data to the user.
In practice, the system works by first gathering content from various sources, including social media platforms and direct uploads. The content processing module then analyzes this content, identifying objects in the foreground and background. This analysis involves techniques like OCR, facial recognition, and audio analysis. The identified objects are then cataloged with descriptors, creating a hierarchical structure that facilitates searching and analysis. The system then uses this structured data to identify relationships between users, brands, and content.
The differentiation from prior approaches lies in the system's ability to perform real-time analysis and correlation of data from diverse sources. Unlike traditional methods that rely on manual tagging or limited data extraction, this system uses AI and machine learning to automatically identify and categorize objects within content. This allows for a more comprehensive and nuanced understanding of consumer behavior and brand associations. The system also provides a platform for cross-promotion and targeted advertising based on the identified relationships.
In the late 2010s when ’098 was filed, content sharing via web-enabled devices was ubiquitous, at a time when applications commonly relied on cloud-based storage and APIs for accessing and processing multimedia data. Machine learning and artificial intelligence were increasingly used for data analysis, but extracting structured data from unstructured content remained a challenge. The ability to discern, catalog, and categorize content in real-time, while also analyzing demographics, location, and brands, was a non-trivial task.
The examiner allowed the claims because the applicant's amendments and arguments were persuasive in reciting a specific way of managing remote content providers to share content and other information in real-time in a standardized format, regardless of the content type. This was deemed a practical application in computer-related technology that overcame an Alice 101 rejection. The examiner also stated that the claims were allowable over the prior art of record after further searching.
This patent includes 20 claims, with claim 1 being the only independent claim. Independent claim 1 is directed to a content recognition and data categorization system. The dependent claims generally elaborate on and add limitations to the features and functionalities described in the independent claim, such as specific methods for determining brand identifiers, generating virtual maps, and integrating data from various sources.
Definitions of key terms used in the patent claims.

The dossier documents provide a comprehensive record of the patent's prosecution history - including filings, correspondence, and decisions made by patent offices - and are crucial for understanding the patent's legal journey and any challenges it may have faced during examination.
Date
Description
Get instant alerts for new documents