System And Method For Content Recognition And Data Categorization

Patent No. US11107098 (titled "System And Method For Content Recognition And Data Categorization") was filed by Content Aware Llc on Apr 1, 2020.

What is this patent about?

’098 is related to the field of content recognition and data categorization. The patent addresses the problem of commercial entities and consumers only leveraging shared content at face value, missing opportunities to extract and structure useful data from it. Current extraction tools capture only minimal data, leaving a vast amount of untapped information within the content. The goal is to provide a system that can discern, catalog, and categorize content in real-time, analyzing demographics, goods, locations, and brands present in the content.

The underlying idea behind ’098 is to create a system that uses machine learning and AI to analyze captured content (photos, videos, audio, text) from various sources. This system identifies objects within the content, extracts relevant details, and catalogs them into a structured database. The system then uses this data to determine relationships between objects and user profiles, providing insights into brand correlation and consumer behavior. The core insight is to go beyond simple content sharing and use AI to mine the content for valuable marketing and business intelligence.

The claims of ’098 focus on a content recognition and data categorization system that includes one or more databases and a computing system. The system maintains a database of user profiles, each associated with a unique user and their selected brand identifiers. It collects captured content from third-party sources, identifies objects within the content using Optical Character Recognition (OCR) , and transforms the extracted data into a standardized format. The system then searches the database to identify potential associated brand identifiers based on user profiles and collected data, determines brand correlations, and presents this data to the user.

In practice, the system works by first gathering content from various sources, including social media platforms and direct uploads. The content processing module then analyzes this content, identifying objects in the foreground and background. This analysis involves techniques like OCR, facial recognition, and audio analysis. The identified objects are then cataloged with descriptors, creating a hierarchical structure that facilitates searching and analysis. The system then uses this structured data to identify relationships between users, brands, and content.

The differentiation from prior approaches lies in the system's ability to perform real-time analysis and correlation of data from diverse sources. Unlike traditional methods that rely on manual tagging or limited data extraction, this system uses AI and machine learning to automatically identify and categorize objects within content. This allows for a more comprehensive and nuanced understanding of consumer behavior and brand associations. The system also provides a platform for cross-promotion and targeted advertising based on the identified relationships.

How does this patent fit in bigger picture?

Technical landscape at the time

In the late 2010s when ’098 was filed, content sharing via web-enabled devices was ubiquitous, at a time when applications commonly relied on cloud-based storage and APIs for accessing and processing multimedia data. Machine learning and artificial intelligence were increasingly used for data analysis, but extracting structured data from unstructured content remained a challenge. The ability to discern, catalog, and categorize content in real-time, while also analyzing demographics, location, and brands, was a non-trivial task.

Novelty and Inventive Step

The examiner allowed the claims because the applicant's amendments and arguments were persuasive in reciting a specific way of managing remote content providers to share content and other information in real-time in a standardized format, regardless of the content type. This was deemed a practical application in computer-related technology that overcame an Alice 101 rejection. The examiner also stated that the claims were allowable over the prior art of record after further searching.

Claims

This patent includes 20 claims, with claim 1 being the only independent claim. Independent claim 1 is directed to a content recognition and data categorization system. The dependent claims generally elaborate on and add limitations to the features and functionalities described in the independent claim, such as specific methods for determining brand identifiers, generating virtual maps, and integrating data from various sources.

Key Claim Terms New

Definitions of key terms used in the patent claims.

Term (Source)Support for SpecificationInterpretation
Brand correlation
(Claim 1)
“The system in the invention provides a free service or subscription based services, whereby subscription based members who acquire a subscription will have the ability to decentralize data and determine performance in all markets because social capturing is at a global scale. Members will have access to complimentary goods of their products or other existing products to see overlapping purchases as well as customer demographics within a particular zip code, based on past purchases, shares, consumed content, interests of their current customer base, or other products related to the business the consumers already own.”A determination of the relationship between the at least one brand identifier and the one more potential associated brand identifiers to assist the first unique user in determining additional brands that have a consumer overlap.
Brand identifier
(Claim 1)
“A brand or logo may include, but is not limited to a trademark, animation, text, movies, movie clip, movie still, TV shows, books, musical bands or genres, celebrities, historical or religious figures, geographic locations, colors, patterns, occupations, hobbies or any other thing that can be associated with some demographic information such as a sports team.”Information representing a brand that has been selected to be associated with a unique user on the content recognition and data categorization system. This information is stored in the user's profile.
Captured content
(Claim 1)
“Content may include multimedia such as photographs, text, links, and locations. Content may be geared to capture and share memories of (or live) events, activities, and purchases. In relation to photo-sharing, most commercial and consumer users who capture and share content leverage their web-enabled devices including iPhone, iPad, Droid, Surface, Mac, PC, locks, action cameras, 360 cameras, fusion, home security systems, and webcams.”Content in different formats collected from third-party content provider computing device systems via content search queries or by receiving shared content.
Optical Character Recognition technology
(Claim 1)
“Through a cloud-based data warehouse, the invention may utilize in real-time various existing proprietary APIs, and micro services that discern background, text, audio, video, geo-tracking location, time stamping, and facial recognition details from each piece of content captured by the system.”Technology used to identify one or more objects associated with the captured content.

Patent Family

Patent Family

File Wrapper

The dossier documents provide a comprehensive record of the patent's prosecution history - including filings, correspondence, and decisions made by patent offices - and are crucial for understanding the patent's legal journey and any challenges it may have faced during examination.

  • Get instant alerts for new documents

US11107098

CONTENT AWARE LLC
Application Number
US16838021
Filing Date
Apr 1, 2020
Status
Granted
Expiry Date
Apr 1, 2040
External Links
Slate, USPTO, Google Patents