Frequent pattern-growth (FP-Growth) is the mining of pattern itemsets, subsequences, and substructures that appear frequently in a dataset.
A Frequent itemset
refers to the most common items bought together. A Subsequence
where items are bought by a customer is called a frequent sequential pattern.
A Substructure
refers to different structural forms, such as subgraphs and subtrees, that are combined with frequent structural patterns to analyze and find relations with different items of frequent itemsets.
Note: An example of frequent itemset mining is Market Basket Analysis.
To generate frequent pattern-growth, we do the following:
itemsets
.root
node. It is labeled null
.itemset
, we can create a child
node and expand it if the itemset
follows the root
node.root
node.itemset
until we reach the end of the itemsets
in the transaction.node-links
.Frequent Item Sets are:
{K,E,M,O:1},{K,E,O:1},{K,M:1}}
{{K,E,M:1},{K,E:2}}
{{K,E:2},{K:1}}
{{K:4}}