Clustbigfim: Mapreduce Cf For Big Data Itemset Mining

KOUSALYADEVI S; ISHWARYA L

Volume 12, Issue 4 (April 2026)

Clustbigfim: Mapreduce Cf For Big Data Itemset Mining

Impact Factor

7.883

Call For Paper

Volume 12 Issue 07

July 2026

Download Paper Format

Copyright Form

Under License Of

Creative Commons Attribution-NonCommercial
4.0 International License

Share on:

Author(s)

KOUSALYADEVI S ISHWARYA L

Abstract

Frequent Itemset Mining (FIM) Is Essential For Discovering Patterns In Large-scale Data, But Traditional Algorithms Struggle With Big Data Volumes Due To Scalability Issues. ClustBigFIM Introduces A Hybrid MapReduce-based Framework That Integrates Parallel K-means Clustering As Preprocessing To Partition Datasets Into Manageable Clusters, Followed By Modified BigFIM Employing Apriori And Eclat Algorithms For Efficient Extraction Of Frequent Itemsets. In The MapReduce Paradigm, The Map Phase Computes Distances And Assigns Itemsets To Clusters, While The Reduce Phase Aggregates Results And Generates Patterns Useful For Business Analytics Like Market Basket Analysis. Evaluated On Large Synthetic And Real-world Datasets, ClustBigFIM Achieves Superior Speedup, Scalability, And Execution Time Compared To Standalone BigFIM By Reducing Data Redundancy Through Clustering. This Approach Leverages Hadoop’s Fault-tolerant Processing To Handle Petabyte-scale Data, Enabling Robust FIM In Distributed Environments.

Keywords

Frequent Itemset Mining MapReduce K-means Clustering BigFIM Apriori Eclat Big Data Scalable Pattern Discovery.

Paper ID

IJSARTV12I4105057

Publication Date

April 18, 2026

Research Area

Computer Science And Engineering

Download Full Article

Clustbigfim: Mapreduce Cf For Big Data Itemset Mining

Impact Factor

Call For Paper

Volume 12 Issue 07

Download Paper Format

Copyright Form

Under License Of

Author(s)

Abstract

Keywords

Paper ID

Publication Date

Research Area

Submit Your Paper to IJSART

ISSN Number