Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Distributed Adaptive Model Rules for mining big data streams
KTH.
2015 (English)In: Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014, 2015, 345-353 p.Conference paper, Published paper (Refereed)
Abstract [en]

Decision rules are among the most expressive data mining models. We propose the first distributed streaming algorithm to learn decision rules for regression tasks. The algorithm is available in samoa (Scalable Advanced Massive Online Analysis), an open-source platform for mining big data streams. It uses a hybrid of vertical and horizontal parallelism to distribute Adaptive Model Rules (AMRules) on a cluster. The decision rules built by AMRules are comprehensible models, where the antecedent of a rule is a conjunction of conditions on the attribute values, and the consequent is a linear combination of the attributes. Our evaluation shows that this implementation is scalable in relation to CPU and memory consumption. On a small commodity Samza cluster of 9 nodes, it can handle a rate of more than 30000 instances per second, and achieve a speedup of up to 4.7x over the sequential version.

Place, publisher, year, edition, pages
2015. 345-353 p.
Keyword [en]
Algorithms, Data communication systems, Data mining, Adaptive modeling, Attribute values, Data mining models, Distributed streaming, Linear combinations, Memory consumption, On-line analysis, Open source platforms, Big data
National Category
Computer and Information Science
Identifiers
URN: urn:nbn:se:kth:diva-168334DOI: 10.1109/BigData.2014.7004251Scopus ID: 2-s2.0-84921726705ISBN: 9781479956654 (print)OAI: oai:DiVA.org:kth-168334DiVA: diva2:818192
Conference
2nd IEEE International Conference on Big Data, IEEE Big Data 2014, 27 October 2014 through 30 October 2014
Note

QC 20150608

Available from: 2015-06-08 Created: 2015-06-02 Last updated: 2015-06-08Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Vu, Anh Thu
By organisation
KTH
Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 30 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf