The Data Mining Application Based on WEKA: Geographical Original of Music

Yuchen WU

Abstract


In this article, we use Weka as our tool for data mining. In the first step, we retrieve the dataset from the UCI database. At the same time, we get the purpose of analysis. Then we classify the datasets and found class attributes. We classify the datasets into two attributes: latitude and longitude. The second step, we finish the data cleaning. The tools used for data cleansing are Microsoft Excel, Google Maps, and Weka. The next is aggregation and Skewed Data. Then the appropriate attributes of dataset are selected. The third step, it’s the experiment design. We choose three classifiers: Naive Bayes, J48 and IBk. The fourth step, we finally get the experimental results through the appropriate classifiers, and the results are summarized. The fifth step, to make ROC Curve. The sixth step, the analysis of final results. Of three aspects: classifier analysis, attribute analysis, noise analysis. In the seventh step, we get the final conclusion that IB1 is the most successful model for our dataset.


Keywords


Data mining; Weka; Naïve Bayes; J48; IBk; IB1; ROC curve

Full Text:

PDF

References


Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters, 27(8), 861-874. doi:10.1016/j.patrec.2005.10.010

Russell, S., & Norvig, P. (2003). Artificial intelligence: A modern approach (2nd ed.). Prentice Hall.

Swets, J. A. (1996). Signal detection theory and ROC analysis in psychology and diagnostics: Collected papers. Mahwah, NJ: Lawrence Erlbaum Associates.

Wu, S. (2013). A review on coarse warranty data and analysis. Reliability Engineering and System, 114, 1-11.




DOI: http://dx.doi.org/10.3968/n

Refbacks

  • There are currently no refbacks.


Copyright (c) 2017 Yuchen WU

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.


Share us to:   


Reminder

  • We are currently accepting submissions via email only.

    The registration and online submission functions have been disabled.

    Please send your manuscripts to mse@cscanada.net,or mse@cscanada.org  for consideration.

    We look forward to receiving your work.

 


We only use three mailboxes as follows to deal with issues about paper acceptance, payment and submission of electronic versions of our journals to databases:
caooc@hotmail.com; mse@cscanada.net; mse@cscanada.org

 Articles published in Management Science and Engineering are licensed under Creative Commons Attribution 4.0 (CC-BY).

 MANAGEMENT SCIENCE AND ENGINEERING Editorial Office

Address:1055 Rue Lucien-L'Allier, Unit #772, Montreal, QC H3G 3C4, Canada.

Telephone: 1-514-558 6138
Http://www.cscanada.net Http://www.cscanada.org

Copyright © 2010 Canadian Research & Development Centre of Sciences and Cultures